

SEQUENCE LISTING

<110> DIVERSA CORPORATION
      GRAY, Kevin
      ZHAO, Lishan
      CAYOUETTE, Michelle

<120> CELLULOLYTIC ENZYMES, NUCLEIC ACIDS ENCODING THEM
      AND METHODS FOR MAKING AND USING THEM

<130> 564462014241

<140> Not Yet Assigned
<141> Filed Concurrently Herewith

<150> US 60/772,786
<151> 2006-02-10

<160> 524

<170> PatentIn version 3.1

<210> 1
<211> 1374
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 1
atgtacaaac aactcgccct tgcttccctc tccctattcg ggcttgtaaa cgcccagcag     60

gtgggcactc agacgactga gaagcaccca gcgctgtcct ggaagacctg cactggcact    120

ggcggaaaaa gctgtacctc caagacggcc tcgattacca tcgatgctaa ctggcgatgg    180

gcccatgtta cgagcggata caccaactgc tacaccgaca acacctggaa ctccaccagc    240

tgcaaggatg gtgccacctg tgcgaagaac tgtgctatcg atggcgctga ctactctggc    300

acctacggta ttacgaccag ctccgatgct cttaccctca agttcgtcac caagggctcg    360

tactcgacca atatcggatc ccgcacctac ctcatggaca cagacagcaa gtaccagatg    420

ttcaacccca tcggcaagga gttcacgttc gatgtcgatg tttccaagct tccttgcggt    480

ttaaatggtg ctctgtactt cgtcgagatg gctgctgatg gtggtatggg caagggcaac    540

aacaaggctg gtgccaagta cggaactggc tactgcgatg cccaatgccc tcatgacgtg    600

aagtggatca acggtgcggc taattcggaa ggctgggagc catccagcaa tgataagaat    660

gccggaagcg gcaagtacgg cgcctgctgc ccggagatgg atatctggga ggccaactcc    720

atctctactg cctacacccc acacccctgc aagcagaacg gtatctttgc ctgcactggc    780

accgactgcg gtgatggcga caaccgatat ggcggcaact gcgacaaaga cggatgcgat    840

ttcaacagct accgcatggg tgtcaaggac ttctacggcc cgggaatgac ccttgacact    900

aacaagaaga tgaccgttgt gacccaattc atcggcagcg gcacatccct tactgaaatc    960

aagcgcttct acgtccagaa tggaaaggtt ttcaagaact cggcctctgc aatcgacggc   1020

gtcacgggta actccatcac ggacgacttc tgcgcagcgc aaaagaaggc ctttggcgac   1080

acttcctcct tcgctgatcg tggcggtctc aagggaatgg cctcctctct cgcaaagggt   1140

cacgtcttgg tcatgtcact gtgggacgac catgcggtta atatgctctg gctcgattcg   1200

acttatccca ccgataagga tgcctccact cccggtgtgg gtcgtggtac ttgtggaacg   1260

gactcaggca agccggaaga tgttgagagc aagtcgccgg atgcgcaggt tatctactcc   1320

aatattcgct ttggtcctat tggatctact ttcgatgagt ccgctgccgt ttaa         1374


<210> 2
<211> 457
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (20)...(453)
<223> Glycosyl hydrolase family 7

<400> 2
Met Tyr Lys Gln Leu Ala Leu Ala Ser Leu Ser Leu Phe Gly Leu Val
1               5                   10                  15      


Asn Ala Gln Gln Val Gly Thr Gln Thr Thr Glu Lys His Pro Ala Leu
            20                  25                  30          


Ser Trp Lys Thr Cys Thr Gly Thr Gly Gly Lys Ser Cys Thr Ser Lys
        35                  40                  45              


Thr Ala Ser Ile Thr Ile Asp Ala Asn Trp Arg Trp Ala His Val Thr
    50                  55                  60                  


Ser Gly Tyr Thr Asn Cys Tyr Thr Asp Asn Thr Trp Asn Ser Thr Ser
65                  70                  75                  80  


Cys Lys Asp Gly Ala Thr Cys Ala Lys Asn Cys Ala Ile Asp Gly Ala
                85                  90                  95      


Asp Tyr Ser Gly Thr Tyr Gly Ile Thr Thr Ser Ser Asp Ala Leu Thr
            100                 105                 110         


Leu Lys Phe Val Thr Lys Gly Ser Tyr Ser Thr Asn Ile Gly Ser Arg
        115                 120                 125             


Thr Tyr Leu Met Asp Thr Asp Ser Lys Tyr Gln Met Phe Asn Pro Ile
    130                 135                 140                 


Gly Lys Glu Phe Thr Phe Asp Val Asp Val Ser Lys Leu Pro Cys Gly
145                 150                 155                 160 


Leu Asn Gly Ala Leu Tyr Phe Val Glu Met Ala Ala Asp Gly Gly Met
                165                 170                 175     


Gly Lys Gly Asn Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys
            180                 185                 190         


Asp Ala Gln Cys Pro His Asp Val Lys Trp Ile Asn Gly Ala Ala Asn
        195                 200                 205             


Ser Glu Gly Trp Glu Pro Ser Ser Asn Asp Lys Asn Ala Gly Ser Gly
    210                 215                 220                 


Lys Tyr Gly Ala Cys Cys Pro Glu Met Asp Ile Trp Glu Ala Asn Ser
225                 230                 235                 240 


Ile Ser Thr Ala Tyr Thr Pro His Pro Cys Lys Gln Asn Gly Ile Phe
                245                 250                 255     


Ala Cys Thr Gly Thr Asp Cys Gly Asp Gly Asp Asn Arg Tyr Gly Gly
            260                 265                 270         


Asn Cys Asp Lys Asp Gly Cys Asp Phe Asn Ser Tyr Arg Met Gly Val
        275                 280                 285             


Lys Asp Phe Tyr Gly Pro Gly Met Thr Leu Asp Thr Asn Lys Lys Met
    290                 295                 300                 


Thr Val Val Thr Gln Phe Ile Gly Ser Gly Thr Ser Leu Thr Glu Ile
305                 310                 315                 320 


Lys Arg Phe Tyr Val Gln Asn Gly Lys Val Phe Lys Asn Ser Ala Ser
                325                 330                 335     


Ala Ile Asp Gly Val Thr Gly Asn Ser Ile Thr Asp Asp Phe Cys Ala
            340                 345                 350         


Ala Gln Lys Lys Ala Phe Gly Asp Thr Ser Ser Phe Ala Asp Arg Gly
        355                 360                 365             


Gly Leu Lys Gly Met Ala Ser Ser Leu Ala Lys Gly His Val Leu Val
    370                 375                 380                 


Met Ser Leu Trp Asp Asp His Ala Val Asn Met Leu Trp Leu Asp Ser
385                 390                 395                 400 


Thr Tyr Pro Thr Asp Lys Asp Ala Ser Thr Pro Gly Val Gly Arg Gly
                405                 410                 415     


Thr Cys Gly Thr Asp Ser Gly Lys Pro Glu Asp Val Glu Ser Lys Ser
            420                 425                 430         


Pro Asp Ala Gln Val Ile Tyr Ser Asn Ile Arg Phe Gly Pro Ile Gly
        435                 440                 445             


Ser Thr Phe Asp Glu Ser Ala Ala Val
    450                 455         


<210> 3
<211> 2577
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 3
atgctttcga accgccggtt gatccgtacg attccgcttg gcgccgccgc gtacagcgtg     60

ctgctggggc tggccggttg cagccagagc acggttgcga ccgcgccagc cgtcgagcct    120

accgcatcgc aggccatggt cgcgcagccg cagaactggc cgcgcgtgga ttggccgctt    180

ccgcacgatg cggcgctcga agcgcgcgtg gatgcgctgc tggcgacgat gagcgtggag    240

gagaaggtcg gccagatcgt gcagggcgac atcgacagca tcacgccgga agaccttcgc    300

aagtatcggc tgggttcgat cctggccggc ggcaattccg atccgggcag gaagtacaac    360

gccgctcctt cggaatggct ggcgctggcc gatgcgttct gggaagcctc gatggatacc    420

tctggcggcg gcaaggcgat cccggtgatc ttcggcatcg acgcggtgca tggacagagc    480

aacatcgtcg gagccacgct gtttccgcac aacatcggtt tgggcgcaac gcgcaatccc    540

gatctcatcc gggaaattgg ccgcatcacc gcggtggaaa cccgcgttac aggcatggaa    600

tggacgttcg cgccaacggt cgcggttccg caggacgacc gctgggggcg cagctacgag    660

ggctattcgg aaagcgccga tgtcgtcgcc agctttgcgc cggcgatggt cgaaggactg    720

cagggcatgg ccggtgcggc ggatttcctc gacgaccatc atgtgatgac gtcgatcaag    780

catttcctcg gggatggcgg cacgcgcgac ggcaaggatc agggcgacac cctcgcgagc    840

gaagcgcagc tgcgcgacat ccacgccgcc ggctacatca ccggcatcgc agcgggcgcg    900

caggcagtaa tggcctcgta caacagctgg catggcgaaa agatgcacgg gcgccgcgac    960

ctgctcaccg atgtgctgaa ggggcgcatg gatttcggcg gcttcgtggt cggcgattgg   1020

aatggccacg gtcaggtggc gggctgtacc aataccgatt gtccggcttc attcaacgcc   1080

ggcctggaca tggcgatggc gcccgacagc tggaagggcc tgtacgagag cacgctggcg   1140

cacgtgaagg caggcacgat cccgatgcag cggctggacg atgcggtgcg ccgtatcctg   1200

cgggtcaagt tccgcatggg gctgttcgag aagccgcggc cttcgcagcg tgcgctcggc   1260

ggcaagttcg aactgctcgg agcgccggag catcgcgcgg tggcacgtca ggcggtgcgt   1320

gaatccctgg tgttgctgaa gaaccagaac ggcctgttgc cgctttcccc gaatcagaga   1380

ctgctggtgg ccggcgacgg cgcgaacgat ctgggcaagc aatccggcgg ttggacgctc   1440

aactggcagg gcaccggcac cacgcgtgcc gattacccga acgcggattc gatctgggaa   1500

gggctcaagg cgcaggtcga agcggcgggc ggtcaggccg agcttgccgt cgatggccag   1560

taccggaacc gaccggacgc ggccatcgtc gtgttcggcg agaacccgta cgcggagttc   1620

cagggagaca tcccgaacct gctgtaccgc cccggagacg atggcgacct ggaactgatc   1680

cgtcgcctga aggctgaagg catcccggtg gttgcggtgt tcctgagcgg caggccgctg   1740

tggatgaatc gcgagatcaa cgcggccgat gccttcgtcg cggcctggtt gccgggctct   1800

gaaggtggcg gcgtcgccga tgtgctgctg cgtggcgcgg acggcaaggt ccagcacgat   1860

ttcaagggca agctgagttt ctcgtggccg cgtcgcgccg accagttcga caacaatgtc   1920

ggtcaggaaa actacgatcc gctgttcgcg ttcgggtatg gcctgactta tgccgacgac   1980

ggcgatgtgg cgccgttgtc cgaggattcg ggcatctccg gacaggtgtc gcaggcgaat   2040

gtgttcttct cgcatggcgt tgcgagccaa ggcttgaggc tgcgcctgat cggcgctgat   2100

ggcggcatcg cggacgtgat gcatccgcag gcgaagaccg ccgacggcac gctggttctt   2160

agcgcgatcg actaccagcg ccaggaaggt gcacgtcgtt tggtgtggac gggcagggcg   2220

accgcggaac tgctttccac cacgccgctc gacctgggcc gggaaaccaa cggcgacgtg   2280

aagatcgttg ccacgctgcg catcgacgcg cttccggccg atggcgacgt ggcgctgatc   2340

gcgcgcagtg gtgatcgcag tgccgaactt ccgatcggcg actggttcgc cagtcttccg   2400

cgcgggcagt ggttgagcgc gggcgtgctg ctcaagtgcc tgcgtgtcgc tggttcggat   2460

acggcgaagc tggacgcgcc gttctccatc cgtggcagcg cagggctgga actggcgctg   2520

gcgcaagtgg tggcggccac cgccgacgac acgcatctgg actgcccgat ccagtag      2577


<210> 4
<211> 858
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (443)...(659)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> DOMAIN
<222> (143)...(367)
<223> Glycosyl hydrolase family 3 N terminal domain

<400> 4
Met Leu Ser Asn Arg Arg Leu Ile Arg Thr Ile Pro Leu Gly Ala Ala
1               5                   10                  15      


Ala Tyr Ser Val Leu Leu Gly Leu Ala Gly Cys Ser Gln Ser Thr Val
            20                  25                  30          


Ala Thr Ala Pro Ala Val Glu Pro Thr Ala Ser Gln Ala Met Val Ala
        35                  40                  45              


Gln Pro Gln Asn Trp Pro Arg Val Asp Trp Pro Leu Pro His Asp Ala
    50                  55                  60                  


Ala Leu Glu Ala Arg Val Asp Ala Leu Leu Ala Thr Met Ser Val Glu
65                  70                  75                  80  


Glu Lys Val Gly Gln Ile Val Gln Gly Asp Ile Asp Ser Ile Thr Pro
                85                  90                  95      


Glu Asp Leu Arg Lys Tyr Arg Leu Gly Ser Ile Leu Ala Gly Gly Asn
            100                 105                 110         


Ser Asp Pro Gly Arg Lys Tyr Asn Ala Ala Pro Ser Glu Trp Leu Ala
        115                 120                 125             


Leu Ala Asp Ala Phe Trp Glu Ala Ser Met Asp Thr Ser Gly Gly Gly
    130                 135                 140                 


Lys Ala Ile Pro Val Ile Phe Gly Ile Asp Ala Val His Gly Gln Ser
145                 150                 155                 160 


Asn Ile Val Gly Ala Thr Leu Phe Pro His Asn Ile Gly Leu Gly Ala
                165                 170                 175     


Thr Arg Asn Pro Asp Leu Ile Arg Glu Ile Gly Arg Ile Thr Ala Val
            180                 185                 190         


Glu Thr Arg Val Thr Gly Met Glu Trp Thr Phe Ala Pro Thr Val Ala
        195                 200                 205             


Val Pro Gln Asp Asp Arg Trp Gly Arg Ser Tyr Glu Gly Tyr Ser Glu
    210                 215                 220                 


Ser Ala Asp Val Val Ala Ser Phe Ala Pro Ala Met Val Glu Gly Leu
225                 230                 235                 240 


Gln Gly Met Ala Gly Ala Ala Asp Phe Leu Asp Asp His His Val Met
                245                 250                 255     


Thr Ser Ile Lys His Phe Leu Gly Asp Gly Gly Thr Arg Asp Gly Lys
            260                 265                 270         


Asp Gln Gly Asp Thr Leu Ala Ser Glu Ala Gln Leu Arg Asp Ile His
        275                 280                 285             


Ala Ala Gly Tyr Ile Thr Gly Ile Ala Ala Gly Ala Gln Ala Val Met
    290                 295                 300                 


Ala Ser Tyr Asn Ser Trp His Gly Glu Lys Met His Gly Arg Arg Asp
305                 310                 315                 320 


Leu Leu Thr Asp Val Leu Lys Gly Arg Met Asp Phe Gly Gly Phe Val
                325                 330                 335     


Val Gly Asp Trp Asn Gly His Gly Gln Val Ala Gly Cys Thr Asn Thr
            340                 345                 350         


Asp Cys Pro Ala Ser Phe Asn Ala Gly Leu Asp Met Ala Met Ala Pro
        355                 360                 365             


Asp Ser Trp Lys Gly Leu Tyr Glu Ser Thr Leu Ala His Val Lys Ala
    370                 375                 380                 


Gly Thr Ile Pro Met Gln Arg Leu Asp Asp Ala Val Arg Arg Ile Leu
385                 390                 395                 400 


Arg Val Lys Phe Arg Met Gly Leu Phe Glu Lys Pro Arg Pro Ser Gln
                405                 410                 415     


Arg Ala Leu Gly Gly Lys Phe Glu Leu Leu Gly Ala Pro Glu His Arg
            420                 425                 430         


Ala Val Ala Arg Gln Ala Val Arg Glu Ser Leu Val Leu Leu Lys Asn
        435                 440                 445             


Gln Asn Gly Leu Leu Pro Leu Ser Pro Asn Gln Arg Leu Leu Val Ala
    450                 455                 460                 


Gly Asp Gly Ala Asn Asp Leu Gly Lys Gln Ser Gly Gly Trp Thr Leu
465                 470                 475                 480 


Asn Trp Gln Gly Thr Gly Thr Thr Arg Ala Asp Tyr Pro Asn Ala Asp
                485                 490                 495     


Ser Ile Trp Glu Gly Leu Lys Ala Gln Val Glu Ala Ala Gly Gly Gln
            500                 505                 510         


Ala Glu Leu Ala Val Asp Gly Gln Tyr Arg Asn Arg Pro Asp Ala Ala
        515                 520                 525             


Ile Val Val Phe Gly Glu Asn Pro Tyr Ala Glu Phe Gln Gly Asp Ile
    530                 535                 540                 


Pro Asn Leu Leu Tyr Arg Pro Gly Asp Asp Gly Asp Leu Glu Leu Ile
545                 550                 555                 560 


Arg Arg Leu Lys Ala Glu Gly Ile Pro Val Val Ala Val Phe Leu Ser
                565                 570                 575     


Gly Arg Pro Leu Trp Met Asn Arg Glu Ile Asn Ala Ala Asp Ala Phe
            580                 585                 590         


Val Ala Ala Trp Leu Pro Gly Ser Glu Gly Gly Gly Val Ala Asp Val
        595                 600                 605             


Leu Leu Arg Gly Ala Asp Gly Lys Val Gln His Asp Phe Lys Gly Lys
    610                 615                 620                 


Leu Ser Phe Ser Trp Pro Arg Arg Ala Asp Gln Phe Asp Asn Asn Val
625                 630                 635                 640 


Gly Gln Glu Asn Tyr Asp Pro Leu Phe Ala Phe Gly Tyr Gly Leu Thr
                645                 650                 655     


Tyr Ala Asp Asp Gly Asp Val Ala Pro Leu Ser Glu Asp Ser Gly Ile
            660                 665                 670         


Ser Gly Gln Val Ser Gln Ala Asn Val Phe Phe Ser His Gly Val Ala
        675                 680                 685             


Ser Gln Gly Leu Arg Leu Arg Leu Ile Gly Ala Asp Gly Gly Ile Ala
    690                 695                 700                 


Asp Val Met His Pro Gln Ala Lys Thr Ala Asp Gly Thr Leu Val Leu
705                 710                 715                 720 


Ser Ala Ile Asp Tyr Gln Arg Gln Glu Gly Ala Arg Arg Leu Val Trp
                725                 730                 735     


Thr Gly Arg Ala Thr Ala Glu Leu Leu Ser Thr Thr Pro Leu Asp Leu
            740                 745                 750         


Gly Arg Glu Thr Asn Gly Asp Val Lys Ile Val Ala Thr Leu Arg Ile
        755                 760                 765             


Asp Ala Leu Pro Ala Asp Gly Asp Val Ala Leu Ile Ala Arg Ser Gly
    770                 775                 780                 


Asp Arg Ser Ala Glu Leu Pro Ile Gly Asp Trp Phe Ala Ser Leu Pro
785                 790                 795                 800 


Arg Gly Gln Trp Leu Ser Ala Gly Val Leu Leu Lys Cys Leu Arg Val
                805                 810                 815     


Ala Gly Ser Asp Thr Ala Lys Leu Asp Ala Pro Phe Ser Ile Arg Gly
            820                 825                 830         


Ser Ala Gly Leu Glu Leu Ala Leu Ala Gln Val Val Ala Ala Thr Ala
        835                 840                 845             


Asp Asp Thr His Leu Asp Cys Pro Ile Gln
    850                 855             


<210> 5
<211> 2139
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 5
atggctgaga atttacgacc tatctatctc gattcgtcgc gctcgatcgc tgaacgcgtt     60

caggatttgc tttctcgcat gacactggat gaaaaaatca gccagatgag taatgcggtc    120

tccgccatac cacgcctgaa catgccagcc tacgacttct ggagtgaagc cctgcatgga    180

gtcgccagga atggcaaggc gaccgtcttc ccacaggcca tcggcatggc tgccacctgg    240

gaccctgcct tagtgcggcg tattggaaac gcaatcagcg atgagggccg agcaaaatac    300

cacgctgctc tgaaaaggcg agccttcaca ggtcagtatc aaggcctgtg tttttggagc    360

ccaaacatta accaggaccg ggacgttcgt tggggaagag gacaggagac ctacggagaa    420

gatccttacc tcacgggtga attgggtgta gcatttgtgc gcggcattca gggtgaccat    480

cccaggtatt tgaaggctgc cgcgtgtgcc aagcatttcg ctgtccattc tggtccggaa    540

gcacttcgcc atgtctttaa tgccgtcccc tccagtcgtg atctgcagga tacctatctg    600

ccggccttta aaaagttggt tcaagaaggc aaagttgaat cggtgatggg cgcttacaat    660

gcggtttacg gcgtcccttg caatgccagc gagttcttgc tcaatcaaac actccgaaaa    720

gagtggggat ttgaaggtca tgtcgtctca gactgcggtg ccatcaccga tctccaccgc    780

agccataaat acaccaagga cgctgccgaa tcagcagcga tggcaattaa ggcagggtgc    840

gacctatctt gtgaccatgt ttattacgag agcattggcg aggctgtcga acgaggcctt    900

ctgagtgtag ccgatgtaga tcgggctcta gcccgcacct tgagtacgcg cttcaagctg    960

ggcatgtttg atcctgatga gatggttccg tatgcttcga tcccgttgag cgtgatcgac   1020

agcccagaac atcgacagtt ggcctacgag gcggcagtca agtcggtggt gctgctcaaa   1080

aacaaaaacg acatcttgcc tattgcacct gagaccactt cgataatggt cgcgggtcca   1140

aatgcggcgg ttgtaaatgt cctgcttggt aattattatg gcttcaacgg ccgcatggtc   1200

actgtgttgg aagggatcac cgatgcgctg ccggagggca tgggcatgga atatcaccag   1260

ggtatgatgc tgacagatac cggcacaacg cctgacaact ggtcaatcgg tatggctgcc   1320

agggctggac tgaccatagc ctgtatgggt atctcaccct tgatggaagg tgaagagggc   1380

gaggcgctgc tcgttgaaca cggtggtgac cgtagttcaa tcgagctgcc caaagctcag   1440

gtcgattacc tgcggaagtt gaacatcgcc ggagccagga tcgtgctggt tctgtttggt   1500

ggcagtgcaa tagcgctggg cgaagccgaa gatctggtcg aagcgattat ccatgtctgg   1560

tatcctggcg aggaaggtgg tcatgctgta gcggatatcc tgttcggcaa ggctacccca   1620

tccggcaaac tacccattac ctttcccaag gccaccagcc agcttccgcc ctttgaagat   1680

tacagcatga aagacagaac ctaccgctac gcaacctggg agccgctcta tccgtttgga   1740

ttcggattga gttacacgac ttttgcttac tcggatctaa aactcgacca gtctaccctc   1800

aaagctggcg attcgcttga ggtttcccta aggctgacca atacaggtga attagccggc   1860

gaagaagtag ttcaggtcta tataacagat ctcgaggctt caaacgtcgt gcctatccac   1920

aagctggcag cattccggcg tgtagctcta cagcccggtg agagccagtt gctttccttc   1980

agcatcgctc cagaatcgat gatgttcgta gacgatgatg gcaatcagca gcttgagcca   2040

ggcgatttcc ggctgacgat cggtggcagt tcgcctggcc agcgtagcca gacattaggt   2100

gcgcccgaac cattgagcga caccttctcc gttcaataa                          2139


<210> 6
<211> 712
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (356)...(587)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> DOMAIN
<222> (40)...(285)
<223> Glycosyl hydrolase family 3 N terminal domain

<400> 6
Met Ala Glu Asn Leu Arg Pro Ile Tyr Leu Asp Ser Ser Arg Ser Ile
1               5                   10                  15      


Ala Glu Arg Val Gln Asp Leu Leu Ser Arg Met Thr Leu Asp Glu Lys
            20                  25                  30          


Ile Ser Gln Met Ser Asn Ala Val Ser Ala Ile Pro Arg Leu Asn Met
        35                  40                  45              


Pro Ala Tyr Asp Phe Trp Ser Glu Ala Leu His Gly Val Ala Arg Asn
    50                  55                  60                  


Gly Lys Ala Thr Val Phe Pro Gln Ala Ile Gly Met Ala Ala Thr Trp
65                  70                  75                  80  


Asp Pro Ala Leu Val Arg Arg Ile Gly Asn Ala Ile Ser Asp Glu Gly
                85                  90                  95      


Arg Ala Lys Tyr His Ala Ala Leu Lys Arg Arg Ala Phe Thr Gly Gln
            100                 105                 110         


Tyr Gln Gly Leu Cys Phe Trp Ser Pro Asn Ile Asn Gln Asp Arg Asp
        115                 120                 125             


Val Arg Trp Gly Arg Gly Gln Glu Thr Tyr Gly Glu Asp Pro Tyr Leu
    130                 135                 140                 


Thr Gly Glu Leu Gly Val Ala Phe Val Arg Gly Ile Gln Gly Asp His
145                 150                 155                 160 


Pro Arg Tyr Leu Lys Ala Ala Ala Cys Ala Lys His Phe Ala Val His
                165                 170                 175     


Ser Gly Pro Glu Ala Leu Arg His Val Phe Asn Ala Val Pro Ser Ser
            180                 185                 190         


Arg Asp Leu Gln Asp Thr Tyr Leu Pro Ala Phe Lys Lys Leu Val Gln
        195                 200                 205             


Glu Gly Lys Val Glu Ser Val Met Gly Ala Tyr Asn Ala Val Tyr Gly
    210                 215                 220                 


Val Pro Cys Asn Ala Ser Glu Phe Leu Leu Asn Gln Thr Leu Arg Lys
225                 230                 235                 240 


Glu Trp Gly Phe Glu Gly His Val Val Ser Asp Cys Gly Ala Ile Thr
                245                 250                 255     


Asp Leu His Arg Ser His Lys Tyr Thr Lys Asp Ala Ala Glu Ser Ala
            260                 265                 270         


Ala Met Ala Ile Lys Ala Gly Cys Asp Leu Ser Cys Asp His Val Tyr
        275                 280                 285             


Tyr Glu Ser Ile Gly Glu Ala Val Glu Arg Gly Leu Leu Ser Val Ala
    290                 295                 300                 


Asp Val Asp Arg Ala Leu Ala Arg Thr Leu Ser Thr Arg Phe Lys Leu
305                 310                 315                 320 


Gly Met Phe Asp Pro Asp Glu Met Val Pro Tyr Ala Ser Ile Pro Leu
                325                 330                 335     


Ser Val Ile Asp Ser Pro Glu His Arg Gln Leu Ala Tyr Glu Ala Ala
            340                 345                 350         


Val Lys Ser Val Val Leu Leu Lys Asn Lys Asn Asp Ile Leu Pro Ile
        355                 360                 365             


Ala Pro Glu Thr Thr Ser Ile Met Val Ala Gly Pro Asn Ala Ala Val
    370                 375                 380                 


Val Asn Val Leu Leu Gly Asn Tyr Tyr Gly Phe Asn Gly Arg Met Val
385                 390                 395                 400 


Thr Val Leu Glu Gly Ile Thr Asp Ala Leu Pro Glu Gly Met Gly Met
                405                 410                 415     


Glu Tyr His Gln Gly Met Met Leu Thr Asp Thr Gly Thr Thr Pro Asp
            420                 425                 430         


Asn Trp Ser Ile Gly Met Ala Ala Arg Ala Gly Leu Thr Ile Ala Cys
        435                 440                 445             


Met Gly Ile Ser Pro Leu Met Glu Gly Glu Glu Gly Glu Ala Leu Leu
    450                 455                 460                 


Val Glu His Gly Gly Asp Arg Ser Ser Ile Glu Leu Pro Lys Ala Gln
465                 470                 475                 480 


Val Asp Tyr Leu Arg Lys Leu Asn Ile Ala Gly Ala Arg Ile Val Leu
                485                 490                 495     


Val Leu Phe Gly Gly Ser Ala Ile Ala Leu Gly Glu Ala Glu Asp Leu
            500                 505                 510         


Val Glu Ala Ile Ile His Val Trp Tyr Pro Gly Glu Glu Gly Gly His
        515                 520                 525             


Ala Val Ala Asp Ile Leu Phe Gly Lys Ala Thr Pro Ser Gly Lys Leu
    530                 535                 540                 


Pro Ile Thr Phe Pro Lys Ala Thr Ser Gln Leu Pro Pro Phe Glu Asp
545                 550                 555                 560 


Tyr Ser Met Lys Asp Arg Thr Tyr Arg Tyr Ala Thr Trp Glu Pro Leu
                565                 570                 575     


Tyr Pro Phe Gly Phe Gly Leu Ser Tyr Thr Thr Phe Ala Tyr Ser Asp
            580                 585                 590         


Leu Lys Leu Asp Gln Ser Thr Leu Lys Ala Gly Asp Ser Leu Glu Val
        595                 600                 605             


Ser Leu Arg Leu Thr Asn Thr Gly Glu Leu Ala Gly Glu Glu Val Val
    610                 615                 620                 


Gln Val Tyr Ile Thr Asp Leu Glu Ala Ser Asn Val Val Pro Ile His
625                 630                 635                 640 


Lys Leu Ala Ala Phe Arg Arg Val Ala Leu Gln Pro Gly Glu Ser Gln
                645                 650                 655     


Leu Leu Ser Phe Ser Ile Ala Pro Glu Ser Met Met Phe Val Asp Asp
            660                 665                 670         


Asp Gly Asn Gln Gln Leu Glu Pro Gly Asp Phe Arg Leu Thr Ile Gly
        675                 680                 685             


Gly Ser Ser Pro Gly Gln Arg Ser Gln Thr Leu Gly Ala Pro Glu Pro
    690                 695                 700                 


Leu Ser Asp Thr Phe Ser Val Gln
705                 710         


<210> 7
<211> 2265
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 7
atgaaccggg aagtccccac tgtttcgcca cgaccgctgc tggtcggcat gatcgctgtc     60

ctgctggcgg caccggccgc cgccaagccg ccggcctacg cgacgccggc cgaaaaagcc    120

ttcgtcgacg ccctgatggc gaagatgacg gtcgaggaga agctgggcca gttgaaccag    180

ccggccgggg tgggcaacaa caccggcccg gcggccatga ccggcaacga ggaccagatc    240

cgcaagggcg agatcggcac ctatctgggt acccagggcg cggtgctgac ctgccgcctg    300

cagcggatcg cggtggagca gtcccggctg ggcatcccgc tgatgttcgg ctacgacgtg    360

atccacggcc accgcaccgt cttcccggtg ccgctgggcg agtccgccag cttcgatccg    420

gtcgaagtgc agcatgccgc ccgtgtggcg gcgatcgagg cctcggcgca cggcatccac    480

tggacctacg cgccgatggt cgacatcgcg cgtgatccgc gctggggtcg cgtggtggaa    540

ggcgcgggcg aagatcccta tctgggatcg gtactcgcgg cggcgcgcgt gcgcggcttc    600

cagggcgatg acctgcgcaa gccggatgcg gtgctggcca cggccaagca cttcgtcgcg    660

tacggcgcgg ccgagggcgg ccgcgattac gacgtggccg acatttccga acgcacgctg    720

cacgaggtct acctgccgcc gttcaaggcc gcggtcgatg ccggcgcgca atccatcatg    780

gccgcgttca acgagatcgc cggcgtgccc atgcacgcgc accggccgct gatcgaggat    840

ctgctgcgca aggagtgggg ctgggacggc ctgctggtca gcgattacac cggcgtgatg    900

gaactgatgc cgcacggcat cgccgccgac cgcaagcagg cgggtgcgct cggcctgcgt    960

gccggcgtcg acgtcgacat ggtcagccag atctacgtga aggacctgcc ggccgaagtg   1020

aaggcgggcc gggtgccgat ggcccaggtg gatgcgtcgg tgcgccgtgt gctcaacgca   1080

aagtaccggc tgggcctgtt cgacgatccc taccggtcct gcaccgacga tggcgcgcac   1140

gagcgcgcga tgacgctgac gccggagcac cgcgccgacg cacgccggat ggcgcagaaa   1200

tcgctggtgc tgctggagaa cggcaacgac gtgctgccgc tgtcgaagtc ggtccgcacg   1260

ctggcggtga tcggcccgct cgccgaccat cgccgcgcga tgctcggcaa ctgggcggtg   1320

gccggtcgcg aagaggacgc ggtgacgccg atcgagggcc tgaaggccgc gctgggtgac   1380

ggtacccggc tggtgatcgc gaagggcgcc gacatcgaca gccaggacac gtccggtttc   1440

gcgcaggccg tcgcggcggc gaagcaggcc gatgcggtgg tgatgttcct cggcgaacac   1500

ccggacatga gcgcggaagc gaacaaccgc acctcgctgg acttgcccgg cgtgcaggaa   1560

caacttgcat tggcggtcgc cgcgaccggc aagccggtgg tggccgtgct gttgaacgga   1620

cgcccgctgt cgatcggtgg cctgaagggc aaggtgccgg cgatcctcga agcctggttc   1680

cccggggtcg agggcggtca tgcgatcacc gacgtactgt tcggcgacgt caatccgtcg   1740

gccaagctgc cggtgacgtt cccgcacaac gtcggccagg tgccgatcta ctacgcgcac   1800

aagaacaccg gccgtccgcc gagcgagcag gagaaataca ccagcaagta cctcgacgtg   1860

ccgtggacgc cgctgtacgc cttcggccac ggactcagct acaccacgtt ccgctacgac   1920

gcgccggtcg tggcgaagaa gacgctggcg ccttccgccc tgcagcagca ggtgagcgtg   1980

cgcatcacca acaccggcaa gcgggagggc gtcgaggtgg tgcaactgta tgtgcgggac   2040

gatgtcgcca ccgtcacccg tccggtgaaa cagttgcgcg gcttccagcg cgtgtcgctg   2100

gcgcctggcg aatcgaagac ggtcacgttc gacctcggct tcgaggatct ggcgatgtac   2160

gacgaccgca tgcagcaggt ggtggagccg ggcacgttca ccgtgtatgt cggcggcagt   2220

tccgaccgca cgcgtgaggt cgctttcgag gtcgccgccc aatag                   2265


<210> 8
<211> 754
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (402)...(636)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> DOMAIN
<222> (104)...(328)
<223> Glycosyl hydrolase family 3 N terminal domain

<400> 8
Met Asn Arg Glu Val Pro Thr Val Ser Pro Arg Pro Leu Leu Val Gly
1               5                   10                  15      


Met Ile Ala Val Leu Leu Ala Ala Pro Ala Ala Ala Lys Pro Pro Ala
            20                  25                  30          


Tyr Ala Thr Pro Ala Glu Lys Ala Phe Val Asp Ala Leu Met Ala Lys
        35                  40                  45              


Met Thr Val Glu Glu Lys Leu Gly Gln Leu Asn Gln Pro Ala Gly Val
    50                  55                  60                  


Gly Asn Asn Thr Gly Pro Ala Ala Met Thr Gly Asn Glu Asp Gln Ile
65                  70                  75                  80  


Arg Lys Gly Glu Ile Gly Thr Tyr Leu Gly Thr Gln Gly Ala Val Leu
                85                  90                  95      


Thr Cys Arg Leu Gln Arg Ile Ala Val Glu Gln Ser Arg Leu Gly Ile
            100                 105                 110         


Pro Leu Met Phe Gly Tyr Asp Val Ile His Gly His Arg Thr Val Phe
        115                 120                 125             


Pro Val Pro Leu Gly Glu Ser Ala Ser Phe Asp Pro Val Glu Val Gln
    130                 135                 140                 


His Ala Ala Arg Val Ala Ala Ile Glu Ala Ser Ala His Gly Ile His
145                 150                 155                 160 


Trp Thr Tyr Ala Pro Met Val Asp Ile Ala Arg Asp Pro Arg Trp Gly
                165                 170                 175     


Arg Val Val Glu Gly Ala Gly Glu Asp Pro Tyr Leu Gly Ser Val Leu
            180                 185                 190         


Ala Ala Ala Arg Val Arg Gly Phe Gln Gly Asp Asp Leu Arg Lys Pro
        195                 200                 205             


Asp Ala Val Leu Ala Thr Ala Lys His Phe Val Ala Tyr Gly Ala Ala
    210                 215                 220                 


Glu Gly Gly Arg Asp Tyr Asp Val Ala Asp Ile Ser Glu Arg Thr Leu
225                 230                 235                 240 


His Glu Val Tyr Leu Pro Pro Phe Lys Ala Ala Val Asp Ala Gly Ala
                245                 250                 255     


Gln Ser Ile Met Ala Ala Phe Asn Glu Ile Ala Gly Val Pro Met His
            260                 265                 270         


Ala His Arg Pro Leu Ile Glu Asp Leu Leu Arg Lys Glu Trp Gly Trp
        275                 280                 285             


Asp Gly Leu Leu Val Ser Asp Tyr Thr Gly Val Met Glu Leu Met Pro
    290                 295                 300                 


His Gly Ile Ala Ala Asp Arg Lys Gln Ala Gly Ala Leu Gly Leu Arg
305                 310                 315                 320 


Ala Gly Val Asp Val Asp Met Val Ser Gln Ile Tyr Val Lys Asp Leu
                325                 330                 335     


Pro Ala Glu Val Lys Ala Gly Arg Val Pro Met Ala Gln Val Asp Ala
            340                 345                 350         


Ser Val Arg Arg Val Leu Asn Ala Lys Tyr Arg Leu Gly Leu Phe Asp
        355                 360                 365             


Asp Pro Tyr Arg Ser Cys Thr Asp Asp Gly Ala His Glu Arg Ala Met
    370                 375                 380                 


Thr Leu Thr Pro Glu His Arg Ala Asp Ala Arg Arg Met Ala Gln Lys
385                 390                 395                 400 


Ser Leu Val Leu Leu Glu Asn Gly Asn Asp Val Leu Pro Leu Ser Lys
                405                 410                 415     


Ser Val Arg Thr Leu Ala Val Ile Gly Pro Leu Ala Asp His Arg Arg
            420                 425                 430         


Ala Met Leu Gly Asn Trp Ala Val Ala Gly Arg Glu Glu Asp Ala Val
        435                 440                 445             


Thr Pro Ile Glu Gly Leu Lys Ala Ala Leu Gly Asp Gly Thr Arg Leu
    450                 455                 460                 


Val Ile Ala Lys Gly Ala Asp Ile Asp Ser Gln Asp Thr Ser Gly Phe
465                 470                 475                 480 


Ala Gln Ala Val Ala Ala Ala Lys Gln Ala Asp Ala Val Val Met Phe
                485                 490                 495     


Leu Gly Glu His Pro Asp Met Ser Ala Glu Ala Asn Asn Arg Thr Ser
            500                 505                 510         


Leu Asp Leu Pro Gly Val Gln Glu Gln Leu Ala Leu Ala Val Ala Ala
        515                 520                 525             


Thr Gly Lys Pro Val Val Ala Val Leu Leu Asn Gly Arg Pro Leu Ser
    530                 535                 540                 


Ile Gly Gly Leu Lys Gly Lys Val Pro Ala Ile Leu Glu Ala Trp Phe
545                 550                 555                 560 


Pro Gly Val Glu Gly Gly His Ala Ile Thr Asp Val Leu Phe Gly Asp
                565                 570                 575     


Val Asn Pro Ser Ala Lys Leu Pro Val Thr Phe Pro His Asn Val Gly
            580                 585                 590         


Gln Val Pro Ile Tyr Tyr Ala His Lys Asn Thr Gly Arg Pro Pro Ser
        595                 600                 605             


Glu Gln Glu Lys Tyr Thr Ser Lys Tyr Leu Asp Val Pro Trp Thr Pro
    610                 615                 620                 


Leu Tyr Ala Phe Gly His Gly Leu Ser Tyr Thr Thr Phe Arg Tyr Asp
625                 630                 635                 640 


Ala Pro Val Val Ala Lys Lys Thr Leu Ala Pro Ser Ala Leu Gln Gln
                645                 650                 655     


Gln Val Ser Val Arg Ile Thr Asn Thr Gly Lys Arg Glu Gly Val Glu
            660                 665                 670         


Val Val Gln Leu Tyr Val Arg Asp Asp Val Ala Thr Val Thr Arg Pro
        675                 680                 685             


Val Lys Gln Leu Arg Gly Phe Gln Arg Val Ser Leu Ala Pro Gly Glu
    690                 695                 700                 


Ser Lys Thr Val Thr Phe Asp Leu Gly Phe Glu Asp Leu Ala Met Tyr
705                 710                 715                 720 


Asp Asp Arg Met Gln Gln Val Val Glu Pro Gly Thr Phe Thr Val Tyr
                725                 730                 735     


Val Gly Gly Ser Ser Asp Arg Thr Arg Glu Val Ala Phe Glu Val Ala
            740                 745                 750         


Ala Gln
        


<210> 9
<211> 1590
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 9
atgtctgcct tgaactcttt caatatgtac aagagcgccc tcatcttggg ctccttgctg     60

gcaacagctg gtgctcagca aattggtact tataccgctg aaacccatcc ctctctgagc    120

tggtctactt gcaaatcggg tggtagctgc accacaaact ccggtgccat tacgttagat    180

gccaactggc gttgggtcca tggtgtcaat accagcacca actgctacac tggcaacact    240

tggaatagcg ccatctgcga cactgatgca tcctgtgccc aggactgtgc tctcgatggt    300

gctgactact ctggcacgta cggtatcact acctccggca actcattgcg cctgaacttc    360

gttaccggtt ccaacgtcgg atctcgtact tacctgatgg ccgataacac ccactaccaa    420

atcttcgact tgttgaacca ggagttcacc ttcaccgtcg atgtctccca cctcccttgc    480

ggtttgaacg gtgccctcta cttcgtgacc atggatgccg acggtggcgt ctccaagtac    540

cccaacaaca aggccggtgc tcagtacggt gttggatact gtgactctca atgtcctcgt    600

gacttgaagt tcatcgctgg tcaggccaac gttgagggct ggacgccctc cgccaacaac    660

gccaacactg gaattggcaa tcacggagct tgctgcgcgg agcttgatat ctgggaggca    720

aacagcatct cagaggcctt gactcctcac ccttgcgata cacccggtct atctgtttgc    780

actactgatg cctgcggtgg tacctacagc tctgatcgtt acgccggtac ctgcgaccct    840

gatggatgtg acttcaaccc ttaccgtctt ggtgtcactg acttctacgg ctccggcaag    900

accgttgaca ccaccaagcc ctttaccgtt gtgactcaat tcgtcactaa cgacggtacc    960

tccaccggtt ccctctccga gatcagacgt tactacgttc agaacggcgt tgtcatcccc   1020

cagccttcct ccaagatctc cggaatcagc ggaaatgtca tcaactccga ctactgcgct   1080

gctgaaatct ccacctttgg cgggactgcc tccttcagca aacacggtgg cttgacaaac   1140

atggccgctg gtatggaagc tggtatggtc ttggtcatga gtttgtggga cgactacgcc   1200

gtcaacatgc tctggctcga cagcacctac cctacaaacg caactggtac ccccggtgcc   1260

gctcgtggta cctgcgctac cacttctggg gaccccaaga ccgttgaatc acaatccggc   1320

agctcctatg tcacattctc tgacattcgg gttggtcctt tcaattctac gttcagcggt   1380

ggttctagca ccggtggcag cactactact accgccagcc gcaccaccac cacctcggcc   1440

tcttccacct ctacttccag cacctctact ggcactggag tcgctggtca ctggggtcag   1500

tgtggtggcc agggctggac tggtcctacc acctgtgtta gtggaaccac atgcaccgtc   1560

gtgaaccctt actactctca atgtttgtaa                                    1590


<210> 10
<211> 529
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (497)...(525)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (27)...(460)
<223> Glycosyl hydrolase family 7

<400> 10
Met Ser Ala Leu Asn Ser Phe Asn Met Tyr Lys Ser Ala Leu Ile Leu
1               5                   10                  15      


Gly Ser Leu Leu Ala Thr Ala Gly Ala Gln Gln Ile Gly Thr Tyr Thr
            20                  25                  30          


Ala Glu Thr His Pro Ser Leu Ser Trp Ser Thr Cys Lys Ser Gly Gly
        35                  40                  45              


Ser Cys Thr Thr Asn Ser Gly Ala Ile Thr Leu Asp Ala Asn Trp Arg
    50                  55                  60                  


Trp Val His Gly Val Asn Thr Ser Thr Asn Cys Tyr Thr Gly Asn Thr
65                  70                  75                  80  


Trp Asn Ser Ala Ile Cys Asp Thr Asp Ala Ser Cys Ala Gln Asp Cys
                85                  90                  95      


Ala Leu Asp Gly Ala Asp Tyr Ser Gly Thr Tyr Gly Ile Thr Thr Ser
            100                 105                 110         


Gly Asn Ser Leu Arg Leu Asn Phe Val Thr Gly Ser Asn Val Gly Ser
        115                 120                 125             


Arg Thr Tyr Leu Met Ala Asp Asn Thr His Tyr Gln Ile Phe Asp Leu
    130                 135                 140                 


Leu Asn Gln Glu Phe Thr Phe Thr Val Asp Val Ser His Leu Pro Cys
145                 150                 155                 160 


Gly Leu Asn Gly Ala Leu Tyr Phe Val Thr Met Asp Ala Asp Gly Gly
                165                 170                 175     


Val Ser Lys Tyr Pro Asn Asn Lys Ala Gly Ala Gln Tyr Gly Val Gly
            180                 185                 190         


Tyr Cys Asp Ser Gln Cys Pro Arg Asp Leu Lys Phe Ile Ala Gly Gln
        195                 200                 205             


Ala Asn Val Glu Gly Trp Thr Pro Ser Ala Asn Asn Ala Asn Thr Gly
    210                 215                 220                 


Ile Gly Asn His Gly Ala Cys Cys Ala Glu Leu Asp Ile Trp Glu Ala
225                 230                 235                 240 


Asn Ser Ile Ser Glu Ala Leu Thr Pro His Pro Cys Asp Thr Pro Gly
                245                 250                 255     


Leu Ser Val Cys Thr Thr Asp Ala Cys Gly Gly Thr Tyr Ser Ser Asp
            260                 265                 270         


Arg Tyr Ala Gly Thr Cys Asp Pro Asp Gly Cys Asp Phe Asn Pro Tyr
        275                 280                 285             


Arg Leu Gly Val Thr Asp Phe Tyr Gly Ser Gly Lys Thr Val Asp Thr
    290                 295                 300                 


Thr Lys Pro Phe Thr Val Val Thr Gln Phe Val Thr Asn Asp Gly Thr
305                 310                 315                 320 


Ser Thr Gly Ser Leu Ser Glu Ile Arg Arg Tyr Tyr Val Gln Asn Gly
                325                 330                 335     


Val Val Ile Pro Gln Pro Ser Ser Lys Ile Ser Gly Ile Ser Gly Asn
            340                 345                 350         


Val Ile Asn Ser Asp Tyr Cys Ala Ala Glu Ile Ser Thr Phe Gly Gly
        355                 360                 365             


Thr Ala Ser Phe Ser Lys His Gly Gly Leu Thr Asn Met Ala Ala Gly
    370                 375                 380                 


Met Glu Ala Gly Met Val Leu Val Met Ser Leu Trp Asp Asp Tyr Ala
385                 390                 395                 400 


Val Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn Ala Thr Gly
                405                 410                 415     


Thr Pro Gly Ala Ala Arg Gly Thr Cys Ala Thr Thr Ser Gly Asp Pro
            420                 425                 430         


Lys Thr Val Glu Ser Gln Ser Gly Ser Ser Tyr Val Thr Phe Ser Asp
        435                 440                 445             


Ile Arg Val Gly Pro Phe Asn Ser Thr Phe Ser Gly Gly Ser Ser Thr
    450                 455                 460                 


Gly Gly Ser Thr Thr Thr Thr Ala Ser Arg Thr Thr Thr Thr Ser Ala
465                 470                 475                 480 


Ser Ser Thr Ser Thr Ser Ser Thr Ser Thr Gly Thr Gly Val Ala Gly
                485                 490                 495     


His Trp Gly Gln Cys Gly Gly Gln Gly Trp Thr Gly Pro Thr Thr Cys
            500                 505                 510         


Val Ser Gly Thr Thr Cys Thr Val Val Asn Pro Tyr Tyr Ser Gln Cys
        515                 520                 525             


Leu
    


<210> 11
<211> 1395
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 11
atgcagagaa catcagcttg ggcactgctc cttctggcgc agattgccac tgctcagcag     60

accgtctggg gacaatgtgg tggtatcggc tactctggac cgaccagctg tgttgcagga    120

tcttcttgta gcacccagaa ctcttactac gcccaatgtc tcccgggcag tggaaacggc    180

ggtggcggtg cggcaaccac gaccacgact gctggacaaa ccaccaagac caccatggcc    240

accaccacca ccacttcaac caagacctca gctggtagtg gcggcagcac cactactgct    300

cctcctgcta gcaacagtgg caaccccttc aagggatacc agccttacgt gaacccgtac    360

tacgcttccg aggttcagag cctggctatt ccctctctgg cagcctctct ggcgcccaag    420

gccagcgcgg tggccaaggt cccatccttc gtttggctgg acactgctgc taaggtccct    480

actatgggca cttacttggc agacatcaag gccaagaacg cggctggtgc taacccaccc    540

attgccggta tctttgtcgt ttacgatctt cctgaccgtg actgcgctgc tcttgccagt    600

aacggcgagt actccatcgc caacggcggt gttgccaact acaagaagta cattgactcg    660

atccgcgctc agcttctcaa gtaccctgat gtgcacacca tcctggtcat cgaacccgac    720

agtctcgcca acctggtcac caacatgaac gtcgccaaat gctcgggtgc tcacgacgcc    780

tacctggagt gcactgacta tgcactcaag cagctcaact tgcccaacgt tgccatgtac    840

cttgatgccg gacacgctgg ctggcttgga tggcccgcca acattggacc cgctgccgac    900

ctcttcgcca gtgtgtacaa gaatgccggc tctcccgccg ccgtccgtgg attggccacc    960

aacgttgcca actacaacgc ctggtccatc tccacctgcc catcttacac tcagggtgac   1020

cagaactgtg acgagaagcg ctacatcaac gccctcgctc ctctcctccg cgcgaacggc   1080

ttcgacgccc acttcatcat ggacacctcc cgtaacggtg tccagcccac taagcaacaa   1140

gcctggggtg actggtgcaa cgtcattggc actggcttcg gtaccccctt caccaccgac   1200

actggtgatg ctcttcagga cgctttcatc tgggtcaagc ccggtggcga gtgtgacggt   1260

acctcggaca catcctctcc tcgctacgac gcccactgcg gatacagcga tgccctcgag   1320

ccggcccccg aggctggaac ttggttccaa gcctacttcg agcagctgct cgtcaacgcc   1380

aacccaagct tctaa                                                    1395


<210> 12
<211> 464
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (22)...(50)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (117)...(431)
<223> Glycosyl hydrolases family 6

<400> 12
Met Gln Arg Thr Ser Ala Trp Ala Leu Leu Leu Leu Ala Gln Ile Ala
1               5                   10                  15      


Thr Ala Gln Gln Thr Val Trp Gly Gln Cys Gly Gly Ile Gly Tyr Ser
            20                  25                  30          


Gly Pro Thr Ser Cys Val Ala Gly Ser Ser Cys Ser Thr Gln Asn Ser
        35                  40                  45              


Tyr Tyr Ala Gln Cys Leu Pro Gly Ser Gly Asn Gly Gly Gly Gly Ala
    50                  55                  60                  


Ala Thr Thr Thr Thr Thr Ala Gly Gln Thr Thr Lys Thr Thr Met Ala
65                  70                  75                  80  


Thr Thr Thr Thr Thr Ser Thr Lys Thr Ser Ala Gly Ser Gly Gly Ser
                85                  90                  95      


Thr Thr Thr Ala Pro Pro Ala Ser Asn Ser Gly Asn Pro Phe Lys Gly
            100                 105                 110         


Tyr Gln Pro Tyr Val Asn Pro Tyr Tyr Ala Ser Glu Val Gln Ser Leu
        115                 120                 125             


Ala Ile Pro Ser Leu Ala Ala Ser Leu Ala Pro Lys Ala Ser Ala Val
    130                 135                 140                 


Ala Lys Val Pro Ser Phe Val Trp Leu Asp Thr Ala Ala Lys Val Pro
145                 150                 155                 160 


Thr Met Gly Thr Tyr Leu Ala Asp Ile Lys Ala Lys Asn Ala Ala Gly
                165                 170                 175     


Ala Asn Pro Pro Ile Ala Gly Ile Phe Val Val Tyr Asp Leu Pro Asp
            180                 185                 190         


Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr Ser Ile Ala Asn
        195                 200                 205             


Gly Gly Val Ala Asn Tyr Lys Lys Tyr Ile Asp Ser Ile Arg Ala Gln
    210                 215                 220                 


Leu Leu Lys Tyr Pro Asp Val His Thr Ile Leu Val Ile Glu Pro Asp
225                 230                 235                 240 


Ser Leu Ala Asn Leu Val Thr Asn Met Asn Val Ala Lys Cys Ser Gly
                245                 250                 255     


Ala His Asp Ala Tyr Leu Glu Cys Thr Asp Tyr Ala Leu Lys Gln Leu
            260                 265                 270         


Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp
        275                 280                 285             


Leu Gly Trp Pro Ala Asn Ile Gly Pro Ala Ala Asp Leu Phe Ala Ser
    290                 295                 300                 


Val Tyr Lys Asn Ala Gly Ser Pro Ala Ala Val Arg Gly Leu Ala Thr
305                 310                 315                 320 


Asn Val Ala Asn Tyr Asn Ala Trp Ser Ile Ser Thr Cys Pro Ser Tyr
                325                 330                 335     


Thr Gln Gly Asp Gln Asn Cys Asp Glu Lys Arg Tyr Ile Asn Ala Leu
            340                 345                 350         


Ala Pro Leu Leu Arg Ala Asn Gly Phe Asp Ala His Phe Ile Met Asp
        355                 360                 365             


Thr Ser Arg Asn Gly Val Gln Pro Thr Lys Gln Gln Ala Trp Gly Asp
    370                 375                 380                 


Trp Cys Asn Val Ile Gly Thr Gly Phe Gly Thr Pro Phe Thr Thr Asp
385                 390                 395                 400 


Thr Gly Asp Ala Leu Gln Asp Ala Phe Ile Trp Val Lys Pro Gly Gly
                405                 410                 415     


Glu Cys Asp Gly Thr Ser Asp Thr Ser Ser Pro Arg Tyr Asp Ala His
            420                 425                 430         


Cys Gly Tyr Ser Asp Ala Leu Glu Pro Ala Pro Glu Ala Gly Thr Trp
        435                 440                 445             


Phe Gln Ala Tyr Phe Glu Gln Leu Leu Val Asn Ala Asn Pro Ser Phe
    450                 455                 460                 


<210> 13
<211> 1377
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 13
atggacatga tgatgacccg actcactttc cccgccggtt tcgtgtgggg cgcggcgact     60

tcttcctatc aaattgaggg cgcatgggat gaggatggca agggcgaatc catttgggat    120

cgttttgccc acaccccagg caaggtgatc gatggctcga atggcgatgt cgcatgtgac    180

cactatcatc gctatcgcga agatgttgcg ctgatggccg gcctcggctt gcaagcctat    240

cgattttcgg tggcatggcc gcgcatcctg cccgaaggtc gaggacgcgt caaccaggca    300

ggactcgact tttacagccg gctggtcgac gaactgcttg ccgccaacat gaccccgttt    360

gccaccctct atcactggga tttgccccag gcattgcagg atcagggtgg ttggcctgcg    420

cgtgccactg ccgaagcctt tgtcgaatat gccgacgtga tctcgcgcca tttgggcgac    480

cgcgtcaaac agtggatcac ccacaatgag ccgtggtgtg tggcgctctt gagccaccag    540

atcggcgaac acgcgcccgg ctggcaagat tggcccgccg cactgctggc cggacatcac    600

gtacttctct cgcatgggtg ggctgtgcct gtcattcgtg ccaacatccc agacgctgag    660

gtaggtataa cgctaaactt caccccggct gttgccgcct caccgagccg cgccgatttt    720

gaggcaacac gctggttcga cggctattac aaccgctggt tcctcgatgc gctctatggc    780

cgcggctatc ccgcagacat ggtggcggac tatagcaagg cggggcatct gcccaatggc    840

cctgactttg tgcattccaa tgacctcgct gccattgccg cgctgaccga tttcctgggc    900

gtcaatatgt acacgcgtga ggtcgtgcgc gcagcggacg caccagataa tttgccgcgc    960

acacgctttg ccgcgccggc ggaggaacac accgaaatgg ggtgggaggt gcatcccgac   1020

agcctctatc gtttgctgtt gcggctgggc acgaactatc caattgaaaa gctctatatt   1080

acggagaatg gtgcgagcta tggggatggg ccggacgctg aggggcgcat ccatgaccaa   1140

cggcgcattg actatttgcg cgaccacttg gccgcgtgtc accgcgcgat tgaggccggg   1200

gtgccgctgc aaggctattt ccagtggagc ttactggaca attttgagtg ggcgaggggc   1260

tatacccagc gttttggcat ggtgtgggta gactttgcta cacagcagcg tataccgaag   1320

gccagcgccg agtggtatcg cgaggcaatc cgccacaatg gttttgctgt tgagtag      1377


<210> 14
<211> 458
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (6)...(455)
<223> Glycosyl hydrolase family 1

<400> 14
Met Asp Met Met Met Thr Arg Leu Thr Phe Pro Ala Gly Phe Val Trp
1               5                   10                  15      


Gly Ala Ala Thr Ser Ser Tyr Gln Ile Glu Gly Ala Trp Asp Glu Asp
            20                  25                  30          


Gly Lys Gly Glu Ser Ile Trp Asp Arg Phe Ala His Thr Pro Gly Lys
        35                  40                  45              


Val Ile Asp Gly Ser Asn Gly Asp Val Ala Cys Asp His Tyr His Arg
    50                  55                  60                  


Tyr Arg Glu Asp Val Ala Leu Met Ala Gly Leu Gly Leu Gln Ala Tyr
65                  70                  75                  80  


Arg Phe Ser Val Ala Trp Pro Arg Ile Leu Pro Glu Gly Arg Gly Arg
                85                  90                  95      


Val Asn Gln Ala Gly Leu Asp Phe Tyr Ser Arg Leu Val Asp Glu Leu
            100                 105                 110         


Leu Ala Ala Asn Met Thr Pro Phe Ala Thr Leu Tyr His Trp Asp Leu
        115                 120                 125             


Pro Gln Ala Leu Gln Asp Gln Gly Gly Trp Pro Ala Arg Ala Thr Ala
    130                 135                 140                 


Glu Ala Phe Val Glu Tyr Ala Asp Val Ile Ser Arg His Leu Gly Asp
145                 150                 155                 160 


Arg Val Lys Gln Trp Ile Thr His Asn Glu Pro Trp Cys Val Ala Leu
                165                 170                 175     


Leu Ser His Gln Ile Gly Glu His Ala Pro Gly Trp Gln Asp Trp Pro
            180                 185                 190         


Ala Ala Leu Leu Ala Gly His His Val Leu Leu Ser His Gly Trp Ala
        195                 200                 205             


Val Pro Val Ile Arg Ala Asn Ile Pro Asp Ala Glu Val Gly Ile Thr
    210                 215                 220                 


Leu Asn Phe Thr Pro Ala Val Ala Ala Ser Pro Ser Arg Ala Asp Phe
225                 230                 235                 240 


Glu Ala Thr Arg Trp Phe Asp Gly Tyr Tyr Asn Arg Trp Phe Leu Asp
                245                 250                 255     


Ala Leu Tyr Gly Arg Gly Tyr Pro Ala Asp Met Val Ala Asp Tyr Ser
            260                 265                 270         


Lys Ala Gly His Leu Pro Asn Gly Pro Asp Phe Val His Ser Asn Asp
        275                 280                 285             


Leu Ala Ala Ile Ala Ala Leu Thr Asp Phe Leu Gly Val Asn Met Tyr
    290                 295                 300                 


Thr Arg Glu Val Val Arg Ala Ala Asp Ala Pro Asp Asn Leu Pro Arg
305                 310                 315                 320 


Thr Arg Phe Ala Ala Pro Ala Glu Glu His Thr Glu Met Gly Trp Glu
                325                 330                 335     


Val His Pro Asp Ser Leu Tyr Arg Leu Leu Leu Arg Leu Gly Thr Asn
            340                 345                 350         


Tyr Pro Ile Glu Lys Leu Tyr Ile Thr Glu Asn Gly Ala Ser Tyr Gly
        355                 360                 365             


Asp Gly Pro Asp Ala Glu Gly Arg Ile His Asp Gln Arg Arg Ile Asp
    370                 375                 380                 


Tyr Leu Arg Asp His Leu Ala Ala Cys His Arg Ala Ile Glu Ala Gly
385                 390                 395                 400 


Val Pro Leu Gln Gly Tyr Phe Gln Trp Ser Leu Leu Asp Asn Phe Glu
                405                 410                 415     


Trp Ala Arg Gly Tyr Thr Gln Arg Phe Gly Met Val Trp Val Asp Phe
            420                 425                 430         


Ala Thr Gln Gln Arg Ile Pro Lys Ala Ser Ala Glu Trp Tyr Arg Glu
        435                 440                 445             


Ala Ile Arg His Asn Gly Phe Ala Val Glu
    450                 455             


<210> 15
<211> 2331
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 15
atgaaaattc gctcgcttct gctgctcatt tcgatcctcc tcggtgtagt ttcgcctggg     60

tttggccagt cggtgctcga cagcgatacg atctttcgcg aggtttcacg acggccgatg    120

aggcctaata acgaggcccg gatcgaagcg ctcatacgac agatgacggt cgaagaaaag    180

gtcggccaga tgacgcagct cacgatcgat atggtcacga gcggcgatga tcaggcggta    240

cagatcgaca acgcaaagct tgaaaaggcg gtcgttcaat acggcgttgg ttcgatcctc    300

aacgtcaaca atcaggcttt aacactcgat cattggcatc ggatcattgg gccgatccag    360

caagcggctc agcggactcg gctgaagatc ccggtcattt acggtgtcga ttcgatccac    420

ggagcaaact acgttcaggg ctcaacgctg tttccgcaag agctcggaat ggcgtcgact    480

tggaatccaa atttgatgcg atgggcggcg gagatcacgg cgaaggaaac gcgggcggcc    540

ggcatcccct ggagcttttc acccgttctc gacgttggcc gaaatcaagc atggccgcga    600

ttgtgggaaa cgttcggcga agatccgtat ctggcgacag tgatgggaac cgcattcgta    660

cgcggtctcg aaggcgatga tgttgcgagc ggcaagcacg tcgccgcgag cctaaagcac    720

tacgtcggat acagcatctc gacgaccggc cgcgatcgca cgcccgctgt cattccagag    780

cattaccttc gcgaatattt gctgccgccg tttgctgcgg cagtaaaggc tggagccagg    840

acggtgatga tcaactccgc cgagatcaac ggagttccgg gtcacatcaa caagcacctg    900

atgaccgatg tcctcaaggg cgaacttggc tttgacggtt tcattgtttc ggattgggac    960

gacattaaaa agctcgtttc gcaatggcgc gtcgcggccg atgaaaaaga agccacgcgg   1020

ctcgcggtga tggccggcat cgatatgagt atggtgccgc tgagctacag tttctcagat   1080

catctgatcg ccctagtgaa agaaggtaag gttccgatgt cgcggatcga cgacgcggtt   1140

cgccgcattc ttcgcgtgaa attcgaactt ggattgttca agaatgcgat gccggatccg   1200

tcgcttcgtt caaatcacgg cagccccgaa tacaccgccg tggctgccgc agcggctcgc   1260

ggatcgatca ttctcctcaa gaatcaaaac aacattcttc cactgtcgaa aacggccaag   1320

gtccttgtca ccggcccgac agctgactca atgatctcgc taaataacgg ctggacatac   1380

gtctggcagg gctcggagcc gtcgctctac ccgaaggata agccgacgat ccagaaggcg   1440

atcgaagcaa agaccggtcc ggcaaacttc aagtatgtac cgggaacgcg actcgttcgc   1500

cgagccggca gtccgtcgaa cagcaacccg actgacattg atgaggaagt ggatatcgcc   1560

gccgctgtcc aggccgcgaa gtcgagtgat gtcgtcgttc tctgcctggg tgaaggttcg   1620

tacaccgagc atcccggaag catcgcggat ctcaccttgc ccgagcttca acttcagttc   1680

gccgagcgaa tgatcgccac ggggaagccg atagtacttg tattgtcgca aggccgtcct   1740

cgtgtgatca gccgcattgc ggacagggtc gccgggatag tcctcgcgtt caatcccggc   1800

aacgaaggtg gccgtgcgtt cgctgacgtt ttattcggcg actataatcc ggatggaaag   1860

ttgccggtaa catatcctcg ctcgcccgga tatctcacga cctacgacga aaacatcttc   1920

gaacgtgtaa tggacgcacg aaaagcgctg acatttcaac cgcagttcga gtttggatac   1980

ggtttgagct atacgacctt tagctactcg aatataaaac tcgcctccga ccgaatgcag   2040

cgaaatggaa ctctgtcatt ctccgtcgat gtagccaaca ccggaaaacg gcggggaact   2100

gagacggtca tcgtctacgt ccgcgacgaa gtcgccgggc taacgccggc cgcaaagcgt   2160

gtcagaagat tcgcaaaagg aaatttggag ccgggagaat cgaagacgtt caatttcaca   2220

ctacgtcccg atgatctcgg atacttcgga ctcgacaaca aactgcagat cgagccgggc   2280

gagttcacga tcatgatcgg cgatcgaacc gccaagttca cgttggaata a            2331


<210> 16
<211> 776
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (122)...(352)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (423)...(666)
<223> Glycosyl hydrolase family 3 C terminal domain

<400> 16
Met Lys Ile Arg Ser Leu Leu Leu Leu Ile Ser Ile Leu Leu Gly Val
1               5                   10                  15      


Val Ser Pro Gly Phe Gly Gln Ser Val Leu Asp Ser Asp Thr Ile Phe
            20                  25                  30          


Arg Glu Val Ser Arg Arg Pro Met Arg Pro Asn Asn Glu Ala Arg Ile
        35                  40                  45              


Glu Ala Leu Ile Arg Gln Met Thr Val Glu Glu Lys Val Gly Gln Met
    50                  55                  60                  


Thr Gln Leu Thr Ile Asp Met Val Thr Ser Gly Asp Asp Gln Ala Val
65                  70                  75                  80  


Gln Ile Asp Asn Ala Lys Leu Glu Lys Ala Val Val Gln Tyr Gly Val
                85                  90                  95      


Gly Ser Ile Leu Asn Val Asn Asn Gln Ala Leu Thr Leu Asp His Trp
            100                 105                 110         


His Arg Ile Ile Gly Pro Ile Gln Gln Ala Ala Gln Arg Thr Arg Leu
        115                 120                 125             


Lys Ile Pro Val Ile Tyr Gly Val Asp Ser Ile His Gly Ala Asn Tyr
    130                 135                 140                 


Val Gln Gly Ser Thr Leu Phe Pro Gln Glu Leu Gly Met Ala Ser Thr
145                 150                 155                 160 


Trp Asn Pro Asn Leu Met Arg Trp Ala Ala Glu Ile Thr Ala Lys Glu
                165                 170                 175     


Thr Arg Ala Ala Gly Ile Pro Trp Ser Phe Ser Pro Val Leu Asp Val
            180                 185                 190         


Gly Arg Asn Gln Ala Trp Pro Arg Leu Trp Glu Thr Phe Gly Glu Asp
        195                 200                 205             


Pro Tyr Leu Ala Thr Val Met Gly Thr Ala Phe Val Arg Gly Leu Glu
    210                 215                 220                 


Gly Asp Asp Val Ala Ser Gly Lys His Val Ala Ala Ser Leu Lys His
225                 230                 235                 240 


Tyr Val Gly Tyr Ser Ile Ser Thr Thr Gly Arg Asp Arg Thr Pro Ala
                245                 250                 255     


Val Ile Pro Glu His Tyr Leu Arg Glu Tyr Leu Leu Pro Pro Phe Ala
            260                 265                 270         


Ala Ala Val Lys Ala Gly Ala Arg Thr Val Met Ile Asn Ser Ala Glu
        275                 280                 285             


Ile Asn Gly Val Pro Gly His Ile Asn Lys His Leu Met Thr Asp Val
    290                 295                 300                 


Leu Lys Gly Glu Leu Gly Phe Asp Gly Phe Ile Val Ser Asp Trp Asp
305                 310                 315                 320 


Asp Ile Lys Lys Leu Val Ser Gln Trp Arg Val Ala Ala Asp Glu Lys
                325                 330                 335     


Glu Ala Thr Arg Leu Ala Val Met Ala Gly Ile Asp Met Ser Met Val
            340                 345                 350         


Pro Leu Ser Tyr Ser Phe Ser Asp His Leu Ile Ala Leu Val Lys Glu
        355                 360                 365             


Gly Lys Val Pro Met Ser Arg Ile Asp Asp Ala Val Arg Arg Ile Leu
    370                 375                 380                 


Arg Val Lys Phe Glu Leu Gly Leu Phe Lys Asn Ala Met Pro Asp Pro
385                 390                 395                 400 


Ser Leu Arg Ser Asn His Gly Ser Pro Glu Tyr Thr Ala Val Ala Ala
                405                 410                 415     


Ala Ala Ala Arg Gly Ser Ile Ile Leu Leu Lys Asn Gln Asn Asn Ile
            420                 425                 430         


Leu Pro Leu Ser Lys Thr Ala Lys Val Leu Val Thr Gly Pro Thr Ala
        435                 440                 445             


Asp Ser Met Ile Ser Leu Asn Asn Gly Trp Thr Tyr Val Trp Gln Gly
    450                 455                 460                 


Ser Glu Pro Ser Leu Tyr Pro Lys Asp Lys Pro Thr Ile Gln Lys Ala
465                 470                 475                 480 


Ile Glu Ala Lys Thr Gly Pro Ala Asn Phe Lys Tyr Val Pro Gly Thr
                485                 490                 495     


Arg Leu Val Arg Arg Ala Gly Ser Pro Ser Asn Ser Asn Pro Thr Asp
            500                 505                 510         


Ile Asp Glu Glu Val Asp Ile Ala Ala Ala Val Gln Ala Ala Lys Ser
        515                 520                 525             


Ser Asp Val Val Val Leu Cys Leu Gly Glu Gly Ser Tyr Thr Glu His
    530                 535                 540                 


Pro Gly Ser Ile Ala Asp Leu Thr Leu Pro Glu Leu Gln Leu Gln Phe
545                 550                 555                 560 


Ala Glu Arg Met Ile Ala Thr Gly Lys Pro Ile Val Leu Val Leu Ser
                565                 570                 575     


Gln Gly Arg Pro Arg Val Ile Ser Arg Ile Ala Asp Arg Val Ala Gly
            580                 585                 590         


Ile Val Leu Ala Phe Asn Pro Gly Asn Glu Gly Gly Arg Ala Phe Ala
        595                 600                 605             


Asp Val Leu Phe Gly Asp Tyr Asn Pro Asp Gly Lys Leu Pro Val Thr
    610                 615                 620                 


Tyr Pro Arg Ser Pro Gly Tyr Leu Thr Thr Tyr Asp Glu Asn Ile Phe
625                 630                 635                 640 


Glu Arg Val Met Asp Ala Arg Lys Ala Leu Thr Phe Gln Pro Gln Phe
                645                 650                 655     


Glu Phe Gly Tyr Gly Leu Ser Tyr Thr Thr Phe Ser Tyr Ser Asn Ile
            660                 665                 670         


Lys Leu Ala Ser Asp Arg Met Gln Arg Asn Gly Thr Leu Ser Phe Ser
        675                 680                 685             


Val Asp Val Ala Asn Thr Gly Lys Arg Arg Gly Thr Glu Thr Val Ile
    690                 695                 700                 


Val Tyr Val Arg Asp Glu Val Ala Gly Leu Thr Pro Ala Ala Lys Arg
705                 710                 715                 720 


Val Arg Arg Phe Ala Lys Gly Asn Leu Glu Pro Gly Glu Ser Lys Thr
                725                 730                 735     


Phe Asn Phe Thr Leu Arg Pro Asp Asp Leu Gly Tyr Phe Gly Leu Asp
            740                 745                 750         


Asn Lys Leu Gln Ile Glu Pro Gly Glu Phe Thr Ile Met Ile Gly Asp
        755                 760                 765             


Arg Thr Ala Lys Phe Thr Leu Glu
    770                 775     


<210> 17
<211> 1419
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 17
atgaaacggt ttcctgacaa cttcgtgtgg ggtgctgcca cctccgccta tcagatcgag     60

ggcgcggtcc gcgaagacgg ccgtggcgag tcaatctggg atcgattcgc cgccgagccc    120

ggccgcatca gcgatggctc cgacgccagc gtcgcctgcg atcactaccg gcgcgtcggc    180

gaggacgtcg agatcctcga gtggctcggg gtccgcgcgt atcggttctc gatcgcgtgg    240

ccgcgtgtgc tgccgagcgg ccgcggcacc gtcaacacgg cgggcctcga cttctacgat    300

cgtctcgtcg atcgcttgct cgcgagcggc atcgagccat tcgtcacgct caaccattgg    360

gacctgccga ccgcgctcca cgatgaagga agctggccgt cgcgcgacac cgtcggtgcg    420

ttcgtcagct acgcagagat cgtgatgcgg cgcctcggcg atcgcgtccg ccgcgtgtgc    480

actcacaacg agccgtggtg catcagcacg ctcggcttcg gcaacggcga gcatgcgccc    540

ggcgaacggt cgtggccgcg cgcgctcgcc gctgcgcatc acttgctgct ctcgcacggc    600

ctcgccgtcc aggcgatccg cgccgtcgcg cccgccgcgc aggtcggcat cgttctcaat    660

ctggtcccga ccgagcccgc ctcgtcgagc gaagcggacc gcgacgcggc gcgcgcattc    720

gacggcagct tcaaccgctg gttcctcgat ccgctctacg gccgcggcta cccgctcgac    780

gtgatcgacg atcacattcg cgccggccac ctcgccgacg ccgagctccc gttcgtgcag    840

gacggcgacc tgcgcacgat cgcgacgcgc accgactatc tcggcatcaa ctactactcg    900

cgcgcggtga tgcgttcgac ggcggtgccc gaccacgaca acttgccgcc gtcggtgatg    960

gcctccggcg agaagaccga catgggctgg gaggtcgcgc ccagcggcct cgtcgcgatc   1020

ctgcgccgtg tccacgcgga ctacgcgccg ccgcggctct acatcaccga gaacggcgcc   1080

gcgtacggca ccgcgcccga tgcgaatggc cgcgtgcgcg atgtcgcgcg ccagcgctat   1140

ctatggagcc acttcgccgc cgcgcaccag gcgatcgcgg aaggcatccc gctcgccggc   1200

tacttcttgt ggtcgctgct cgacaacttc gagtgggccc agggctacag caaacgcttc   1260

ggcttgttct gggtcgacta cgaaacccag gcgcgcctgg cgaaagactc ggcgcacctg   1320

tgtcgccgca tcatccgcga caacgcgctc accgagctgg agcacgatct cgcggcggtg   1380

aacgagaccg agcacgacct cgcggcggtg aacgcatga                          1419


<210> 18
<211> 472
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(450)
<223> Glycosyl hydrolase family 1

<400> 18
Met Lys Arg Phe Pro Asp Asn Phe Val Trp Gly Ala Ala Thr Ser Ala
1               5                   10                  15      


Tyr Gln Ile Glu Gly Ala Val Arg Glu Asp Gly Arg Gly Glu Ser Ile
            20                  25                  30          


Trp Asp Arg Phe Ala Ala Glu Pro Gly Arg Ile Ser Asp Gly Ser Asp
        35                  40                  45              


Ala Ser Val Ala Cys Asp His Tyr Arg Arg Val Gly Glu Asp Val Glu
    50                  55                  60                  


Ile Leu Glu Trp Leu Gly Val Arg Ala Tyr Arg Phe Ser Ile Ala Trp
65                  70                  75                  80  


Pro Arg Val Leu Pro Ser Gly Arg Gly Thr Val Asn Thr Ala Gly Leu
                85                  90                  95      


Asp Phe Tyr Asp Arg Leu Val Asp Arg Leu Leu Ala Ser Gly Ile Glu
            100                 105                 110         


Pro Phe Val Thr Leu Asn His Trp Asp Leu Pro Thr Ala Leu His Asp
        115                 120                 125             


Glu Gly Ser Trp Pro Ser Arg Asp Thr Val Gly Ala Phe Val Ser Tyr
    130                 135                 140                 


Ala Glu Ile Val Met Arg Arg Leu Gly Asp Arg Val Arg Arg Val Cys
145                 150                 155                 160 


Thr His Asn Glu Pro Trp Cys Ile Ser Thr Leu Gly Phe Gly Asn Gly
                165                 170                 175     


Glu His Ala Pro Gly Glu Arg Ser Trp Pro Arg Ala Leu Ala Ala Ala
            180                 185                 190         


His His Leu Leu Leu Ser His Gly Leu Ala Val Gln Ala Ile Arg Ala
        195                 200                 205             


Val Ala Pro Ala Ala Gln Val Gly Ile Val Leu Asn Leu Val Pro Thr
    210                 215                 220                 


Glu Pro Ala Ser Ser Ser Glu Ala Asp Arg Asp Ala Ala Arg Ala Phe
225                 230                 235                 240 


Asp Gly Ser Phe Asn Arg Trp Phe Leu Asp Pro Leu Tyr Gly Arg Gly
                245                 250                 255     


Tyr Pro Leu Asp Val Ile Asp Asp His Ile Arg Ala Gly His Leu Ala
            260                 265                 270         


Asp Ala Glu Leu Pro Phe Val Gln Asp Gly Asp Leu Arg Thr Ile Ala
        275                 280                 285             


Thr Arg Thr Asp Tyr Leu Gly Ile Asn Tyr Tyr Ser Arg Ala Val Met
    290                 295                 300                 


Arg Ser Thr Ala Val Pro Asp His Asp Asn Leu Pro Pro Ser Val Met
305                 310                 315                 320 


Ala Ser Gly Glu Lys Thr Asp Met Gly Trp Glu Val Ala Pro Ser Gly
                325                 330                 335     


Leu Val Ala Ile Leu Arg Arg Val His Ala Asp Tyr Ala Pro Pro Arg
            340                 345                 350         


Leu Tyr Ile Thr Glu Asn Gly Ala Ala Tyr Gly Thr Ala Pro Asp Ala
        355                 360                 365             


Asn Gly Arg Val Arg Asp Val Ala Arg Gln Arg Tyr Leu Trp Ser His
    370                 375                 380                 


Phe Ala Ala Ala His Gln Ala Ile Ala Glu Gly Ile Pro Leu Ala Gly
385                 390                 395                 400 


Tyr Phe Leu Trp Ser Leu Leu Asp Asn Phe Glu Trp Ala Gln Gly Tyr
                405                 410                 415     


Ser Lys Arg Phe Gly Leu Phe Trp Val Asp Tyr Glu Thr Gln Ala Arg
            420                 425                 430         


Leu Ala Lys Asp Ser Ala His Leu Cys Arg Arg Ile Ile Arg Asp Asn
        435                 440                 445             


Ala Leu Thr Glu Leu Glu His Asp Leu Ala Ala Val Asn Glu Thr Glu
    450                 455                 460                 


His Asp Leu Ala Ala Val Asn Ala
465                 470         


<210> 19
<211> 1374
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 19
atgttgcgat atctttccat cgttgccgcc acggcaattc tgaccggagt tgaagctcag     60

caatcagtct ggggacaatg tggcggccaa ggctggtctg gcgcgacttc atgcgccgcc    120

ggttctacgt gcagcactct aaacccttac tacgcacaat gtatccctgg taccgctact    180

tcaactacat tggtgaaaac aacgtcttct accagcgtcg gaacgacatc gccgccaaca    240

acaaccacga cgaaagctag taccactgct actaccactg ccgctgcatc cggaaaccct    300

ttctctggtt accagcttta tgccaatccg tactattctt cagaagtaca cactcttgcc    360

atcccatctt tgactggctc gctcgctgct gctgctacca aagctgccga gatcccctca    420

tttgtctggc ttgacacggc agccaaagtg cctacaatgg gcacctactt ggccaacatt    480

gaggctgcaa acaaggctgg cgccagccca cctattgccg gtatcttcgt tgtctatgac    540

ctgcctgacc gtgactgtgc agctgctgca agtaatggcg aatacactgt agcaaacaac    600

ggtgttgcaa actacaaggc ttacatcgac agcattgtgg cacagttgaa agcttatccc    660

gatgtgcaca caatccttat cattgagcct gatagtctcg ccaacatggt caccaatctg    720

tctacagcca agtgtgctga ggctcaatct gcatactatg agtgcgtcaa ctacgcattg    780

atcaacctca acttggccaa cgtggccatg tacattgatg ctggtcatgc tggttggctc    840

ggatggtctg cgaatctttc accagcggct caactcttcg caacagtcta taagaatgca    900

agtgcccctg catctcttcg tggattggcc accaacgttg ccaactacaa cgcttggtcg    960

atcagcagcc caccctcata cacatctggc gactccaact acgacgaaaa gctctacatc   1020

aacgctttgt ctcctctcct gacatctaac ggctggccta acgctcactt catcatggat   1080

acttcccgaa acggtgttca accgactaag cagcaggcat ggggtgactg gtgcaatgtg   1140

atcggaaccg gcttcggtgt tcaaccgaca acaaatactg gtgacccact tgaggatgcc   1200

tttgtctggg tcaagccagg tggtgaaagt gatggtacat caaacagttc cgctactcgt   1260

tacgatttcc attgcggcta cagtgatgca cttcaacccg cccccgaggc tgggacttgg   1320

ttccaagcat actttgtcca gcttttgaca aatgccaacc cagctttggt ctag         1374


<210> 20
<211> 457
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (23)...(51)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (108)...(423)
<223> Glycosyl hydrolases family 6

<400> 20
Met Leu Arg Tyr Leu Ser Ile Val Ala Ala Thr Ala Ile Leu Thr Gly
1               5                   10                  15      


Val Glu Ala Gln Gln Ser Val Trp Gly Gln Cys Gly Gly Gln Gly Trp
            20                  25                  30          


Ser Gly Ala Thr Ser Cys Ala Ala Gly Ser Thr Cys Ser Thr Leu Asn
        35                  40                  45              


Pro Tyr Tyr Ala Gln Cys Ile Pro Gly Thr Ala Thr Ser Thr Thr Leu
    50                  55                  60                  


Val Lys Thr Thr Ser Ser Thr Ser Val Gly Thr Thr Ser Pro Pro Thr
65                  70                  75                  80  


Thr Thr Thr Thr Lys Ala Ser Thr Thr Ala Thr Thr Thr Ala Ala Ala
                85                  90                  95      


Ser Gly Asn Pro Phe Ser Gly Tyr Gln Leu Tyr Ala Asn Pro Tyr Tyr
            100                 105                 110         


Ser Ser Glu Val His Thr Leu Ala Ile Pro Ser Leu Thr Gly Ser Leu
        115                 120                 125             


Ala Ala Ala Ala Thr Lys Ala Ala Glu Ile Pro Ser Phe Val Trp Leu
    130                 135                 140                 


Asp Thr Ala Ala Lys Val Pro Thr Met Gly Thr Tyr Leu Ala Asn Ile
145                 150                 155                 160 


Glu Ala Ala Asn Lys Ala Gly Ala Ser Pro Pro Ile Ala Gly Ile Phe
                165                 170                 175     


Val Val Tyr Asp Leu Pro Asp Arg Asp Cys Ala Ala Ala Ala Ser Asn
            180                 185                 190         


Gly Glu Tyr Thr Val Ala Asn Asn Gly Val Ala Asn Tyr Lys Ala Tyr
        195                 200                 205             


Ile Asp Ser Ile Val Ala Gln Leu Lys Ala Tyr Pro Asp Val His Thr
    210                 215                 220                 


Ile Leu Ile Ile Glu Pro Asp Ser Leu Ala Asn Met Val Thr Asn Leu
225                 230                 235                 240 


Ser Thr Ala Lys Cys Ala Glu Ala Gln Ser Ala Tyr Tyr Glu Cys Val
                245                 250                 255     


Asn Tyr Ala Leu Ile Asn Leu Asn Leu Ala Asn Val Ala Met Tyr Ile
            260                 265                 270         


Asp Ala Gly His Ala Gly Trp Leu Gly Trp Ser Ala Asn Leu Ser Pro
        275                 280                 285             


Ala Ala Gln Leu Phe Ala Thr Val Tyr Lys Asn Ala Ser Ala Pro Ala
    290                 295                 300                 


Ser Leu Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Ala Trp Ser
305                 310                 315                 320 


Ile Ser Ser Pro Pro Ser Tyr Thr Ser Gly Asp Ser Asn Tyr Asp Glu
                325                 330                 335     


Lys Leu Tyr Ile Asn Ala Leu Ser Pro Leu Leu Thr Ser Asn Gly Trp
            340                 345                 350         


Pro Asn Ala His Phe Ile Met Asp Thr Ser Arg Asn Gly Val Gln Pro
        355                 360                 365             


Thr Lys Gln Gln Ala Trp Gly Asp Trp Cys Asn Val Ile Gly Thr Gly
    370                 375                 380                 


Phe Gly Val Gln Pro Thr Thr Asn Thr Gly Asp Pro Leu Glu Asp Ala
385                 390                 395                 400 


Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr Ser Asn Ser
                405                 410                 415     


Ser Ala Thr Arg Tyr Asp Phe His Cys Gly Tyr Ser Asp Ala Leu Gln
            420                 425                 430         


Pro Ala Pro Glu Ala Gly Thr Trp Phe Gln Ala Tyr Phe Val Gln Leu
        435                 440                 445             


Leu Thr Asn Ala Asn Pro Ala Leu Val
    450                 455         


<210> 21
<211> 1338
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 21
atgcttaccc ttgctttcct ctccctgctg gctgcggcca atgcgcagaa ggctggcacg     60

ctccagtcgg agactcaccc gcgtatgacc tggtccaggt gctccgccgg aggcagctgc    120

accaccacca acggcgaggt cgtcatcgat gccaactggc gatggcttca tactgtcagc    180

ggttcgcaga actgctacga cggcaacaag tggaccagcg cctgctcctc cgagtccgac    240

tgcagccaga actgcgccgt cgagggtgcc gactactcgg gcacctacgg cgcgtccacg    300

agcggcaacg ccctgacgct caagttcgtc accacccacc agtacggcaa gaacatcggc    360

tcccgcctgt acctcatggc cagccagagc cagtatcaga tgttcaccct gctcaacaac    420

gagctggctt ttgatgtcga cctgtcccag atcgagtgcg gtctcaatgc tgccctctac    480

tttgttgcca tggacgccga tggcggcatg tcccgtaact cggccaacac ggccggcgct    540

aagtttggta ctggatactg tgatgctcag tgtgctcgtg atctcaagtt tgttggtggc    600

aaggcgaaca gtgacggctg gaagccctcc gacaacgatg ccaatgctgg cgttggaaag    660

tacggtgcct gctgcgccga gattgatatc tgggagtcca acgctcactc ctttgccctg    720

actcctcacc cttgcaccga gaacacctac cacgtctgca ctgactccaa ctgcggtgga    780

acttattcgg acgaccgatt cgccggaaag tgcgacgcca acggctgcga ttacaaccct    840

taccgcatgg gtaataccga cttctacggc aagggcatga ctgttgatac caccaagaag    900

ttcaccgtcg tgacctcgtt ccagcgcaac aacatgactc agtacctcgt ccagaacggc    960

cgcaagttcc tcatccccgc ccccacggat accagcatct cggacagcag cagcatcacc   1020

cccgagttct gtgacaatgt cttcaccgcc ttcgacgacc gcaaccgctt cgaggaggtc   1080

ggtggctggg accagctcaa cgccgctctg tccattccca tggtgctcgt catgtccatc   1140

tgggacgacc actacgccaa catgctctgg ctcgactcca tctacccccc tgagaagaag   1200

ggccagcccg gcgctgcccg tggtcggtgc cccgagaact ctggcgttcc ctctgaggtt   1260

gagtctcagt acgccaacgc ccaagttgtc tggtccaaca tccgctgggg ccccgtcggc   1320

tcgactgtcc gtctctga                                                 1338


<210> 22
<211> 445
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (17)...(445)
<223> Glycosyl hydrolase family 7

<400> 22
Met Leu Thr Leu Ala Phe Leu Ser Leu Leu Ala Ala Ala Asn Ala Gln
1               5                   10                  15      


Lys Ala Gly Thr Leu Gln Ser Glu Thr His Pro Arg Met Thr Trp Ser
            20                  25                  30          


Arg Cys Ser Ala Gly Gly Ser Cys Thr Thr Thr Asn Gly Glu Val Val
        35                  40                  45              


Ile Asp Ala Asn Trp Arg Trp Leu His Thr Val Ser Gly Ser Gln Asn
    50                  55                  60                  


Cys Tyr Asp Gly Asn Lys Trp Thr Ser Ala Cys Ser Ser Glu Ser Asp
65                  70                  75                  80  


Cys Ser Gln Asn Cys Ala Val Glu Gly Ala Asp Tyr Ser Gly Thr Tyr
                85                  90                  95      


Gly Ala Ser Thr Ser Gly Asn Ala Leu Thr Leu Lys Phe Val Thr Thr
            100                 105                 110         


His Gln Tyr Gly Lys Asn Ile Gly Ser Arg Leu Tyr Leu Met Ala Ser
        115                 120                 125             


Gln Ser Gln Tyr Gln Met Phe Thr Leu Leu Asn Asn Glu Leu Ala Phe
    130                 135                 140                 


Asp Val Asp Leu Ser Gln Ile Glu Cys Gly Leu Asn Ala Ala Leu Tyr
145                 150                 155                 160 


Phe Val Ala Met Asp Ala Asp Gly Gly Met Ser Arg Asn Ser Ala Asn
                165                 170                 175     


Thr Ala Gly Ala Lys Phe Gly Thr Gly Tyr Cys Asp Ala Gln Cys Ala
            180                 185                 190         


Arg Asp Leu Lys Phe Val Gly Gly Lys Ala Asn Ser Asp Gly Trp Lys
        195                 200                 205             


Pro Ser Asp Asn Asp Ala Asn Ala Gly Val Gly Lys Tyr Gly Ala Cys
    210                 215                 220                 


Cys Ala Glu Ile Asp Ile Trp Glu Ser Asn Ala His Ser Phe Ala Leu
225                 230                 235                 240 


Thr Pro His Pro Cys Thr Glu Asn Thr Tyr His Val Cys Thr Asp Ser
                245                 250                 255     


Asn Cys Gly Gly Thr Tyr Ser Asp Asp Arg Phe Ala Gly Lys Cys Asp
            260                 265                 270         


Ala Asn Gly Cys Asp Tyr Asn Pro Tyr Arg Met Gly Asn Thr Asp Phe
        275                 280                 285             


Tyr Gly Lys Gly Met Thr Val Asp Thr Thr Lys Lys Phe Thr Val Val
    290                 295                 300                 


Thr Ser Phe Gln Arg Asn Asn Met Thr Gln Tyr Leu Val Gln Asn Gly
305                 310                 315                 320 


Arg Lys Phe Leu Ile Pro Ala Pro Thr Asp Thr Ser Ile Ser Asp Ser
                325                 330                 335     


Ser Ser Ile Thr Pro Glu Phe Cys Asp Asn Val Phe Thr Ala Phe Asp
            340                 345                 350         


Asp Arg Asn Arg Phe Glu Glu Val Gly Gly Trp Asp Gln Leu Asn Ala
        355                 360                 365             


Ala Leu Ser Ile Pro Met Val Leu Val Met Ser Ile Trp Asp Asp His
    370                 375                 380                 


Tyr Ala Asn Met Leu Trp Leu Asp Ser Ile Tyr Pro Pro Glu Lys Lys
385                 390                 395                 400 


Gly Gln Pro Gly Ala Ala Arg Gly Arg Cys Pro Glu Asn Ser Gly Val
                405                 410                 415     


Pro Ser Glu Val Glu Ser Gln Tyr Ala Asn Ala Gln Val Val Trp Ser
            420                 425                 430         


Asn Ile Arg Trp Gly Pro Val Gly Ser Thr Val Arg Leu
        435                 440                 445 


<210> 23
<211> 2289
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 23
atgaaagata ttcagtccct tattgcccaa atgactcttg aagaaaaggc tgcgctatgc     60

actggtgcca gcccctggac atcagtcccg gtggaaaggc tcggcatccc ggagatgatc    120

gtttccgacg gcccgcatgg tgtgcgccgc gtcccggatg tcaacgcgat tgcagtcaaa    180

agtctgcctg cgacctgctt ccccacggcc tcctgcctgg catccacctg ggatgtggat    240

ttgatccgga agatggggga agctttgggt gaggaatgca tcgctctgaa tgtggatgtg    300

ctgcttggcc cgggtgcaaa tatgaaacgc tcgccgctgg gcggtcgcaa cttcgaatat    360

ttttctgagg atccctatct ggccggcgaa atggctgcca gcatcatcaa tggcatccag    420

agcaagggtg tgggtacgtc actcaagcat tatgccgcca acaaccagga attccagcgc    480

ttcagcatta gcgccgaagt ggacgaacgt accctgcggg agatctacct gcccgccttt    540

gaaaaagccg tcaagcaggc ccagccgtgg acggtgatgt gctcgtacaa caaggtcaac    600

ggcacgtttg catcggagca ttaccatctc ctgactgaaa tcctgaagca ggaatggggc    660

ttcgagggcc tggtcgtctc ggactggggc gcggtgcgcg accgtgtggc cgcgctgaaa    720

ggcgggctgg attgggaaat gcccggtccc caggaacgcc gcgtcaaggc cgtcgtggaa    780

gccgtccgct ccggccagct cgacgaagcc attctggacg agtccgtccg ccgcatcctg    840

cgcatcatct tcatggcccg agaaacaccc aaaaacggat ccttcgacgt cgacgctcat    900

catgagctgg cccacaagat tgccagcgaa ggcatggtaa tgctgaagaa caatggcatc    960

ctgcctctca aggaccagca gcacattgca gtgatcggcc atgcagccga gaacgctcac   1020

ttccagggcg gcggcagctc ccacatcaac ccgaccaggg tggccgtccc gttcaaggag   1080

ctgcaagctc aggcggggaa tgccgagctg acctatgcgg agggctatcc gaccgacaaa   1140

tccttccgcc aggacatgat cgatcaggca gtgaagctcg cccaagctgc cgatgtggcc   1200

gtactgtata tcgccctgcc gaccttcaag gaatccgaag gttatgaccg cccggatctg   1260

gacctgaccg atcagcagat cgcgctgatc aaggctgtcg ccaaggtcca gcctaacacg   1320

gtcgtcgttc tgaacaacgg tgcgccggtg gcgatgagcg cctggatcga tgacgtggcc   1380

gccgtgctcg aaagctggat gatgggacag gcaggcggtg cggcgattgc ggatgtcctc   1440

tttggcaggg tcaacccatc cggcaaactg gccgagacct tcccactcaa actcgccgat   1500

acgcccgcgc atctcaactg gccgggcggc gccggagaag tccgctatgg cgaagggctg   1560

ttcatcggct atcgctacta tgatgccaaa gaaatgccgg tcctgttccc ctttggacac   1620

gggttaagtt acaccacgtt ctcctatagc aatgccaaag cgtcggcgaa aaccttcaag   1680

gatgtggacg gactgacagt ctcagtagat gttaccaata caggcagcgt agcgggcaag   1740

gaaactatcc aggtgtacgt ccacgatcag aagtccgggc tggtgcgacc gcaaaaggaa   1800

ctgaaaggct tcgcgaaagt ggaacttcag ccgggcgaga ccaagaccgt ctcgatcgac   1860

ttggatttcc gtgcctttgc ctactatcac ccggaataca aacagtggat tactgaggac   1920

ggagagttcg acattctgat cggcgcctcc tccgcggata ttcgctgcag gctgactgtg   1980

gcactcgaat cgaccctgga cctgccctgc attctcgaca aggaatccac catccgtgag   2040

tggctggccg acccgcgtgg caaagccatc ctggagcccg aatacgccct gatcgaaacc   2100

cggggccgca gaattctcgg cggggaatcg gagcgttatg gcaatgatgg cgccctgggc   2160

atggatgtca tggatatgtt caatgacatg ccgctggtca gtgtgctgat gttccagcag   2220

ggcgagctgc ccatgccggc ggaagaaatc gtggatggat tactggcccg ggttcatagc   2280

aagagctaa                                                           2289


<210> 24
<211> 762
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (312)...(524)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> DOMAIN
<222> (28)...(248)
<223> Glycosyl hydrolase family 3 N terminal domain

<400> 24
Met Lys Asp Ile Gln Ser Leu Ile Ala Gln Met Thr Leu Glu Glu Lys
1               5                   10                  15      


Ala Ala Leu Cys Thr Gly Ala Ser Pro Trp Thr Ser Val Pro Val Glu
            20                  25                  30          


Arg Leu Gly Ile Pro Glu Met Ile Val Ser Asp Gly Pro His Gly Val
        35                  40                  45              


Arg Arg Val Pro Asp Val Asn Ala Ile Ala Val Lys Ser Leu Pro Ala
    50                  55                  60                  


Thr Cys Phe Pro Thr Ala Ser Cys Leu Ala Ser Thr Trp Asp Val Asp
65                  70                  75                  80  


Leu Ile Arg Lys Met Gly Glu Ala Leu Gly Glu Glu Cys Ile Ala Leu
                85                  90                  95      


Asn Val Asp Val Leu Leu Gly Pro Gly Ala Asn Met Lys Arg Ser Pro
            100                 105                 110         


Leu Gly Gly Arg Asn Phe Glu Tyr Phe Ser Glu Asp Pro Tyr Leu Ala
        115                 120                 125             


Gly Glu Met Ala Ala Ser Ile Ile Asn Gly Ile Gln Ser Lys Gly Val
    130                 135                 140                 


Gly Thr Ser Leu Lys His Tyr Ala Ala Asn Asn Gln Glu Phe Gln Arg
145                 150                 155                 160 


Phe Ser Ile Ser Ala Glu Val Asp Glu Arg Thr Leu Arg Glu Ile Tyr
                165                 170                 175     


Leu Pro Ala Phe Glu Lys Ala Val Lys Gln Ala Gln Pro Trp Thr Val
            180                 185                 190         


Met Cys Ser Tyr Asn Lys Val Asn Gly Thr Phe Ala Ser Glu His Tyr
        195                 200                 205             


His Leu Leu Thr Glu Ile Leu Lys Gln Glu Trp Gly Phe Glu Gly Leu
    210                 215                 220                 


Val Val Ser Asp Trp Gly Ala Val Arg Asp Arg Val Ala Ala Leu Lys
225                 230                 235                 240 


Gly Gly Leu Asp Trp Glu Met Pro Gly Pro Gln Glu Arg Arg Val Lys
                245                 250                 255     


Ala Val Val Glu Ala Val Arg Ser Gly Gln Leu Asp Glu Ala Ile Leu
            260                 265                 270         


Asp Glu Ser Val Arg Arg Ile Leu Arg Ile Ile Phe Met Ala Arg Glu
        275                 280                 285             


Thr Pro Lys Asn Gly Ser Phe Asp Val Asp Ala His His Glu Leu Ala
    290                 295                 300                 


His Lys Ile Ala Ser Glu Gly Met Val Met Leu Lys Asn Asn Gly Ile
305                 310                 315                 320 


Leu Pro Leu Lys Asp Gln Gln His Ile Ala Val Ile Gly His Ala Ala
                325                 330                 335     


Glu Asn Ala His Phe Gln Gly Gly Gly Ser Ser His Ile Asn Pro Thr
            340                 345                 350         


Arg Val Ala Val Pro Phe Lys Glu Leu Gln Ala Gln Ala Gly Asn Ala
        355                 360                 365             


Glu Leu Thr Tyr Ala Glu Gly Tyr Pro Thr Asp Lys Ser Phe Arg Gln
    370                 375                 380                 


Asp Met Ile Asp Gln Ala Val Lys Leu Ala Gln Ala Ala Asp Val Ala
385                 390                 395                 400 


Val Leu Tyr Ile Ala Leu Pro Thr Phe Lys Glu Ser Glu Gly Tyr Asp
                405                 410                 415     


Arg Pro Asp Leu Asp Leu Thr Asp Gln Gln Ile Ala Leu Ile Lys Ala
            420                 425                 430         


Val Ala Lys Val Gln Pro Asn Thr Val Val Val Leu Asn Asn Gly Ala
        435                 440                 445             


Pro Val Ala Met Ser Ala Trp Ile Asp Asp Val Ala Ala Val Leu Glu
    450                 455                 460                 


Ser Trp Met Met Gly Gln Ala Gly Gly Ala Ala Ile Ala Asp Val Leu
465                 470                 475                 480 


Phe Gly Arg Val Asn Pro Ser Gly Lys Leu Ala Glu Thr Phe Pro Leu
                485                 490                 495     


Lys Leu Ala Asp Thr Pro Ala His Leu Asn Trp Pro Gly Gly Ala Gly
            500                 505                 510         


Glu Val Arg Tyr Gly Glu Gly Leu Phe Ile Gly Tyr Arg Tyr Tyr Asp
        515                 520                 525             


Ala Lys Glu Met Pro Val Leu Phe Pro Phe Gly His Gly Leu Ser Tyr
    530                 535                 540                 


Thr Thr Phe Ser Tyr Ser Asn Ala Lys Ala Ser Ala Lys Thr Phe Lys
545                 550                 555                 560 


Asp Val Asp Gly Leu Thr Val Ser Val Asp Val Thr Asn Thr Gly Ser
                565                 570                 575     


Val Ala Gly Lys Glu Thr Ile Gln Val Tyr Val His Asp Gln Lys Ser
            580                 585                 590         


Gly Leu Val Arg Pro Gln Lys Glu Leu Lys Gly Phe Ala Lys Val Glu
        595                 600                 605             


Leu Gln Pro Gly Glu Thr Lys Thr Val Ser Ile Asp Leu Asp Phe Arg
    610                 615                 620                 


Ala Phe Ala Tyr Tyr His Pro Glu Tyr Lys Gln Trp Ile Thr Glu Asp
625                 630                 635                 640 


Gly Glu Phe Asp Ile Leu Ile Gly Ala Ser Ser Ala Asp Ile Arg Cys
                645                 650                 655     


Arg Leu Thr Val Ala Leu Glu Ser Thr Leu Asp Leu Pro Cys Ile Leu
            660                 665                 670         


Asp Lys Glu Ser Thr Ile Arg Glu Trp Leu Ala Asp Pro Arg Gly Lys
        675                 680                 685             


Ala Ile Leu Glu Pro Glu Tyr Ala Leu Ile Glu Thr Arg Gly Arg Arg
    690                 695                 700                 


Ile Leu Gly Gly Glu Ser Glu Arg Tyr Gly Asn Asp Gly Ala Leu Gly
705                 710                 715                 720 


Met Asp Val Met Asp Met Phe Asn Asp Met Pro Leu Val Ser Val Leu
                725                 730                 735     


Met Phe Gln Gln Gly Glu Leu Pro Met Pro Ala Glu Glu Ile Val Asp
            740                 745                 750         


Gly Leu Leu Ala Arg Val His Ser Lys Ser
        755                 760         


<210> 25
<211> 1581
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 25
atgttctcca agactgctct tctgtcttcc atcttcgccg ctgcggcaac ggcccagcag     60

gtcggcaccc ttaccaccga gacccatcct tcactgccaa tgcagagctg cactgctgct    120

ggatcctgta cgactatcaa tacggcggtg accttggatg ccaactggag atggctccac    180

acaaccagcg gctataccaa ctgctacact ggcaacgctt ggaacaccac gctttgcccc    240

gatggcaaga cgtgtgctgc taactgtgcc ctcgacggcg ccgattatcc tggcacatac    300

ggtgtgactg ctagtggcaa tgctctcaag ctcaacttcg tgaccaacgg acagtactcc    360

aagaacattg gctcgcggtt gtacctcatg gcctcggatt ccaagtacca gatgttcaag    420

ctattgaacc gggagttcac attcgacgtc gatgtctcca acctgccatg tggtctgaac    480

ggggcccttt accttgtcga gatggacgag gatggcggaa tggccaagta ttccaccaac    540

aaggctggtg ccaagtacgg aactggctat tgtgataccc aatgcccgca cgacatcaag    600

ttcatcaacg gcgaggctaa tgtggacgga tggacaccga gctccaacga tgccaacgct    660

ggaaccggta cttacggatc ttgctgccac gaaatggata tctgggaggc gaattccatc    720

tcagccgcct acactcctca cgtatgcagc aaggatggcc aagtgcgctg ctccggcatt    780

gactgcggag acggcgacaa ccgctacaag ggtatctgtg acaaggatgg ctgcgatttc    840

aacagctacc gcatgggtga taaatccttc tacggcaagg gcctgaccgt cgacacgtct    900

agcaagttca ctgttgtcac tcagttcatc accaatgacg gcaccgacac gggcacgctc    960

tccgaaattc gccgtatcta cgtccagaat ggcaaggtca tccagaacag caataccaag   1020

attactggcg ttaccaccac caactccatc actgacaagt tctgcacgga gcagaagacc   1080

accttcggcg ataccaacac cttcagctcc atgggcggcc taaagacgat gggtggaccc   1140

cttggccgtg gtatggtcct cgccctctcc gtctgggacg accactcagt caacatgctc   1200

tggcttgact ccacctaccc caccgacaag agtgccgaca ctcccggcgt cggtcgcggt   1260

acctgcgcaa tcacttccgg tgttccgtct gacatcgaaa cctcggctgc cagctccagc   1320

gtcacctact cgaatatcaa ggttggaaag ctgaactcga cctacaccgg cacctctact   1380

aacccctcca acccatcttc ctctgccaag ccagcctcgt cttccacttc cgcaactggc   1440

tccaccccat ctaacccatc tactggcgga accgtcgcta agtggggaca gtgcggtggc   1500

atcggatact ctggcgccac aacttgcgtc tctggaagca cctgccacaa gatcaacgac   1560

tactacagcc agtgctacta a                                             1581


<210> 26
<211> 526
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (20)...(457)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (494)...(522)
<223> Fungal cellulose binding domain

<400> 26
Met Phe Ser Lys Thr Ala Leu Leu Ser Ser Ile Phe Ala Ala Ala Ala
1               5                   10                  15      


Thr Ala Gln Gln Val Gly Thr Leu Thr Thr Glu Thr His Pro Ser Leu
            20                  25                  30          


Pro Met Gln Ser Cys Thr Ala Ala Gly Ser Cys Thr Thr Ile Asn Thr
        35                  40                  45              


Ala Val Thr Leu Asp Ala Asn Trp Arg Trp Leu His Thr Thr Ser Gly
    50                  55                  60                  


Tyr Thr Asn Cys Tyr Thr Gly Asn Ala Trp Asn Thr Thr Leu Cys Pro
65                  70                  75                  80  


Asp Gly Lys Thr Cys Ala Ala Asn Cys Ala Leu Asp Gly Ala Asp Tyr
                85                  90                  95      


Pro Gly Thr Tyr Gly Val Thr Ala Ser Gly Asn Ala Leu Lys Leu Asn
            100                 105                 110         


Phe Val Thr Asn Gly Gln Tyr Ser Lys Asn Ile Gly Ser Arg Leu Tyr
        115                 120                 125             


Leu Met Ala Ser Asp Ser Lys Tyr Gln Met Phe Lys Leu Leu Asn Arg
    130                 135                 140                 


Glu Phe Thr Phe Asp Val Asp Val Ser Asn Leu Pro Cys Gly Leu Asn
145                 150                 155                 160 


Gly Ala Leu Tyr Leu Val Glu Met Asp Glu Asp Gly Gly Met Ala Lys
                165                 170                 175     


Tyr Ser Thr Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp
            180                 185                 190         


Thr Gln Cys Pro His Asp Ile Lys Phe Ile Asn Gly Glu Ala Asn Val
        195                 200                 205             


Asp Gly Trp Thr Pro Ser Ser Asn Asp Ala Asn Ala Gly Thr Gly Thr
    210                 215                 220                 


Tyr Gly Ser Cys Cys His Glu Met Asp Ile Trp Glu Ala Asn Ser Ile
225                 230                 235                 240 


Ser Ala Ala Tyr Thr Pro His Val Cys Ser Lys Asp Gly Gln Val Arg
                245                 250                 255     


Cys Ser Gly Ile Asp Cys Gly Asp Gly Asp Asn Arg Tyr Lys Gly Ile
            260                 265                 270         


Cys Asp Lys Asp Gly Cys Asp Phe Asn Ser Tyr Arg Met Gly Asp Lys
        275                 280                 285             


Ser Phe Tyr Gly Lys Gly Leu Thr Val Asp Thr Ser Ser Lys Phe Thr
    290                 295                 300                 


Val Val Thr Gln Phe Ile Thr Asn Asp Gly Thr Asp Thr Gly Thr Leu
305                 310                 315                 320 


Ser Glu Ile Arg Arg Ile Tyr Val Gln Asn Gly Lys Val Ile Gln Asn
                325                 330                 335     


Ser Asn Thr Lys Ile Thr Gly Val Thr Thr Thr Asn Ser Ile Thr Asp
            340                 345                 350         


Lys Phe Cys Thr Glu Gln Lys Thr Thr Phe Gly Asp Thr Asn Thr Phe
        355                 360                 365             


Ser Ser Met Gly Gly Leu Lys Thr Met Gly Gly Pro Leu Gly Arg Gly
    370                 375                 380                 


Met Val Leu Ala Leu Ser Val Trp Asp Asp His Ser Val Asn Met Leu
385                 390                 395                 400 


Trp Leu Asp Ser Thr Tyr Pro Thr Asp Lys Ser Ala Asp Thr Pro Gly
                405                 410                 415     


Val Gly Arg Gly Thr Cys Ala Ile Thr Ser Gly Val Pro Ser Asp Ile
            420                 425                 430         


Glu Thr Ser Ala Ala Ser Ser Ser Val Thr Tyr Ser Asn Ile Lys Val
        435                 440                 445             


Gly Lys Leu Asn Ser Thr Tyr Thr Gly Thr Ser Thr Asn Pro Ser Asn
    450                 455                 460                 


Pro Ser Ser Ser Ala Lys Pro Ala Ser Ser Ser Thr Ser Ala Thr Gly
465                 470                 475                 480 


Ser Thr Pro Ser Asn Pro Ser Thr Gly Gly Thr Val Ala Lys Trp Gly
                485                 490                 495     


Gln Cys Gly Gly Ile Gly Tyr Ser Gly Ala Thr Thr Cys Val Ser Gly
            500                 505                 510         


Ser Thr Cys His Lys Ile Asn Asp Tyr Tyr Ser Gln Cys Tyr
        515                 520                 525     


<210> 27
<211> 1188
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 27
atgctcctct cagccgccac tctgatcgca tttgccgcgg gcgccattgg cgccccggct     60

tccaccatcg acacagtggc tccccgtcag gcacctggag cctgctcgag ccctgtccag    120

ctggacgcca agaccaacgt ctggacttcg tacacgctcc accccaacag cttctacagg    180

gccgaggttg aggcagcagc cgcttccatg tcctccagcg atgctgccag ggctctcaag    240

gtcgcagaca ttggcacttt cctctgggct gacaccatcg ccaacattga ccgcgttgag    300

ccggcgcttc aagatgttcc ttgcaatcat atctttggtc tggttgtcta tgatctgcct    360

ggccgcgact gcgctgctaa ggcgtccaac ggagagctcc cggttggtgc gatcaaccga    420

tacaagactg agtacatcga caagcttgcc gctctcatca agaagtactc caacactgct    480

tttgctctcg ttattgagcc cgattcgctg cctaacctgg tcaccaactc caatgtcgct    540

gcctgccaac aatctgccgc tgggtaccgt gagggtgttg cttatgccct caagaccctc    600

aacctcccca acgttgtcca gtacatcgat gctggtcacg gcggatggct tggctggaac    660

gataacctcc agcctggtgc tagggagctc gccaatgcgt acaagaacgc cggcagcccc    720

tcccagttcc gtggctttgc caccaacgtc gccggctgga accaatggga tgccgagccc    780

ggtgagttcg ctggcgcgtc tgatgcccaa tggaacaaag cccagaacga gaagaagtat    840

gttgagctct ttggcgccgc cctttccagc gccggcatgc caaaccatgc cattgttgat    900

accggacgga gcggcaagcc cggcggacgc aaggaatggg gtgactggtg caacgtcgtc    960

aactctggat ttggccgacg gcccagctcc tcgactggct ctaccctcac tgatgccttt   1020

gtctgggtca agcctggcgg tgaatcggat ggcactagcg atacctctgc tgctcgttat   1080

gattctttct gtggcaagga tgacgccttt aagccttctc ctgaagcggg ccagtggaac   1140

caggccttct tcgagcagct gcttaacaat gccagcccag cattctga                1188


<210> 28
<211> 395
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (55)...(362)
<223> Glycosyl hydrolases family 6

<400> 28
Met Leu Leu Ser Ala Ala Thr Leu Ile Ala Phe Ala Ala Gly Ala Ile
1               5                   10                  15      


Gly Ala Pro Ala Ser Thr Ile Asp Thr Val Ala Pro Arg Gln Ala Pro
            20                  25                  30          


Gly Ala Cys Ser Ser Pro Val Gln Leu Asp Ala Lys Thr Asn Val Trp
        35                  40                  45              


Thr Ser Tyr Thr Leu His Pro Asn Ser Phe Tyr Arg Ala Glu Val Glu
    50                  55                  60                  


Ala Ala Ala Ala Ser Met Ser Ser Ser Asp Ala Ala Arg Ala Leu Lys
65                  70                  75                  80  


Val Ala Asp Ile Gly Thr Phe Leu Trp Ala Asp Thr Ile Ala Asn Ile
                85                  90                  95      


Asp Arg Val Glu Pro Ala Leu Gln Asp Val Pro Cys Asn His Ile Phe
            100                 105                 110         


Gly Leu Val Val Tyr Asp Leu Pro Gly Arg Asp Cys Ala Ala Lys Ala
        115                 120                 125             


Ser Asn Gly Glu Leu Pro Val Gly Ala Ile Asn Arg Tyr Lys Thr Glu
    130                 135                 140                 


Tyr Ile Asp Lys Leu Ala Ala Leu Ile Lys Lys Tyr Ser Asn Thr Ala
145                 150                 155                 160 


Phe Ala Leu Val Ile Glu Pro Asp Ser Leu Pro Asn Leu Val Thr Asn
                165                 170                 175     


Ser Asn Val Ala Ala Cys Gln Gln Ser Ala Ala Gly Tyr Arg Glu Gly
            180                 185                 190         


Val Ala Tyr Ala Leu Lys Thr Leu Asn Leu Pro Asn Val Val Gln Tyr
        195                 200                 205             


Ile Asp Ala Gly His Gly Gly Trp Leu Gly Trp Asn Asp Asn Leu Gln
    210                 215                 220                 


Pro Gly Ala Arg Glu Leu Ala Asn Ala Tyr Lys Asn Ala Gly Ser Pro
225                 230                 235                 240 


Ser Gln Phe Arg Gly Phe Ala Thr Asn Val Ala Gly Trp Asn Gln Trp
                245                 250                 255     


Asp Ala Glu Pro Gly Glu Phe Ala Gly Ala Ser Asp Ala Gln Trp Asn
            260                 265                 270         


Lys Ala Gln Asn Glu Lys Lys Tyr Val Glu Leu Phe Gly Ala Ala Leu
        275                 280                 285             


Ser Ser Ala Gly Met Pro Asn His Ala Ile Val Asp Thr Gly Arg Ser
    290                 295                 300                 


Gly Lys Pro Gly Gly Arg Lys Glu Trp Gly Asp Trp Cys Asn Val Val
305                 310                 315                 320 


Asn Ser Gly Phe Gly Arg Arg Pro Ser Ser Ser Thr Gly Ser Thr Leu
                325                 330                 335     


Thr Asp Ala Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr
            340                 345                 350         


Ser Asp Thr Ser Ala Ala Arg Tyr Asp Ser Phe Cys Gly Lys Asp Asp
        355                 360                 365             


Ala Phe Lys Pro Ser Pro Glu Ala Gly Gln Trp Asn Gln Ala Phe Phe
    370                 375                 380                 


Glu Gln Leu Leu Asn Asn Ala Ser Pro Ala Phe
385                 390                 395 


<210> 29
<211> 1251
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 29
atggctgatg catccgtttt tccagaaaat tttttgtggg gtgtggcggg cgcggcgcac     60

cagatcgaag gcaataatgt caacagcgac tcgtgggtgc tggaacacat ccccggcggg    120

ccattcgccg agccagcggg cgatgcctgt gaccactatc accgctaccc ccaagacatc    180

gccctcatcg catcgctcgg ttttaacagc taccgcttct cgattgaatg ggcgcggatt    240

gagccggaag aaggcgagtt ttccaacgct gagttggagc attaccgccg gatgctggcg    300

gcctgccacg aacacggcct gacgccgatt gtcacctatc atcatttcac ctctccgcgc    360

tggttcgccg ctaaaggcgg atgggaagtc ctcgccaatg ccgattactt cgcccgctac    420

tgcgaaaagg cgacggcgca cctcggtgac ctgatcggcg cggcctgcac cctcaacgaa    480

cctaatcttg gcttgctcat tcagagtatg ggcttcacac cgccggatga ggttgttgct    540

aaagcgcctt atcgcgctgc tgctgccaaa gccgttggca gcgatcagtt ttcggctttt    600

cccaactgtc agcatgggcc agcgcgggat acctttctga aggcgcaccc gatggctgtc    660

gcggcgatta aaagcgggcg cggcgatttc ccggtaggca tcacactggc gatgtccgac    720

catcaggctg tccccggcgg cgaggcgcat cgtgacaagt tccgccatga tgtcgatgac    780

atttttctcg atctggcgaa ggatgatgat ttcgtcggcg tacagactta cagccgcacc    840

cgtttcggcc cggaagggat gctgcgcggc gaagaaggtg tcccggtgac gcagatgggc    900

tacgaattct ggccggaagc gctggaaggc acgattcgtt atgcggcggc ttataccggg    960

cgtccagtgc tggtgacaga gaacggcatc ggcacggaaa atgacgccga tcgtatcgaa   1020

tatgtgggca gggctttggc aggtgttagc cgctgcttgc aggatggcat cgacgtgcgc   1080

ggctactgct actggtcgat tttcgacaat tttgagtggg gttttggcta ccgtcccaaa   1140

ttcgggttga tcgccgttga ccgcgccacg caggaacgag cacctaaacc cagcgcggca   1200

tggctcggcg aaatcgcgcg ttcaaagggg caatttcaga gtggtttcta a            1251


<210> 30
<211> 416
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(411)
<223> Glycosyl hydrolase family 1

<400> 30
Met Ala Asp Ala Ser Val Phe Pro Glu Asn Phe Leu Trp Gly Val Ala
1               5                   10                  15      


Gly Ala Ala His Gln Ile Glu Gly Asn Asn Val Asn Ser Asp Ser Trp
            20                  25                  30          


Val Leu Glu His Ile Pro Gly Gly Pro Phe Ala Glu Pro Ala Gly Asp
        35                  40                  45              


Ala Cys Asp His Tyr His Arg Tyr Pro Gln Asp Ile Ala Leu Ile Ala
    50                  55                  60                  


Ser Leu Gly Phe Asn Ser Tyr Arg Phe Ser Ile Glu Trp Ala Arg Ile
65                  70                  75                  80  


Glu Pro Glu Glu Gly Glu Phe Ser Asn Ala Glu Leu Glu His Tyr Arg
                85                  90                  95      


Arg Met Leu Ala Ala Cys His Glu His Gly Leu Thr Pro Ile Val Thr
            100                 105                 110         


Tyr His His Phe Thr Ser Pro Arg Trp Phe Ala Ala Lys Gly Gly Trp
        115                 120                 125             


Glu Val Leu Ala Asn Ala Asp Tyr Phe Ala Arg Tyr Cys Glu Lys Ala
    130                 135                 140                 


Thr Ala His Leu Gly Asp Leu Ile Gly Ala Ala Cys Thr Leu Asn Glu
145                 150                 155                 160 


Pro Asn Leu Gly Leu Leu Ile Gln Ser Met Gly Phe Thr Pro Pro Asp
                165                 170                 175     


Glu Val Val Ala Lys Ala Pro Tyr Arg Ala Ala Ala Ala Lys Ala Val
            180                 185                 190         


Gly Ser Asp Gln Phe Ser Ala Phe Pro Asn Cys Gln His Gly Pro Ala
        195                 200                 205             


Arg Asp Thr Phe Leu Lys Ala His Pro Met Ala Val Ala Ala Ile Lys
    210                 215                 220                 


Ser Gly Arg Gly Asp Phe Pro Val Gly Ile Thr Leu Ala Met Ser Asp
225                 230                 235                 240 


His Gln Ala Val Pro Gly Gly Glu Ala His Arg Asp Lys Phe Arg His
                245                 250                 255     


Asp Val Asp Asp Ile Phe Leu Asp Leu Ala Lys Asp Asp Asp Phe Val
            260                 265                 270         


Gly Val Gln Thr Tyr Ser Arg Thr Arg Phe Gly Pro Glu Gly Met Leu
        275                 280                 285             


Arg Gly Glu Glu Gly Val Pro Val Thr Gln Met Gly Tyr Glu Phe Trp
    290                 295                 300                 


Pro Glu Ala Leu Glu Gly Thr Ile Arg Tyr Ala Ala Ala Tyr Thr Gly
305                 310                 315                 320 


Arg Pro Val Leu Val Thr Glu Asn Gly Ile Gly Thr Glu Asn Asp Ala
                325                 330                 335     


Asp Arg Ile Glu Tyr Val Gly Arg Ala Leu Ala Gly Val Ser Arg Cys
            340                 345                 350         


Leu Gln Asp Gly Ile Asp Val Arg Gly Tyr Cys Tyr Trp Ser Ile Phe
        355                 360                 365             


Asp Asn Phe Glu Trp Gly Phe Gly Tyr Arg Pro Lys Phe Gly Leu Ile
    370                 375                 380                 


Ala Val Asp Arg Ala Thr Gln Glu Arg Ala Pro Lys Pro Ser Ala Ala
385                 390                 395                 400 


Trp Leu Gly Glu Ile Ala Arg Ser Lys Gly Gln Phe Gln Ser Gly Phe
                405                 410                 415     


<210> 31
<211> 1338
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 31
atgtcttttc ctagtaactt tttatgggga gcagcaactg ctgcctatca gattgaaggc     60

gctgtaaatg aggatggtcg taaaccatgc atttgggata ccttatcaaa agggcatgta    120

gtttacgatg aaacaggaga gaatgcctgt gaccattacc accgctttga agaagatatt    180

aagctgatga aagagatcgg gcttaaatgt tatagattct cggttgcttg gcctcgcatc    240

atccctgatg gcactggtgc tgttaatgaa aagggtattg agttctatgt aaagcttgta    300

aagctgttaa aagaaaatga cattgaacct atcgttacct tatatcactg ggatcttcca    360

tactcacttt atttaaaagg tggttggaca aatcctgaga tccaagagtg gtttttagag    420

tacacaaagg ttgtagtaaa ggctctatct ccatatgtaa agtacttcct tacatttaat    480

gagcctcagt gttttatcgg tatttcttat gtaggtggtg ttcatgctcc atttttagat    540

gagcctacat cattattacc tgctacaaga aatgtcttac tagcacacgg taaagctgta    600

aaagctatta gagagctagc tccaaacgca aaagtaggct ttgctccaac aggtgctgtc    660

tatgcacctc acaaccacac agaatatgag tacaagaaag cttatgaact tactttctca    720

gacagaagag gtccattctc agttgcttgg tggtgtgatc ctgtatttaa aggatctatt    780

actccacgcg cagcttctta catgaatatt gccccagaga attttatgac ggaagttgag    840

tggcagattg taactgagaa attagatttc ttaggcttta acttctatca gtgcgacggc    900

attcaggaaa atggtcagaa gtacccagac aatacattct gtggaggacc tgtaactgct    960

atgaactggc ctattacacc tgagggtatg tattacgcag ttaagttctt aaaagagcgt   1020

tatcaaagac caattctgat aacagaaaac ggcatggcaa atactgattt tgtaatgcta   1080

gacggtaaag ttcatgatcc acagagaatt gattatgttc accgttatct tttagcttta   1140

aataaagcta ttgaagaggg aatagaggta attggctata actactggtc acttatggat   1200

aactttgagt gggctgaagg ttataaatat agatttggtc ttatctatgt tgattacaga   1260

actcaaaaga ggacactaaa agactctgct tactggtata agaacgtaat tgcttcaaat   1320

ggagagaatt tgttctag                                                 1338


<210> 32
<211> 445
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(442)
<223> Glycosyl hydrolase family 1

<400> 32
Met Ser Phe Pro Ser Asn Phe Leu Trp Gly Ala Ala Thr Ala Ala Tyr
1               5                   10                  15      


Gln Ile Glu Gly Ala Val Asn Glu Asp Gly Arg Lys Pro Cys Ile Trp
            20                  25                  30          


Asp Thr Leu Ser Lys Gly His Val Val Tyr Asp Glu Thr Gly Glu Asn
        35                  40                  45              


Ala Cys Asp His Tyr His Arg Phe Glu Glu Asp Ile Lys Leu Met Lys
    50                  55                  60                  


Glu Ile Gly Leu Lys Cys Tyr Arg Phe Ser Val Ala Trp Pro Arg Ile
65                  70                  75                  80  


Ile Pro Asp Gly Thr Gly Ala Val Asn Glu Lys Gly Ile Glu Phe Tyr
                85                  90                  95      


Val Lys Leu Val Lys Leu Leu Lys Glu Asn Asp Ile Glu Pro Ile Val
            100                 105                 110         


Thr Leu Tyr His Trp Asp Leu Pro Tyr Ser Leu Tyr Leu Lys Gly Gly
        115                 120                 125             


Trp Thr Asn Pro Glu Ile Gln Glu Trp Phe Leu Glu Tyr Thr Lys Val
    130                 135                 140                 


Val Val Lys Ala Leu Ser Pro Tyr Val Lys Tyr Phe Leu Thr Phe Asn
145                 150                 155                 160 


Glu Pro Gln Cys Phe Ile Gly Ile Ser Tyr Val Gly Gly Val His Ala
                165                 170                 175     


Pro Phe Leu Asp Glu Pro Thr Ser Leu Leu Pro Ala Thr Arg Asn Val
            180                 185                 190         


Leu Leu Ala His Gly Lys Ala Val Lys Ala Ile Arg Glu Leu Ala Pro
        195                 200                 205             


Asn Ala Lys Val Gly Phe Ala Pro Thr Gly Ala Val Tyr Ala Pro His
    210                 215                 220                 


Asn His Thr Glu Tyr Glu Tyr Lys Lys Ala Tyr Glu Leu Thr Phe Ser
225                 230                 235                 240 


Asp Arg Arg Gly Pro Phe Ser Val Ala Trp Trp Cys Asp Pro Val Phe
                245                 250                 255     


Lys Gly Ser Ile Thr Pro Arg Ala Ala Ser Tyr Met Asn Ile Ala Pro
            260                 265                 270         


Glu Asn Phe Met Thr Glu Val Glu Trp Gln Ile Val Thr Glu Lys Leu
        275                 280                 285             


Asp Phe Leu Gly Phe Asn Phe Tyr Gln Cys Asp Gly Ile Gln Glu Asn
    290                 295                 300                 


Gly Gln Lys Tyr Pro Asp Asn Thr Phe Cys Gly Gly Pro Val Thr Ala
305                 310                 315                 320 


Met Asn Trp Pro Ile Thr Pro Glu Gly Met Tyr Tyr Ala Val Lys Phe
                325                 330                 335     


Leu Lys Glu Arg Tyr Gln Arg Pro Ile Leu Ile Thr Glu Asn Gly Met
            340                 345                 350         


Ala Asn Thr Asp Phe Val Met Leu Asp Gly Lys Val His Asp Pro Gln
        355                 360                 365             


Arg Ile Asp Tyr Val His Arg Tyr Leu Leu Ala Leu Asn Lys Ala Ile
    370                 375                 380                 


Glu Glu Gly Ile Glu Val Ile Gly Tyr Asn Tyr Trp Ser Leu Met Asp
385                 390                 395                 400 


Asn Phe Glu Trp Ala Glu Gly Tyr Lys Tyr Arg Phe Gly Leu Ile Tyr
                405                 410                 415     


Val Asp Tyr Arg Thr Gln Lys Arg Thr Leu Lys Asp Ser Ala Tyr Trp
            420                 425                 430         


Tyr Lys Asn Val Ile Ala Ser Asn Gly Glu Asn Leu Phe
        435                 440                 445 


<210> 33
<211> 1527
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 33
atgtaccgca ttctcgccac cgcctcggct ctgctggcaa ccgcccgtgc ccagcaagcc     60

tgcaccctca acgccgaaag caagcctgcc ttgacctggt ccaagtgcac atccagcggc    120

tgcagcaacg tccgcggatc tgtcgtggtt gacgccaact ggcgatggac ccatagcacc    180

tccagcagca ccaactgcta caccggcaac acctgggaca agactctctg ccccgatgga    240

aagacctgcg ctgacaagtg ctgtcttgat ggtgccgact actctggcac ctacggagtc    300

acctcgagcg gcaaccagct caacctcaag tttgtgactg ttggaccata cagcaccaat    360

gttggcagcc gtctctacct catggaggat gagaacaact accagatgtt cgacctcctg    420

ggcaacgaat tcacctttga tgtcgatgtc aacaacatcg gatgcggcct gaacggcgcc    480

ctctacttcg tctccatgga caaggatggt ggcaagagcc gcttcagcac caacaaggct    540

ggtgccaagt acggaactgg ctactgcgat gcccagtgcc ctcgcgatgt caagttcatc    600

aacggagttg ccaactccga cgactggcag ccctccgcca gcgacaagaa cgccggtgtt    660

ggcaagtacg gcacctgctg ccctgagatg gatatctggg aggccaacaa gatctccacg    720

gcttacactc cccatccctg caagagcctc acccagcagt cctgcgaggg cgatgcctgc    780

ggtggcacct actcttctac tcgctatgct ggaacttgcg atcccgatgg ttgcgatttc    840

aacccttacc gccagggcaa ccacaccttc tacggtcccg gctccggctt caacgttgat    900

accaccaaga aggtgactgt cgtgacccag ttcatcaagg gcagcgacgg caagctctct    960

gagatcaagc gtctctatgt tcagaacggc aaggtcattg gcaaccccca gtccgagatt   1020

gccaacaacc ccggcagctc cgtcaccgac agcttctgca aggcccagaa ggttgcattc   1080

aacgaccccg atgacttcaa caagaagggt ggctggagcg gcatgaacga cgccctcgcc   1140

aagcccatgg ttctcgtcat gagcctgtgg cacgaccact acgccaacat gctctggctc   1200

gactctacct accccaaggg ctccaagact cccggctctg ctcgtggctc ttgccctgag   1260

gactctggtg tccccgccac tctcgagaag gaggtcccca actccagcgt cagcttctcc   1320

aacatcaagt tcggtcccat cggcagcacc tactccggca ccggcggcaa caaccccgac   1380

cccgaggagc ctgaggagcc cgaggagcct gtcggcaccg tcccccagtg gggccagtgc   1440

ggcggcatca actacagcgg ccccaccgcc tgcgtgtctc cctacaagtg caacaagatc   1500

aacgactact actcccagtg ctactag                                       1527


<210> 34
<211> 508
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(20)

<220> 
<221> DOMAIN
<222> (19)...(453)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (476)...(504)
<223> Fungal cellulose binding domain

<400> 34
Met Tyr Arg Ile Leu Ala Thr Ala Ser Ala Leu Leu Ala Thr Ala Arg
1               5                   10                  15      


Ala Gln Gln Ala Cys Thr Leu Asn Ala Glu Ser Lys Pro Ala Leu Thr
            20                  25                  30          


Trp Ser Lys Cys Thr Ser Ser Gly Cys Ser Asn Val Arg Gly Ser Val
        35                  40                  45              


Val Val Asp Ala Asn Trp Arg Trp Thr His Ser Thr Ser Ser Ser Thr
    50                  55                  60                  


Asn Cys Tyr Thr Gly Asn Thr Trp Asp Lys Thr Leu Cys Pro Asp Gly
65                  70                  75                  80  


Lys Thr Cys Ala Asp Lys Cys Cys Leu Asp Gly Ala Asp Tyr Ser Gly
                85                  90                  95      


Thr Tyr Gly Val Thr Ser Ser Gly Asn Gln Leu Asn Leu Lys Phe Val
            100                 105                 110         


Thr Val Gly Pro Tyr Ser Thr Asn Val Gly Ser Arg Leu Tyr Leu Met
        115                 120                 125             


Glu Asp Glu Asn Asn Tyr Gln Met Phe Asp Leu Leu Gly Asn Glu Phe
    130                 135                 140                 


Thr Phe Asp Val Asp Val Asn Asn Ile Gly Cys Gly Leu Asn Gly Ala
145                 150                 155                 160 


Leu Tyr Phe Val Ser Met Asp Lys Asp Gly Gly Lys Ser Arg Phe Ser
                165                 170                 175     


Thr Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ala Gln
            180                 185                 190         


Cys Pro Arg Asp Val Lys Phe Ile Asn Gly Val Ala Asn Ser Asp Asp
        195                 200                 205             


Trp Gln Pro Ser Ala Ser Asp Lys Asn Ala Gly Val Gly Lys Tyr Gly
    210                 215                 220                 


Thr Cys Cys Pro Glu Met Asp Ile Trp Glu Ala Asn Lys Ile Ser Thr
225                 230                 235                 240 


Ala Tyr Thr Pro His Pro Cys Lys Ser Leu Thr Gln Gln Ser Cys Glu
                245                 250                 255     


Gly Asp Ala Cys Gly Gly Thr Tyr Ser Ser Thr Arg Tyr Ala Gly Thr
            260                 265                 270         


Cys Asp Pro Asp Gly Cys Asp Phe Asn Pro Tyr Arg Gln Gly Asn His
        275                 280                 285             


Thr Phe Tyr Gly Pro Gly Ser Gly Phe Asn Val Asp Thr Thr Lys Lys
    290                 295                 300                 


Val Thr Val Val Thr Gln Phe Ile Lys Gly Ser Asp Gly Lys Leu Ser
305                 310                 315                 320 


Glu Ile Lys Arg Leu Tyr Val Gln Asn Gly Lys Val Ile Gly Asn Pro
                325                 330                 335     


Gln Ser Glu Ile Ala Asn Asn Pro Gly Ser Ser Val Thr Asp Ser Phe
            340                 345                 350         


Cys Lys Ala Gln Lys Val Ala Phe Asn Asp Pro Asp Asp Phe Asn Lys
        355                 360                 365             


Lys Gly Gly Trp Ser Gly Met Asn Asp Ala Leu Ala Lys Pro Met Val
    370                 375                 380                 


Leu Val Met Ser Leu Trp His Asp His Tyr Ala Asn Met Leu Trp Leu
385                 390                 395                 400 


Asp Ser Thr Tyr Pro Lys Gly Ser Lys Thr Pro Gly Ser Ala Arg Gly
                405                 410                 415     


Ser Cys Pro Glu Asp Ser Gly Val Pro Ala Thr Leu Glu Lys Glu Val
            420                 425                 430         


Pro Asn Ser Ser Val Ser Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly
        435                 440                 445             


Ser Thr Tyr Ser Gly Thr Gly Gly Asn Asn Pro Asp Pro Glu Glu Pro
    450                 455                 460                 


Glu Glu Pro Glu Glu Pro Val Gly Thr Val Pro Gln Trp Gly Gln Cys
465                 470                 475                 480 


Gly Gly Ile Asn Tyr Ser Gly Pro Thr Ala Cys Val Ser Pro Tyr Lys
                485                 490                 495     


Cys Asn Lys Ile Asn Asp Tyr Tyr Ser Gln Cys Tyr
            500                 505             


<210> 35
<211> 1515
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 35
atgtatcaga aattggccgc catctcggcc ttcctggctg ctgctcgtgc tcagcaggtc     60

tgcacccaac aagcggagac tcacccacct ctgacatggc agaaatgcac agcttccggt    120

tgcacagctc aatcaggatc cgtggttctt gatgccaact ggcgttggac tcacgatacc    180

aagagcacta ccaactgcta cgatggcaat acctggagct caaccttgtg ccccgacgat    240

gcgacttgtg ccaagaactg ttgtttggac ggagccaact actcaggcac ttacggagtc    300

accaccagcg gcgacgctct caccattcaa tttgttactc agtcgaatgt cggctcccgt    360

ctttacctga tggcaactga taccacttac caggagttca cactgtctgg caacgagttc    420

tcctttgacg ttgatgtttc ccaactgcct tgtggcttga acggagcgct gtactttgtt    480

tccatggatg ccgatggtgg caaaagcaag taccccggca atgctgccgg tgccaaatat    540

ggcacaggtt actgtgacag ccaatgccct cgtgacctga agttcatcaa cggtcaggcc    600

aacgttgatg gttggcaacc atcttccaac aacgccaaca ctggtattgg taaccacgga    660

agctgctgct cagagatgga tatctgggag gcaaactcca tttctgaggc tcttactcct    720

cacccttgcg aggatgtcgg ccagacgatg tgcagcggcg attcttgcgg tggaacctac    780

tccgatgacc gatatggcgg aacctgcgac cctgatggct gcgactggaa cccataccgc    840

ctgggtaaca cctccttcta cggacccggc tcatcattta ctctcgacac caccaagaag    900

ttgaccgttg tgacccagtt tgccaccaac ggtgcaatta gccgatacta tgtccagaat    960

ggagtcaagt tccagcagcc caacgctcaa gtcggcagct actctggcaa caccatcaac   1020

gccgactact gtgcagctga gcagacagcc ttcggcggaa cctcattcac agacaagggc   1080

ggccttgccc agatcaacaa ggcattccag ggcggaatgg tcttggtcat gagcttgtgg   1140

gatgattacg ctgtcaacat gctttggttg gattccacct acccagcaaa cgccactggc   1200

acccccggcg ccaagcgagg aagctgctct accagctctg gtgttcccgc ccaagtcgaa   1260

gctcagtcac ccaactccaa ggttgtcttc tccaacatcc gcttcggacc cattggcagc   1320

actggcggca acactggcag caaccctccc ggcacttcga ccactcgggc gcctccgtcc   1380

agcactggaa gctcccccac cgccacccag acacactacg gccaatgtgg tggaactggc   1440

tggggcggac ctaccatatg cgctagcggc tacacttgcc aggttctgaa cccattctac   1500

tctcagtgcc tgtaa                                                    1515


<210> 36
<211> 504
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (472)...(500)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (19)...(444)
<223> Glycosyl hydrolase family 7

<400> 36
Met Tyr Gln Lys Leu Ala Ala Ile Ser Ala Phe Leu Ala Ala Ala Arg
1               5                   10                  15      


Ala Gln Gln Val Cys Thr Gln Gln Ala Glu Thr His Pro Pro Leu Thr
            20                  25                  30          


Trp Gln Lys Cys Thr Ala Ser Gly Cys Thr Ala Gln Ser Gly Ser Val
        35                  40                  45              


Val Leu Asp Ala Asn Trp Arg Trp Thr His Asp Thr Lys Ser Thr Thr
    50                  55                  60                  


Asn Cys Tyr Asp Gly Asn Thr Trp Ser Ser Thr Leu Cys Pro Asp Asp
65                  70                  75                  80  


Ala Thr Cys Ala Lys Asn Cys Cys Leu Asp Gly Ala Asn Tyr Ser Gly
                85                  90                  95      


Thr Tyr Gly Val Thr Thr Ser Gly Asp Ala Leu Thr Ile Gln Phe Val
            100                 105                 110         


Thr Gln Ser Asn Val Gly Ser Arg Leu Tyr Leu Met Ala Thr Asp Thr
        115                 120                 125             


Thr Tyr Gln Glu Phe Thr Leu Ser Gly Asn Glu Phe Ser Phe Asp Val
    130                 135                 140                 


Asp Val Ser Gln Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val
145                 150                 155                 160 


Ser Met Asp Ala Asp Gly Gly Lys Ser Lys Tyr Pro Gly Asn Ala Ala
                165                 170                 175     


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp
            180                 185                 190         


Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Asp Gly Trp Gln Pro Ser
        195                 200                 205             


Ser Asn Asn Ala Asn Thr Gly Ile Gly Asn His Gly Ser Cys Cys Ser
    210                 215                 220                 


Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Glu Ala Leu Thr Pro
225                 230                 235                 240 


His Pro Cys Glu Asp Val Gly Gln Thr Met Cys Ser Gly Asp Ser Cys
                245                 250                 255     


Gly Gly Thr Tyr Ser Asp Asp Arg Tyr Gly Gly Thr Cys Asp Pro Asp
            260                 265                 270         


Gly Cys Asp Trp Asn Pro Tyr Arg Leu Gly Asn Thr Ser Phe Tyr Gly
        275                 280                 285             


Pro Gly Ser Ser Phe Thr Leu Asp Thr Thr Lys Lys Leu Thr Val Val
    290                 295                 300                 


Thr Gln Phe Ala Thr Asn Gly Ala Ile Ser Arg Tyr Tyr Val Gln Asn
305                 310                 315                 320 


Gly Val Lys Phe Gln Gln Pro Asn Ala Gln Val Gly Ser Tyr Ser Gly
                325                 330                 335     


Asn Thr Ile Asn Ala Asp Tyr Cys Ala Ala Glu Gln Thr Ala Phe Gly
            340                 345                 350         


Gly Thr Ser Phe Thr Asp Lys Gly Gly Leu Ala Gln Ile Asn Lys Ala
        355                 360                 365             


Phe Gln Gly Gly Met Val Leu Val Met Ser Leu Trp Asp Asp Tyr Ala
    370                 375                 380                 


Val Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Ala Asn Ala Thr Gly
385                 390                 395                 400 


Thr Pro Gly Ala Lys Arg Gly Ser Cys Ser Thr Ser Ser Gly Val Pro
                405                 410                 415     


Ala Gln Val Glu Ala Gln Ser Pro Asn Ser Lys Val Val Phe Ser Asn
            420                 425                 430         


Ile Arg Phe Gly Pro Ile Gly Ser Thr Gly Gly Asn Thr Gly Ser Asn
        435                 440                 445             


Pro Pro Gly Thr Ser Thr Thr Arg Ala Pro Pro Ser Ser Thr Gly Ser
    450                 455                 460                 


Ser Pro Thr Ala Thr Gln Thr His Tyr Gly Gln Cys Gly Gly Thr Gly
465                 470                 475                 480 


Trp Gly Gly Pro Thr Ile Cys Ala Ser Gly Tyr Thr Cys Gln Val Leu
                485                 490                 495     


Asn Pro Phe Tyr Ser Gln Cys Leu
            500                 


<210> 37
<211> 1014
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 37
atgccgaaga agcttctcgc gtcgttcatc gcattattct tcgcggcgaa cgcggccgct     60

gcgccttcgt cccaggagcc acctgcctct tcgcagctgg agcttcgccg cggggtgaac    120

gtcctgggtt acgacccgat ctggaccgat cccgcaaaag gccgtttcca gcagagccac    180

tttgccgaga ttcggcgcgg cggcttcgac ttcgtacgcg tcaacctcca cgccttcggt    240

cacatggacg cgaagaatca gctgaagccc gccttcattg agcgcctcga ctggatcgtt    300

acgaacgcga cggctgccgg tctctcggtc atcctcgatg agcacgactt caacgcctgt    360

gcggacaatg tggagacgtg tcgaacgagg ctgtcggcct tctggagcca ggtggctccg    420

cgatatcggg acgcaccgcc gacggtgctt ttcgagcttc tgaatgagcc gcacgcgaaa    480

ctggacgcgg acacttggaa cggcttgttt ccggagatcc ttgcgattgt gcgccagacc    540

aatcccacgc gccgggtcat catcggaccg acacaatgga acagccgtga gaagctcgac    600

acgctgaagc tacccgcgaa cgacccgaac atcatcgcga ccttccatta ttacgacccg    660

ttccccttca cccaccaagg tgcgtcgtgg gtcgaggaga tgaaggcggt gaacggcatc    720

acctggggca gcgagggcga ccgggcgcag gttgcgaccg acttcgacaa ggtcgcgcaa    780

tgggcgaagg cgaacaatcg gcagatactg ctaggcgagt tcggagccta cgatcggagc    840

gggacgccga ccgcgatgcg cgcagcctac acggaggcgg tcgcccgcga ggccgagcgg    900

cacggcttct cctgggccta ttggcagttc gacagcgact tcatcgtctg ggacatggcc    960

aggaagggct gggtcgagcc catccacaag gcgctgatcc cggaggctcg ctag         1014


<210> 38
<211> 337
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(20)

<220> 
<221> DOMAIN
<222> (38)...(318)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 38
Met Pro Lys Lys Leu Leu Ala Ser Phe Ile Ala Leu Phe Phe Ala Ala
1               5                   10                  15      


Asn Ala Ala Ala Ala Pro Ser Ser Gln Glu Pro Pro Ala Ser Ser Gln
            20                  25                  30          


Leu Glu Leu Arg Arg Gly Val Asn Val Leu Gly Tyr Asp Pro Ile Trp
        35                  40                  45              


Thr Asp Pro Ala Lys Gly Arg Phe Gln Gln Ser His Phe Ala Glu Ile
    50                  55                  60                  


Arg Arg Gly Gly Phe Asp Phe Val Arg Val Asn Leu His Ala Phe Gly
65                  70                  75                  80  


His Met Asp Ala Lys Asn Gln Leu Lys Pro Ala Phe Ile Glu Arg Leu
                85                  90                  95      


Asp Trp Ile Val Thr Asn Ala Thr Ala Ala Gly Leu Ser Val Ile Leu
            100                 105                 110         


Asp Glu His Asp Phe Asn Ala Cys Ala Asp Asn Val Glu Thr Cys Arg
        115                 120                 125             


Thr Arg Leu Ser Ala Phe Trp Ser Gln Val Ala Pro Arg Tyr Arg Asp
    130                 135                 140                 


Ala Pro Pro Thr Val Leu Phe Glu Leu Leu Asn Glu Pro His Ala Lys
145                 150                 155                 160 


Leu Asp Ala Asp Thr Trp Asn Gly Leu Phe Pro Glu Ile Leu Ala Ile
                165                 170                 175     


Val Arg Gln Thr Asn Pro Thr Arg Arg Val Ile Ile Gly Pro Thr Gln
            180                 185                 190         


Trp Asn Ser Arg Glu Lys Leu Asp Thr Leu Lys Leu Pro Ala Asn Asp
        195                 200                 205             


Pro Asn Ile Ile Ala Thr Phe His Tyr Tyr Asp Pro Phe Pro Phe Thr
    210                 215                 220                 


His Gln Gly Ala Ser Trp Val Glu Glu Met Lys Ala Val Asn Gly Ile
225                 230                 235                 240 


Thr Trp Gly Ser Glu Gly Asp Arg Ala Gln Val Ala Thr Asp Phe Asp
                245                 250                 255     


Lys Val Ala Gln Trp Ala Lys Ala Asn Asn Arg Gln Ile Leu Leu Gly
            260                 265                 270         


Glu Phe Gly Ala Tyr Asp Arg Ser Gly Thr Pro Thr Ala Met Arg Ala
        275                 280                 285             


Ala Tyr Thr Glu Ala Val Ala Arg Glu Ala Glu Arg His Gly Phe Ser
    290                 295                 300                 


Trp Ala Tyr Trp Gln Phe Asp Ser Asp Phe Ile Val Trp Asp Met Ala
305                 310                 315                 320 


Arg Lys Gly Trp Val Glu Pro Ile His Lys Ala Leu Ile Pro Glu Ala
                325                 330                 335     


Arg
    


<210> 39
<211> 1872
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 39
atgacccagc ccgatatccg cgtcgccggt ccgctgttcc tggatcgtca cggccggcag     60

gtgatcctgc gcggcgtcaa cctcggcggc gacaccaagg tcccctggcc cggcggcggc    120

accgagaacc cgtccaattt caccgatcat cgcgccgttt cgttcgtcgg acggccattt    180

ccgctggaag aggccgatga gcatctggcc cgtatcgccg gctggggctt caacgtgctg    240

cggctgctga ccacctggga ggcggtcgag cacgcggggc ccaggcagta cgacaccgcc    300

tatctcgact atctggtcgc ggtcatcggc aaggcggccg agcacggcct caacgtcttc    360

atcgacttcc accaggacgt ctgggcccgg atgagcggcg gcgacggcgc gcccggctgg    420

acgttcgagg cggcaggcct ggacatcggc cgcttccacg ccagcggcgc ggcgctggtg    480

atgcagaacg cctacgacta cgccagcgac gaacgccgtc agcccgccta tccgcagatg    540

gtctggagca gcaactaccg gctgccggcc aacggggtga tgtggagcct gttctggggc    600

ggccggttct tcgcgccgga cttcgagatc gagggcgaga acgtccagaa cttcctgcag    660

gggcgatacc tgggcgccat ggacgccatc gcccgccggg tgaaagacct ccccaacgtg    720

atcggcttcg acaccctcaa cgaacccggc ttcggctggt tcgggacgcc gttgagctat    780

cggcacctga ggaagaccga ggagaaccgc gtcctgccga tggacggccc ggccctgtcg    840

cccggcgacc agctggcgat cctggcgggc gagtcgccga gcgtgccggt gctcaagggc    900

ggcgaggtgg tcggcgagca ggtcatgaac cccggcgggg tgcgggtctg gacgggcgcc    960

gacccgttcg cggcgcacct gcgcgaagac ctgttcagcc atgccggcgg tcacccgctg   1020

tcgctgtcgg aagacgccta tgggcccttc ttccagcgcg tggcggacac catccgggcg   1080

cacaatccgg gctgggcggt gttcgccgag atggacccct acgccgtctt cgccaagcgc   1140

ggctttccga cgacgctgcc ggagcggacg gtcaacgccg gccactggta cgacgtgcgg   1200

ctgctgcatt cgaaggacta cgacgccaag gccgacccgg ccgagacggc ggcccgctac   1260

gtgcgccagc tgtcgcacct caagcgcgag gccgaggcgt tcgagggcgg ggctccggcc   1320

ctggtcgggg agttcggcat cccctacgac ctcgaccaag gcgaggccta cgcggcctgg   1380

gaccgcggcg agcgggacgg gatctgggcc aagcacgagg cggcgctgac ggcgatgtac   1440

gacgcgctgg accagctgca tctgcattcg acgcagtgga actacaccgc cagcaaccgc   1500

aacgacctgc gcatcggcga ccgctggaac caggaggatc tgtcgatctt ctcggccgac   1560

cagatcgagc ccggcaaccc cgacggcggc cgggccacgg cgggcttctg ccggccctac   1620

gcccgggcgg tgcaggggcg gctggtcgag gtggcgttcg acaaggcggc ggggacgttc   1680

cggctggtct gggacgcgga cccggccgtg ctcgagccga cggagatctt cgtgcccggg   1740

ctccagttcc cgaacggctt cgacctcgac atcgacggcg actgggaaga ggcgggcgag   1800

gcgggggacc agctgctggt ggtgagggcg cggtcggcgg ggcggatcgc gctgaccttg   1860

aagcgcctct ga                                                       1872


<210> 40
<211> 623
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (39)...(503)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 40
Met Thr Gln Pro Asp Ile Arg Val Ala Gly Pro Leu Phe Leu Asp Arg
1               5                   10                  15      


His Gly Arg Gln Val Ile Leu Arg Gly Val Asn Leu Gly Gly Asp Thr
            20                  25                  30          


Lys Val Pro Trp Pro Gly Gly Gly Thr Glu Asn Pro Ser Asn Phe Thr
        35                  40                  45              


Asp His Arg Ala Val Ser Phe Val Gly Arg Pro Phe Pro Leu Glu Glu
    50                  55                  60                  


Ala Asp Glu His Leu Ala Arg Ile Ala Gly Trp Gly Phe Asn Val Leu
65                  70                  75                  80  


Arg Leu Leu Thr Thr Trp Glu Ala Val Glu His Ala Gly Pro Arg Gln
                85                  90                  95      


Tyr Asp Thr Ala Tyr Leu Asp Tyr Leu Val Ala Val Ile Gly Lys Ala
            100                 105                 110         


Ala Glu His Gly Leu Asn Val Phe Ile Asp Phe His Gln Asp Val Trp
        115                 120                 125             


Ala Arg Met Ser Gly Gly Asp Gly Ala Pro Gly Trp Thr Phe Glu Ala
    130                 135                 140                 


Ala Gly Leu Asp Ile Gly Arg Phe His Ala Ser Gly Ala Ala Leu Val
145                 150                 155                 160 


Met Gln Asn Ala Tyr Asp Tyr Ala Ser Asp Glu Arg Arg Gln Pro Ala
                165                 170                 175     


Tyr Pro Gln Met Val Trp Ser Ser Asn Tyr Arg Leu Pro Ala Asn Gly
            180                 185                 190         


Val Met Trp Ser Leu Phe Trp Gly Gly Arg Phe Phe Ala Pro Asp Phe
        195                 200                 205             


Glu Ile Glu Gly Glu Asn Val Gln Asn Phe Leu Gln Gly Arg Tyr Leu
    210                 215                 220                 


Gly Ala Met Asp Ala Ile Ala Arg Arg Val Lys Asp Leu Pro Asn Val
225                 230                 235                 240 


Ile Gly Phe Asp Thr Leu Asn Glu Pro Gly Phe Gly Trp Phe Gly Thr
                245                 250                 255     


Pro Leu Ser Tyr Arg His Leu Arg Lys Thr Glu Glu Asn Arg Val Leu
            260                 265                 270         


Pro Met Asp Gly Pro Ala Leu Ser Pro Gly Asp Gln Leu Ala Ile Leu
        275                 280                 285             


Ala Gly Glu Ser Pro Ser Val Pro Val Leu Lys Gly Gly Glu Val Val
    290                 295                 300                 


Gly Glu Gln Val Met Asn Pro Gly Gly Val Arg Val Trp Thr Gly Ala
305                 310                 315                 320 


Asp Pro Phe Ala Ala His Leu Arg Glu Asp Leu Phe Ser His Ala Gly
                325                 330                 335     


Gly His Pro Leu Ser Leu Ser Glu Asp Ala Tyr Gly Pro Phe Phe Gln
            340                 345                 350         


Arg Val Ala Asp Thr Ile Arg Ala His Asn Pro Gly Trp Ala Val Phe
        355                 360                 365             


Ala Glu Met Asp Pro Tyr Ala Val Phe Ala Lys Arg Gly Phe Pro Thr
    370                 375                 380                 


Thr Leu Pro Glu Arg Thr Val Asn Ala Gly His Trp Tyr Asp Val Arg
385                 390                 395                 400 


Leu Leu His Ser Lys Asp Tyr Asp Ala Lys Ala Asp Pro Ala Glu Thr
                405                 410                 415     


Ala Ala Arg Tyr Val Arg Gln Leu Ser His Leu Lys Arg Glu Ala Glu
            420                 425                 430         


Ala Phe Glu Gly Gly Ala Pro Ala Leu Val Gly Glu Phe Gly Ile Pro
        435                 440                 445             


Tyr Asp Leu Asp Gln Gly Glu Ala Tyr Ala Ala Trp Asp Arg Gly Glu
    450                 455                 460                 


Arg Asp Gly Ile Trp Ala Lys His Glu Ala Ala Leu Thr Ala Met Tyr
465                 470                 475                 480 


Asp Ala Leu Asp Gln Leu His Leu His Ser Thr Gln Trp Asn Tyr Thr
                485                 490                 495     


Ala Ser Asn Arg Asn Asp Leu Arg Ile Gly Asp Arg Trp Asn Gln Glu
            500                 505                 510         


Asp Leu Ser Ile Phe Ser Ala Asp Gln Ile Glu Pro Gly Asn Pro Asp
        515                 520                 525             


Gly Gly Arg Ala Thr Ala Gly Phe Cys Arg Pro Tyr Ala Arg Ala Val
    530                 535                 540                 


Gln Gly Arg Leu Val Glu Val Ala Phe Asp Lys Ala Ala Gly Thr Phe
545                 550                 555                 560 


Arg Leu Val Trp Asp Ala Asp Pro Ala Val Leu Glu Pro Thr Glu Ile
                565                 570                 575     


Phe Val Pro Gly Leu Gln Phe Pro Asn Gly Phe Asp Leu Asp Ile Asp
            580                 585                 590         


Gly Asp Trp Glu Glu Ala Gly Glu Ala Gly Asp Gln Leu Leu Val Val
        595                 600                 605             


Arg Ala Arg Ser Ala Gly Arg Ile Ala Leu Thr Leu Lys Arg Leu
    610                 615                 620             


<210> 41
<211> 1380
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 41
aagctcgccg cccgttttcc gggcgatttc gtcttcggcg tcgcgaccgc cgccttccag     60

atcgaaggcg cgtcgaaagc agatggccgc aagccctcga tctgggatgc cttctccaac    120

atgccgggtc gcgtccataa ccgcgacaat ggcgatgtcg cctgcgatca ttacaaccgg    180

ctcgacgaag acctcgacct gatcaaggaa ctgggtgtcg aagcctatcg cttctcgatc    240

gcctggccgc gtatctatcc tgatggcact ggcccgctga acgagaaggg tctcgatttc    300

tacgaccgcc tggtcgacgg ttgcaagcag cgcggcatca agacctttgc gacgctctat    360

cactgggatc tgccgctgac cctgatgggc gacggcggct ggacggcgcg ctcgaccgct    420

tatgccttcc agcgctatgc gcagacgatc gcccggcgcc tcggcgaccg gctcgacgcg    480

gtcgcgacct tcaatgagcc ctggtgctcg gtcatcctgt cgcatcttct cggcatccat    540

gcgcccggcg agcgcaacat gcaggccacc ctgcatgcgg cccactacac caatcttgcc    600

cacggcctcg gcgtcgaggc gatccgggcc gaggcgccaa agctgcccgt cggcatcgtc    660

tacaacgcga tgtcgatcat cccggccaca cagacggagg ctgatctggc tgccgccaag    720

cgcgccgagg acttccacaa tgacatgttc tttggccccg tgttcaaggg cgcctatccc    780

aaggccctga tcgaagccta cgagccgatc atgcccgtca tcgaggagag cgacctcaag    840

atcatcagcc agaagatcga ctggtggggc ctgaactact atacgccgat gcgcgtggcc    900

gccgatccca atccggatgc cccctatccg gcgcatatcc aggcgccggc tgtgaagccg    960

gaaaagaccg atatcggctg ggaaatcgat tccacaggcc tcacccatat cgtccgcgac   1020

ctctattcga aatatgacct gccggattgc tacatcaccg agaatggcgc cgcctacaac   1080

atggacgtgg gcgccgatgg cgaagtcgat gaccagccgc gtctcgatta ttatgtcgac   1140

catctcggag tcacggccga cctgatcaag gaaggcttcc cgatgcgcgg ctattttgcc   1200

tggtcgctga tggacaattt cgaatgggcg gagggctaca agatgcgctt cggcctcgtc   1260

catgtcgatt acgagaccca ggtccgcaca gtgaagaaga gcggcaagtg gtattccgag   1320

ctggcgagcg aattccccaa gggcaatttc gccaacacca aggccgccga ggccgccgag   1380


<210> 42
<211> 460
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (2)...(447)
<223> Glycosyl hydrolase family 1

<400> 42
Lys Leu Ala Ala Arg Phe Pro Gly Asp Phe Val Phe Gly Val Ala Thr
1               5                   10                  15      


Ala Ala Phe Gln Ile Glu Gly Ala Ser Lys Ala Asp Gly Arg Lys Pro
            20                  25                  30          


Ser Ile Trp Asp Ala Phe Ser Asn Met Pro Gly Arg Val His Asn Arg
        35                  40                  45              


Asp Asn Gly Asp Val Ala Cys Asp His Tyr Asn Arg Leu Asp Glu Asp
    50                  55                  60                  


Leu Asp Leu Ile Lys Glu Leu Gly Val Glu Ala Tyr Arg Phe Ser Ile
65                  70                  75                  80  


Ala Trp Pro Arg Ile Tyr Pro Asp Gly Thr Gly Pro Leu Asn Glu Lys
                85                  90                  95      


Gly Leu Asp Phe Tyr Asp Arg Leu Val Asp Gly Cys Lys Gln Arg Gly
            100                 105                 110         


Ile Lys Thr Phe Ala Thr Leu Tyr His Trp Asp Leu Pro Leu Thr Leu
        115                 120                 125             


Met Gly Asp Gly Gly Trp Thr Ala Arg Ser Thr Ala Tyr Ala Phe Gln
    130                 135                 140                 


Arg Tyr Ala Gln Thr Ile Ala Arg Arg Leu Gly Asp Arg Leu Asp Ala
145                 150                 155                 160 


Val Ala Thr Phe Asn Glu Pro Trp Cys Ser Val Ile Leu Ser His Leu
                165                 170                 175     


Leu Gly Ile His Ala Pro Gly Glu Arg Asn Met Gln Ala Thr Leu His
            180                 185                 190         


Ala Ala His Tyr Thr Asn Leu Ala His Gly Leu Gly Val Glu Ala Ile
        195                 200                 205             


Arg Ala Glu Ala Pro Lys Leu Pro Val Gly Ile Val Tyr Asn Ala Met
    210                 215                 220                 


Ser Ile Ile Pro Ala Thr Gln Thr Glu Ala Asp Leu Ala Ala Ala Lys
225                 230                 235                 240 


Arg Ala Glu Asp Phe His Asn Asp Met Phe Phe Gly Pro Val Phe Lys
                245                 250                 255     


Gly Ala Tyr Pro Lys Ala Leu Ile Glu Ala Tyr Glu Pro Ile Met Pro
            260                 265                 270         


Val Ile Glu Glu Ser Asp Leu Lys Ile Ile Ser Gln Lys Ile Asp Trp
        275                 280                 285             


Trp Gly Leu Asn Tyr Tyr Thr Pro Met Arg Val Ala Ala Asp Pro Asn
    290                 295                 300                 


Pro Asp Ala Pro Tyr Pro Ala His Ile Gln Ala Pro Ala Val Lys Pro
305                 310                 315                 320 


Glu Lys Thr Asp Ile Gly Trp Glu Ile Asp Ser Thr Gly Leu Thr His
                325                 330                 335     


Ile Val Arg Asp Leu Tyr Ser Lys Tyr Asp Leu Pro Asp Cys Tyr Ile
            340                 345                 350         


Thr Glu Asn Gly Ala Ala Tyr Asn Met Asp Val Gly Ala Asp Gly Glu
        355                 360                 365             


Val Asp Asp Gln Pro Arg Leu Asp Tyr Tyr Val Asp His Leu Gly Val
    370                 375                 380                 


Thr Ala Asp Leu Ile Lys Glu Gly Phe Pro Met Arg Gly Tyr Phe Ala
385                 390                 395                 400 


Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Glu Gly Tyr Lys Met Arg
                405                 410                 415     


Phe Gly Leu Val His Val Asp Tyr Glu Thr Gln Val Arg Thr Val Lys
            420                 425                 430         


Lys Ser Gly Lys Trp Tyr Ser Glu Leu Ala Ser Glu Phe Pro Lys Gly
        435                 440                 445             


Asn Phe Ala Asn Thr Lys Ala Ala Glu Ala Ala Glu
    450                 455                 460 


<210> 43
<211> 1374
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 43
atgatgcgat atctttccat cgttgccgcc acggcacttc tgaccggagt ggaagctcag     60

caatcagtct ggggacaatg tggtggtcaa ggctacactg gcgcgacgtc atgcgctgcc    120

ggttctacat gcagcactca aaacccttac tacgcacaat gcgtccctgc caccgctact    180

tcaactacat tggtgacaaa aacgtcttct accagcgtcg gaacgacatc accgccaaca    240

acaaccacga cgaaagctag taccactgct actaacactg ccgctgcatc cggaaaccca    300

ttctccggtt accagctcta tgccaatccg tactattctt cagaagtaca cacccttgcc    360

ctcccatctt tgactggctc acttgccgct gctgcgacca aagctgccga ggtcccctca    420

tttgtctggc ttgacacggc agccaaagtg cctacaatgg gcacctactt ggccaacatt    480

gaagctgcaa acaaggctgg cgccagccca cctattgccg gtatcttcgt tgtctatgac    540

ttgcctgacc gtgactgcgc agcagctgca agtaatggcg aatacactgt agcaaacaac    600

ggtgtagcaa actataaggc ttacattgat agcattgtgg ctcagttgaa agcctatccc    660

gacgtgcaca caatccttat cattgagcct gatagtcttg ccaacatggt caccaatttg    720

tctacagcca agtgtgccga agctcaatct gcatactatg agtgcgtcaa ctacgcattg    780

atcaacctca acttggccaa cgtagccatg tacatcgatg ccggtcatgc tggttggctc    840

ggatggcctg ccaatctttc accggcggct gaactcttcg caacagtcta taagaatgca    900

agtgctcctg ctgcacttcg gggattggct accaacgttg ccaactacaa tgcctggtcg    960

atcagcagcc caccctcata cacatctggc gactccaact acgacgaaca gctctacatc   1020

aacgctttgt ctcctctcct gacatctaac ggctggccta acgctcactt catcatggat   1080

acttcccgga acggtgttca accgactaaa cagcaggcat ggggtgactg gtgcaacgtg   1140

atcggaaccg gcttcggtgt tcagccgaca acaaatactg gtgacccact cgaggatgcc   1200

tttgtctggg tcaagccagg tggtgaaagt gatggtacat caaacagttc cgctactcgt   1260

tacgatttcc attgcggcta cagtgatgca cttcaacctg ctcccgaggc tggcacttgg   1320

ttccaagcat actttgtcca gcttttgacc aatgccaacc cagctttggt ctag         1374


<210> 44
<211> 457
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (108)...(423)
<223> Glycosyl hydrolases family 6

<220> 
<221> DOMAIN
<222> (23)...(51)
<223> Fungal cellulose binding domain

<400> 44
Met Met Arg Tyr Leu Ser Ile Val Ala Ala Thr Ala Leu Leu Thr Gly
1               5                   10                  15      


Val Glu Ala Gln Gln Ser Val Trp Gly Gln Cys Gly Gly Gln Gly Tyr
            20                  25                  30          


Thr Gly Ala Thr Ser Cys Ala Ala Gly Ser Thr Cys Ser Thr Gln Asn
        35                  40                  45              


Pro Tyr Tyr Ala Gln Cys Val Pro Ala Thr Ala Thr Ser Thr Thr Leu
    50                  55                  60                  


Val Thr Lys Thr Ser Ser Thr Ser Val Gly Thr Thr Ser Pro Pro Thr
65                  70                  75                  80  


Thr Thr Thr Thr Lys Ala Ser Thr Thr Ala Thr Asn Thr Ala Ala Ala
                85                  90                  95      


Ser Gly Asn Pro Phe Ser Gly Tyr Gln Leu Tyr Ala Asn Pro Tyr Tyr
            100                 105                 110         


Ser Ser Glu Val His Thr Leu Ala Leu Pro Ser Leu Thr Gly Ser Leu
        115                 120                 125             


Ala Ala Ala Ala Thr Lys Ala Ala Glu Val Pro Ser Phe Val Trp Leu
    130                 135                 140                 


Asp Thr Ala Ala Lys Val Pro Thr Met Gly Thr Tyr Leu Ala Asn Ile
145                 150                 155                 160 


Glu Ala Ala Asn Lys Ala Gly Ala Ser Pro Pro Ile Ala Gly Ile Phe
                165                 170                 175     


Val Val Tyr Asp Leu Pro Asp Arg Asp Cys Ala Ala Ala Ala Ser Asn
            180                 185                 190         


Gly Glu Tyr Thr Val Ala Asn Asn Gly Val Ala Asn Tyr Lys Ala Tyr
        195                 200                 205             


Ile Asp Ser Ile Val Ala Gln Leu Lys Ala Tyr Pro Asp Val His Thr
    210                 215                 220                 


Ile Leu Ile Ile Glu Pro Asp Ser Leu Ala Asn Met Val Thr Asn Leu
225                 230                 235                 240 


Ser Thr Ala Lys Cys Ala Glu Ala Gln Ser Ala Tyr Tyr Glu Cys Val
                245                 250                 255     


Asn Tyr Ala Leu Ile Asn Leu Asn Leu Ala Asn Val Ala Met Tyr Ile
            260                 265                 270         


Asp Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala Asn Leu Ser Pro
        275                 280                 285             


Ala Ala Glu Leu Phe Ala Thr Val Tyr Lys Asn Ala Ser Ala Pro Ala
    290                 295                 300                 


Ala Leu Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Ala Trp Ser
305                 310                 315                 320 


Ile Ser Ser Pro Pro Ser Tyr Thr Ser Gly Asp Ser Asn Tyr Asp Glu
                325                 330                 335     


Gln Leu Tyr Ile Asn Ala Leu Ser Pro Leu Leu Thr Ser Asn Gly Trp
            340                 345                 350         


Pro Asn Ala His Phe Ile Met Asp Thr Ser Arg Asn Gly Val Gln Pro
        355                 360                 365             


Thr Lys Gln Gln Ala Trp Gly Asp Trp Cys Asn Val Ile Gly Thr Gly
    370                 375                 380                 


Phe Gly Val Gln Pro Thr Thr Asn Thr Gly Asp Pro Leu Glu Asp Ala
385                 390                 395                 400 


Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr Ser Asn Ser
                405                 410                 415     


Ser Ala Thr Arg Tyr Asp Phe His Cys Gly Tyr Ser Asp Ala Leu Gln
            420                 425                 430         


Pro Ala Pro Glu Ala Gly Thr Trp Phe Gln Ala Tyr Phe Val Gln Leu
        435                 440                 445             


Leu Thr Asn Ala Asn Pro Ala Leu Val
    450                 455         


<210> 45
<211> 1569
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 45
atgtccctcc tgctcacggc actgtccctc gtcgcggcag ccaaggctca gcaagtctgc     60

accctcacga cggagacaca cccacctctg tcgtggtcca aatgcacctc gtcgggctgc    120

accacgacgc agggctctgt tgtcgtcgac gccaactggc gctggactca tctcaccagc    180

tcgtcgacaa actgctacac gggcaacaaa tgggacacgt ccatctgcac gtcgggggcc    240

acgtgcgcgg cgcaatgctg cgtcgacggc gccgactatg ccggcacgta cggcgtcaca    300

acgtcgggca accagctcaa catcaagttt gtcaccaacg ggccctacag caagaacatt    360

ggcagtcggc tgtacctgat gcaggacgac accaactacc agatgttcac gctgctgggc    420

aacgagttca gcttcgacgt tgacgtgtcc aagattagct gtgggcttaa cggcgccctg    480

tactttgtgt cgatggacca ggacggcggc atgtccaagt acagcggcaa caaggccggt    540

gccaagtacg gcacgggata ctgcgacagc cagtgcccgc gagacgtcaa gtttatcaat    600

ggcgtggcca actcggacgg ttggcagcca tcggccaacg acgccaacgc tggcatcggc    660

aacctgggca cctgctgcgc cgaaatggac atctgggaag ccaacgacat ctcggccgcc    720

tacacccccc atccgtgcac caccatcggc cagcactcgt gcactggcga cagctgcggc    780

ggcacctact cctcggaccg atacggcggc gattgcgacc ccgacggctg cgacttcaac    840

agctaccgcc agggcaacac caccttctac ggccccggct ccggcttcac cctcgacacc    900

acccagaagc tcaccgtcgt cacccagttc ctcaagggct ccgacggcaa cctctccgag    960

attaagcgct tctacgtcca gaacggcaag gtcgtcccca actcccagag cgacatctct   1020

ggcgtctcgg gcaactcggt cacccaggcc tactgcgacg cccaaaagtc cgtctttggc   1080

gacaaggact cgttcaatgc aaagggcggc ttggcacaga tgggcaaggc ggtcgctcag   1140

cccatggttc tggtcatgag cctctgggac gaccactact ccaacatgct ctggctcgac   1200

tctacctatc ccacaaactc caccgctctg ggcaccaaac gcggcagctg tgccaccacc   1260

tctggcgtcc cctccgacat tgagaactca gcggccgact ctaccgtcgt cttctccaac   1320

atcaagttcg gcgccatcaa ctccaccttt actggctctg ggaacacggg cggcggtagc   1380

accaccacca aagccagcag caccaccacc agcatcaaga ccagcaccag ctccagctct   1440

ggcagcactg ccacgggaac cgctcctcac tggggtcagt gtggtggcat cggctggact   1500

ggtcctaccg tctgcgctgc cggatatact tgcacttaca acaatgccta ctactctcag   1560

tgcttgtaa                                                           1569


<210> 46
<211> 522
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (18)...(452)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (490)...(518)
<223> Fungal cellulose binding domain

<400> 46
Met Ser Leu Leu Leu Thr Ala Leu Ser Leu Val Ala Ala Ala Lys Ala
1               5                   10                  15      


Gln Gln Val Cys Thr Leu Thr Thr Glu Thr His Pro Pro Leu Ser Trp
            20                  25                  30          


Ser Lys Cys Thr Ser Ser Gly Cys Thr Thr Thr Gln Gly Ser Val Val
        35                  40                  45              


Val Asp Ala Asn Trp Arg Trp Thr His Leu Thr Ser Ser Ser Thr Asn
    50                  55                  60                  


Cys Tyr Thr Gly Asn Lys Trp Asp Thr Ser Ile Cys Thr Ser Gly Ala
65                  70                  75                  80  


Thr Cys Ala Ala Gln Cys Cys Val Asp Gly Ala Asp Tyr Ala Gly Thr
                85                  90                  95      


Tyr Gly Val Thr Thr Ser Gly Asn Gln Leu Asn Ile Lys Phe Val Thr
            100                 105                 110         


Asn Gly Pro Tyr Ser Lys Asn Ile Gly Ser Arg Leu Tyr Leu Met Gln
        115                 120                 125             


Asp Asp Thr Asn Tyr Gln Met Phe Thr Leu Leu Gly Asn Glu Phe Ser
    130                 135                 140                 


Phe Asp Val Asp Val Ser Lys Ile Ser Cys Gly Leu Asn Gly Ala Leu
145                 150                 155                 160 


Tyr Phe Val Ser Met Asp Gln Asp Gly Gly Met Ser Lys Tyr Ser Gly
                165                 170                 175     


Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys
            180                 185                 190         


Pro Arg Asp Val Lys Phe Ile Asn Gly Val Ala Asn Ser Asp Gly Trp
        195                 200                 205             


Gln Pro Ser Ala Asn Asp Ala Asn Ala Gly Ile Gly Asn Leu Gly Thr
    210                 215                 220                 


Cys Cys Ala Glu Met Asp Ile Trp Glu Ala Asn Asp Ile Ser Ala Ala
225                 230                 235                 240 


Tyr Thr Pro His Pro Cys Thr Thr Ile Gly Gln His Ser Cys Thr Gly
                245                 250                 255     


Asp Ser Cys Gly Gly Thr Tyr Ser Ser Asp Arg Tyr Gly Gly Asp Cys
            260                 265                 270         


Asp Pro Asp Gly Cys Asp Phe Asn Ser Tyr Arg Gln Gly Asn Thr Thr
        275                 280                 285             


Phe Tyr Gly Pro Gly Ser Gly Phe Thr Leu Asp Thr Thr Gln Lys Leu
    290                 295                 300                 


Thr Val Val Thr Gln Phe Leu Lys Gly Ser Asp Gly Asn Leu Ser Glu
305                 310                 315                 320 


Ile Lys Arg Phe Tyr Val Gln Asn Gly Lys Val Val Pro Asn Ser Gln
                325                 330                 335     


Ser Asp Ile Ser Gly Val Ser Gly Asn Ser Val Thr Gln Ala Tyr Cys
            340                 345                 350         


Asp Ala Gln Lys Ser Val Phe Gly Asp Lys Asp Ser Phe Asn Ala Lys
        355                 360                 365             


Gly Gly Leu Ala Gln Met Gly Lys Ala Val Ala Gln Pro Met Val Leu
    370                 375                 380                 


Val Met Ser Leu Trp Asp Asp His Tyr Ser Asn Met Leu Trp Leu Asp
385                 390                 395                 400 


Ser Thr Tyr Pro Thr Asn Ser Thr Ala Leu Gly Thr Lys Arg Gly Ser
                405                 410                 415     


Cys Ala Thr Thr Ser Gly Val Pro Ser Asp Ile Glu Asn Ser Ala Ala
            420                 425                 430         


Asp Ser Thr Val Val Phe Ser Asn Ile Lys Phe Gly Ala Ile Asn Ser
        435                 440                 445             


Thr Phe Thr Gly Ser Gly Asn Thr Gly Gly Gly Ser Thr Thr Thr Lys
    450                 455                 460                 


Ala Ser Ser Thr Thr Thr Ser Ile Lys Thr Ser Thr Ser Ser Ser Ser
465                 470                 475                 480 


Gly Ser Thr Ala Thr Gly Thr Ala Pro His Trp Gly Gln Cys Gly Gly
                485                 490                 495     


Ile Gly Trp Thr Gly Pro Thr Val Cys Ala Ala Gly Tyr Thr Cys Thr
            500                 505                 510         


Tyr Asn Asn Ala Tyr Tyr Ser Gln Cys Leu
        515                 520         


<210> 47
<211> 1170
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 47
atgagaaaaa acattttaat gttagccgta gctatgattg cggcaatgtg tttaactacg     60

tcgtgcggaa acaaagccca gaaacaggac gaaacgcagg ctggacaagt gaacaacttc    120

cgcattaagc gcggtaccaa catcagtcat tggctgtcgc agagtgagca gcgtggcgag    180

gcccgtcgct tgcacattca ggaggacgac tttgcacgtc tggaggagtt aggattcgac    240

tttgtacgta ttcccatcga cgaggtacag ttctgggacg aggacggcaa gcagttgccc    300

gaggcttggg gattgctgaa caacgcactc gactgggcaa agaagcacaa cctgcgcgcc    360

attgtggatc tgcatattat ccgctcgcac tactttaacg cagcaaacga ggacgataag    420

gctgctaaca ccctgtttac atcagaagag tcgcagcagg gactgattaa cctgtggaag    480

cagctgtcgg acaccttaaa gaaccgcagc aacgactggg tggcttacga gtttatgaac    540

gagcctgtag cacccgagca cgagcagtgg aaccagctgg tagccaaggt acacaaggcc    600

ctgcgcgaac tggagccaca gcgtacactg gtaattggta gtaacatgtg gcagggacac    660

gagaccatga agtacctgaa ggtgcccgag ggcgacaaga atatcatcct gagtttccac    720

tactacaacc ccatgattct gacacactat ggtgcttggt ggacaccact gggcaagtat    780

cagggcaagg tgaactatcc tggtgtgctg gtatcgaagg aggattacga ggctgctcct    840

gcagagatca aggatcagct gaagccttac accgagcagg tatgggatat caacaccatc    900

cgtgcacagt ttaaggatgc catcgaggct gccaagaagt acgacctgca gctgttctgc    960

ggtgagtggg gtgtttacga gccagtggac cgtgagttgg cttacaactg gacacgcgat   1020

atgctgaccg tattcgacga gtataacatt gcctggacaa cctggtgcta cgatgccgac   1080

tttggtttct gggatcagca gcgccacacc ttcaaggacc gtccattggt tgagttgctg   1140

atgagtggca aaaaactggg agacgaatga                                    1170


<210> 48
<211> 389
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (49)...(365)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 48
Met Arg Lys Asn Ile Leu Met Leu Ala Val Ala Met Ile Ala Ala Met
1               5                   10                  15      


Cys Leu Thr Thr Ser Cys Gly Asn Lys Ala Gln Lys Gln Asp Glu Thr
            20                  25                  30          


Gln Ala Gly Gln Val Asn Asn Phe Arg Ile Lys Arg Gly Thr Asn Ile
        35                  40                  45              


Ser His Trp Leu Ser Gln Ser Glu Gln Arg Gly Glu Ala Arg Arg Leu
    50                  55                  60                  


His Ile Gln Glu Asp Asp Phe Ala Arg Leu Glu Glu Leu Gly Phe Asp
65                  70                  75                  80  


Phe Val Arg Ile Pro Ile Asp Glu Val Gln Phe Trp Asp Glu Asp Gly
                85                  90                  95      


Lys Gln Leu Pro Glu Ala Trp Gly Leu Leu Asn Asn Ala Leu Asp Trp
            100                 105                 110         


Ala Lys Lys His Asn Leu Arg Ala Ile Val Asp Leu His Ile Ile Arg
        115                 120                 125             


Ser His Tyr Phe Asn Ala Ala Asn Glu Asp Asp Lys Ala Ala Asn Thr
    130                 135                 140                 


Leu Phe Thr Ser Glu Glu Ser Gln Gln Gly Leu Ile Asn Leu Trp Lys
145                 150                 155                 160 


Gln Leu Ser Asp Thr Leu Lys Asn Arg Ser Asn Asp Trp Val Ala Tyr
                165                 170                 175     


Glu Phe Met Asn Glu Pro Val Ala Pro Glu His Glu Gln Trp Asn Gln
            180                 185                 190         


Leu Val Ala Lys Val His Lys Ala Leu Arg Glu Leu Glu Pro Gln Arg
        195                 200                 205             


Thr Leu Val Ile Gly Ser Asn Met Trp Gln Gly His Glu Thr Met Lys
    210                 215                 220                 


Tyr Leu Lys Val Pro Glu Gly Asp Lys Asn Ile Ile Leu Ser Phe His
225                 230                 235                 240 


Tyr Tyr Asn Pro Met Ile Leu Thr His Tyr Gly Ala Trp Trp Thr Pro
                245                 250                 255     


Leu Gly Lys Tyr Gln Gly Lys Val Asn Tyr Pro Gly Val Leu Val Ser
            260                 265                 270         


Lys Glu Asp Tyr Glu Ala Ala Pro Ala Glu Ile Lys Asp Gln Leu Lys
        275                 280                 285             


Pro Tyr Thr Glu Gln Val Trp Asp Ile Asn Thr Ile Arg Ala Gln Phe
    290                 295                 300                 


Lys Asp Ala Ile Glu Ala Ala Lys Lys Tyr Asp Leu Gln Leu Phe Cys
305                 310                 315                 320 


Gly Glu Trp Gly Val Tyr Glu Pro Val Asp Arg Glu Leu Ala Tyr Asn
                325                 330                 335     


Trp Thr Arg Asp Met Leu Thr Val Phe Asp Glu Tyr Asn Ile Ala Trp
            340                 345                 350         


Thr Thr Trp Cys Tyr Asp Ala Asp Phe Gly Phe Trp Asp Gln Gln Arg
        355                 360                 365             


His Thr Phe Lys Asp Arg Pro Leu Val Glu Leu Leu Met Ser Gly Lys
    370                 375                 380                 


Lys Leu Gly Asp Glu
385                 


<210> 49
<211> 2325
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 49
atgaaaacga caaaagctgt tactttactg gcaatgggcg gcgcactctt tgcgcttaca     60

gcctgcaacg gacagaaatg gacggaaagg caagtggatt ctttcatgct ggtcacacaa    120

aagggtggcc ctacattggg ctattcgccg cagtcgggcg ttaaaatact gaccgaagac    180

gggtatgcct ttaaggacct gaaccgtaat gactcgctgg atgactacga agactggcga    240

ctaacggcac aggaacgggc cgcagacctg gcaaagaaac tctcggtgga ggagattgcc    300

ggactgatgc tctacagcag tcatcagtcg gtgccgatgg tttcgcacat gggtttcggg    360

gaatccactt acggcgggaa gtcgtttaag gaaagcggag ccaaaccttc cgacctctcc    420

gacgaccaga agaaattcct gcgcgacgat catctgcggg ccgtcctgat gacgggcgtg    480

gagagtcctg aggtggctgc ccgctggaac aacaatatgc aggcctacgt ggaagcactc    540

gaccacggca ttccggccaa caccagttcc gacccgcgcc atgaagccaa ggccacgacc    600

gaatataatg ccggagcggg cgggcaaatc tcactctggc ctacttcgct cggactggct    660

gccacgttca atccgcaatt agtttatcgt ttcggcgaga tagcatccga ggaatatcgc    720

gcgctgggca ttgccactgc cctttcgccg caggtggaca tcgccacgga tccccgctgg    780

acgcgtttca acggcacttt cggagaagac cctcaactgg ccacggatat ggcgcaggct    840

tactgcgatg cgttccagac aactcccgaa accaaaggat ggggcacgaa gagcgtcaat    900

gcaatggtga agcattggta tggatacggg gctcaggagg gaggtcgcga ctcccatctt    960

gcctccggaa aatatgccgt atatccaggc aagaacctgg ctatgcataa acgttcgttc   1020

acggagggcg ctttccggct gaaaggcggc acggagatgg cttcggccgt gatgcccatc   1080

tatagcattc tctggaatca agacccgtcg gacgagaacg tgggcggcag ctatagccgc   1140

tggctcattc agcaacaact ccgcgacgaa gcaaagtttg agggagtggt ctgcaccgac   1200

tggggcatca cgaaggacat gaaggtgctt gacagtccga gaggcggcaa gccctggggc   1260

gcggaatcgc tcaccgagac cgagcgccac tacaagattc ttcaggcggg cgtggatcag   1320

tttggcggca acaacgagat tggtcctgtg ctcgaggcct acaagatgtg ggctaaggct   1380

caaggcgagg ccagcgcccg cgagcgtttc gaacagtctg cacgccgact gctgctcaat   1440

gttttccgcg tgggactgtt tgagaatcca tatttagacc ctgccgagtc gcagaagacc   1500

gtgggtaatc ctgaatttat gaaagcgggc tacgaagctc agctgaaatc tatcgtgatg   1560

ctgaagaacc acgccaacca gacgcttccc gtgaaagaga agaaggtgta tgttcccaag   1620

cgccacttcc ccgccatccc aggtctctgg ggaggcatct cggaggaaaa gacggtggag   1680

cccatcgacc tcgccttggt gggaaagtat ttcgaagtgg tgaaagaacc gggagaagcc   1740

gactttgcca tctgtctgat agaagaaccc agcgccggtt tcggctatag cacggctgac   1800

gtgaagagcg gcggcaatgg ttacgtaccc tatagcctgc aatacgacga ctatacggcc   1860

gaccatgccc gcagtgtgag cattgccggc ggcgacccca tggagaaatt caccaaccgg   1920

agtttcaagg gcaaaacggt aaagacctac aaccgcgatg acatgctgct ggtgaggaac   1980

acgaaaaagg agatgggcga aaagcccgtc attgtcgttt tggagactgg ccgcccggtg   2040

gtgcttttag aaatagaacc ctttgccgat gctattctgg tgtccttcaa cgtgcagcat   2100

caggcattgc tcgacatcat cagcggaaag gcagaacctt cagccttgtt gcccatgcag   2160

atgcctgccg acatgaagac ggtggaagag caactggagg acatgccgcg cgacatgcgc   2220

tgctatcatg atgccgacgg gcacacctac gacttcgctt tcggcatgaa ctggcaggga   2280

gtgatcagcg atgagcgagt tatgaaatat ggggaaaagc cctaa                   2325


<210> 50
<211> 774
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (176)...(443)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (518)...(760)
<223> Glycosyl hydrolase family 3 C terminal domain

<400> 50
Met Lys Thr Thr Lys Ala Val Thr Leu Leu Ala Met Gly Gly Ala Leu
1               5                   10                  15      


Phe Ala Leu Thr Ala Cys Asn Gly Gln Lys Trp Thr Glu Arg Gln Val
            20                  25                  30          


Asp Ser Phe Met Leu Val Thr Gln Lys Gly Gly Pro Thr Leu Gly Tyr
        35                  40                  45              


Ser Pro Gln Ser Gly Val Lys Ile Leu Thr Glu Asp Gly Tyr Ala Phe
    50                  55                  60                  


Lys Asp Leu Asn Arg Asn Asp Ser Leu Asp Asp Tyr Glu Asp Trp Arg
65                  70                  75                  80  


Leu Thr Ala Gln Glu Arg Ala Ala Asp Leu Ala Lys Lys Leu Ser Val
                85                  90                  95      


Glu Glu Ile Ala Gly Leu Met Leu Tyr Ser Ser His Gln Ser Val Pro
            100                 105                 110         


Met Val Ser His Met Gly Phe Gly Glu Ser Thr Tyr Gly Gly Lys Ser
        115                 120                 125             


Phe Lys Glu Ser Gly Ala Lys Pro Ser Asp Leu Ser Asp Asp Gln Lys
    130                 135                 140                 


Lys Phe Leu Arg Asp Asp His Leu Arg Ala Val Leu Met Thr Gly Val
145                 150                 155                 160 


Glu Ser Pro Glu Val Ala Ala Arg Trp Asn Asn Asn Met Gln Ala Tyr
                165                 170                 175     


Val Glu Ala Leu Asp His Gly Ile Pro Ala Asn Thr Ser Ser Asp Pro
            180                 185                 190         


Arg His Glu Ala Lys Ala Thr Thr Glu Tyr Asn Ala Gly Ala Gly Gly
        195                 200                 205             


Gln Ile Ser Leu Trp Pro Thr Ser Leu Gly Leu Ala Ala Thr Phe Asn
    210                 215                 220                 


Pro Gln Leu Val Tyr Arg Phe Gly Glu Ile Ala Ser Glu Glu Tyr Arg
225                 230                 235                 240 


Ala Leu Gly Ile Ala Thr Ala Leu Ser Pro Gln Val Asp Ile Ala Thr
                245                 250                 255     


Asp Pro Arg Trp Thr Arg Phe Asn Gly Thr Phe Gly Glu Asp Pro Gln
            260                 265                 270         


Leu Ala Thr Asp Met Ala Gln Ala Tyr Cys Asp Ala Phe Gln Thr Thr
        275                 280                 285             


Pro Glu Thr Lys Gly Trp Gly Thr Lys Ser Val Asn Ala Met Val Lys
    290                 295                 300                 


His Trp Tyr Gly Tyr Gly Ala Gln Glu Gly Gly Arg Asp Ser His Leu
305                 310                 315                 320 


Ala Ser Gly Lys Tyr Ala Val Tyr Pro Gly Lys Asn Leu Ala Met His
                325                 330                 335     


Lys Arg Ser Phe Thr Glu Gly Ala Phe Arg Leu Lys Gly Gly Thr Glu
            340                 345                 350         


Met Ala Ser Ala Val Met Pro Ile Tyr Ser Ile Leu Trp Asn Gln Asp
        355                 360                 365             


Pro Ser Asp Glu Asn Val Gly Gly Ser Tyr Ser Arg Trp Leu Ile Gln
    370                 375                 380                 


Gln Gln Leu Arg Asp Glu Ala Lys Phe Glu Gly Val Val Cys Thr Asp
385                 390                 395                 400 


Trp Gly Ile Thr Lys Asp Met Lys Val Leu Asp Ser Pro Arg Gly Gly
                405                 410                 415     


Lys Pro Trp Gly Ala Glu Ser Leu Thr Glu Thr Glu Arg His Tyr Lys
            420                 425                 430         


Ile Leu Gln Ala Gly Val Asp Gln Phe Gly Gly Asn Asn Glu Ile Gly
        435                 440                 445             


Pro Val Leu Glu Ala Tyr Lys Met Trp Ala Lys Ala Gln Gly Glu Ala
    450                 455                 460                 


Ser Ala Arg Glu Arg Phe Glu Gln Ser Ala Arg Arg Leu Leu Leu Asn
465                 470                 475                 480 


Val Phe Arg Val Gly Leu Phe Glu Asn Pro Tyr Leu Asp Pro Ala Glu
                485                 490                 495     


Ser Gln Lys Thr Val Gly Asn Pro Glu Phe Met Lys Ala Gly Tyr Glu
            500                 505                 510         


Ala Gln Leu Lys Ser Ile Val Met Leu Lys Asn His Ala Asn Gln Thr
        515                 520                 525             


Leu Pro Val Lys Glu Lys Lys Val Tyr Val Pro Lys Arg His Phe Pro
    530                 535                 540                 


Ala Ile Pro Gly Leu Trp Gly Gly Ile Ser Glu Glu Lys Thr Val Glu
545                 550                 555                 560 


Pro Ile Asp Leu Ala Leu Val Gly Lys Tyr Phe Glu Val Val Lys Glu
                565                 570                 575     


Pro Gly Glu Ala Asp Phe Ala Ile Cys Leu Ile Glu Glu Pro Ser Ala
            580                 585                 590         


Gly Phe Gly Tyr Ser Thr Ala Asp Val Lys Ser Gly Gly Asn Gly Tyr
        595                 600                 605             


Val Pro Tyr Ser Leu Gln Tyr Asp Asp Tyr Thr Ala Asp His Ala Arg
    610                 615                 620                 


Ser Val Ser Ile Ala Gly Gly Asp Pro Met Glu Lys Phe Thr Asn Arg
625                 630                 635                 640 


Ser Phe Lys Gly Lys Thr Val Lys Thr Tyr Asn Arg Asp Asp Met Leu
                645                 650                 655     


Leu Val Arg Asn Thr Lys Lys Glu Met Gly Glu Lys Pro Val Ile Val
            660                 665                 670         


Val Leu Glu Thr Gly Arg Pro Val Val Leu Leu Glu Ile Glu Pro Phe
        675                 680                 685             


Ala Asp Ala Ile Leu Val Ser Phe Asn Val Gln His Gln Ala Leu Leu
    690                 695                 700                 


Asp Ile Ile Ser Gly Lys Ala Glu Pro Ser Ala Leu Leu Pro Met Gln
705                 710                 715                 720 


Met Pro Ala Asp Met Lys Thr Val Glu Glu Gln Leu Glu Asp Met Pro
                725                 730                 735     


Arg Asp Met Arg Cys Tyr His Asp Ala Asp Gly His Thr Tyr Asp Phe
            740                 745                 750         


Ala Phe Gly Met Asn Trp Gln Gly Val Ile Ser Asp Glu Arg Val Met
        755                 760                 765             


Lys Tyr Gly Glu Lys Pro
    770                 


<210> 51
<211> 1638
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 51
atgaagggtt ccatctccta ccagatctac aagggggctc tcctcctctc ctccctgctg     60

gcctccgtct cggcccaggg cgccggcact ctgactgccg agtcccaccc ggccttgacc    120

tggcagaagt gctctgccgg tggcagctgc acccccgtca gcggtagtgt cgtcattgac    180

gccaactggc gctgggttca cgacaagaac ggcaagaact gctacactgg taacacctgg    240

gacgcgaccc tgtgtcccga tgacaagacc tgcgccgcca actgtgccgt cgacggtgcc    300

agctacgcga gcacctacgg tgtgaccacc agtggcaact ctctgcgcat caactttgtc    360

acccaagctt cccagaagaa catcggttct cgtctctacc tgctggagaa cgacaccact    420

taccagaagt tcaacctcct gaaccaggag ttcacctttg acgtggatgt ttccaacctg    480

ccctgcggtc tcaacggtgc cctctacttc gtcgacatgg acgccgatgg tggtatggcc    540

aagtacccaa ccaacaaggc cggtgccaag tacggtactg gatactgtga ctctcagtgc    600

ccccgtgacc tcaagttcat caacggtatc gccaacgtcg agggctggac tccctcctcc    660

aacgacccca actcgggtgt cggtggccac ggtacttgct gtgccgagat ggacatttgg    720

gaggccaact ccatctccga ggccctcact cctcaccctt gcgacacccc cggccagacc    780

atgtgcgagg gcaacgcctg cggtggtacc tacagcaacg accgttacgc tggtacttgc    840

gatcctgatg ggtgtgactt caacccgtac cgtcagggtg tgaccaactt ctacggtccc    900

ggcatgaccg tggacaccaa gtctcccttc accgtggtga cccagttcct caccgacgac    960

ggtacctcca ccggtaccct gagcgagatc aagcgcttct acgtccagaa cggcaaggtg   1020

atcggccagc cccagtccac cgtcgccggt gtcagcggta actccatcac cgactccttc   1080

tgcaaggccc agaaggccgc ctttggcgat accgatgact tcaccaagca cggtgccctg   1140

gccggtatgg gtgccgcctt tgaggagggc atggtcctgg tcatgagtct ctgggatgac   1200

cacaactcca acatgctctg gctcgacagc acctacccca ccaccgccag ctccaccacc   1260

ctcggtgcca agcgcggctc ttgcgacatc tcctccggtg ctcccaacga cgttgagtcc   1320

cagaatgcca actcctacgt tgtcttctcc aacatcaagg ctggtcccat tggctccacc   1380

ttcaacagcg gctccaccgg cggcggcaac ggcagcggct ccaccaccac caccaagggt   1440

tccaccacca ccaccaaggc cccaaccacc accaccacta ccagcaaggc caccaccacc   1500

actgctgcct ctggcggtaa cggcggcggt gctgctcact gggcccagcg cggtggtgtt   1560

ggctacaccg gtcccaccac ctgtgccagc ccttacacct gcaccaagca gaacgagtac   1620

tactcccagt gcctgtaa                                                 1638


<210> 52
<211> 545
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(27)

<220> 
<221> DOMAIN
<222> (27)...(463)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (513)...(541)
<223> Fungal cellulose binding domain

<400> 52
Met Lys Gly Ser Ile Ser Tyr Gln Ile Tyr Lys Gly Ala Leu Leu Leu
1               5                   10                  15      


Ser Ser Leu Leu Ala Ser Val Ser Ala Gln Gly Ala Gly Thr Leu Thr
            20                  25                  30          


Ala Glu Ser His Pro Ala Leu Thr Trp Gln Lys Cys Ser Ala Gly Gly
        35                  40                  45              


Ser Cys Thr Pro Val Ser Gly Ser Val Val Ile Asp Ala Asn Trp Arg
    50                  55                  60                  


Trp Val His Asp Lys Asn Gly Lys Asn Cys Tyr Thr Gly Asn Thr Trp
65                  70                  75                  80  


Asp Ala Thr Leu Cys Pro Asp Asp Lys Thr Cys Ala Ala Asn Cys Ala
                85                  90                  95      


Val Asp Gly Ala Ser Tyr Ala Ser Thr Tyr Gly Val Thr Thr Ser Gly
            100                 105                 110         


Asn Ser Leu Arg Ile Asn Phe Val Thr Gln Ala Ser Gln Lys Asn Ile
        115                 120                 125             


Gly Ser Arg Leu Tyr Leu Leu Glu Asn Asp Thr Thr Tyr Gln Lys Phe
    130                 135                 140                 


Asn Leu Leu Asn Gln Glu Phe Thr Phe Asp Val Asp Val Ser Asn Leu
145                 150                 155                 160 


Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val Asp Met Asp Ala Asp
                165                 170                 175     


Gly Gly Met Ala Lys Tyr Pro Thr Asn Lys Ala Gly Ala Lys Tyr Gly
            180                 185                 190         


Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp Leu Lys Phe Ile Asn
        195                 200                 205             


Gly Ile Ala Asn Val Glu Gly Trp Thr Pro Ser Ser Asn Asp Pro Asn
    210                 215                 220                 


Ser Gly Val Gly Gly His Gly Thr Cys Cys Ala Glu Met Asp Ile Trp
225                 230                 235                 240 


Glu Ala Asn Ser Ile Ser Glu Ala Leu Thr Pro His Pro Cys Asp Thr
                245                 250                 255     


Pro Gly Gln Thr Met Cys Glu Gly Asn Ala Cys Gly Gly Thr Tyr Ser
            260                 265                 270         


Asn Asp Arg Tyr Ala Gly Thr Cys Asp Pro Asp Gly Cys Asp Phe Asn
        275                 280                 285             


Pro Tyr Arg Gln Gly Val Thr Asn Phe Tyr Gly Pro Gly Met Thr Val
    290                 295                 300                 


Asp Thr Lys Ser Pro Phe Thr Val Val Thr Gln Phe Leu Thr Asp Asp
305                 310                 315                 320 


Gly Thr Ser Thr Gly Thr Leu Ser Glu Ile Lys Arg Phe Tyr Val Gln
                325                 330                 335     


Asn Gly Lys Val Ile Gly Gln Pro Gln Ser Thr Val Ala Gly Val Ser
            340                 345                 350         


Gly Asn Ser Ile Thr Asp Ser Phe Cys Lys Ala Gln Lys Ala Ala Phe
        355                 360                 365             


Gly Asp Thr Asp Asp Phe Thr Lys His Gly Ala Leu Ala Gly Met Gly
    370                 375                 380                 


Ala Ala Phe Glu Glu Gly Met Val Leu Val Met Ser Leu Trp Asp Asp
385                 390                 395                 400 


His Asn Ser Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Thr Ala
                405                 410                 415     


Ser Ser Thr Thr Leu Gly Ala Lys Arg Gly Ser Cys Asp Ile Ser Ser
            420                 425                 430         


Gly Ala Pro Asn Asp Val Glu Ser Gln Asn Ala Asn Ser Tyr Val Val
        435                 440                 445             


Phe Ser Asn Ile Lys Ala Gly Pro Ile Gly Ser Thr Phe Asn Ser Gly
    450                 455                 460                 


Ser Thr Gly Gly Gly Asn Gly Ser Gly Ser Thr Thr Thr Thr Lys Gly
465                 470                 475                 480 


Ser Thr Thr Thr Thr Lys Ala Pro Thr Thr Thr Thr Thr Thr Ser Lys
                485                 490                 495     


Ala Thr Thr Thr Thr Ala Ala Ser Gly Gly Asn Gly Gly Gly Ala Ala
            500                 505                 510         


His Trp Ala Gln Arg Gly Gly Val Gly Tyr Thr Gly Pro Thr Thr Cys
        515                 520                 525             


Ala Ser Pro Tyr Thr Cys Thr Lys Gln Asn Glu Tyr Tyr Ser Gln Cys
    530                 535                 540                 


Leu
545 


<210> 53
<211> 1590
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 53
atgtctgcct tgaactcttt caatatgtac aagagcgccc tcatcttggg ctccttgctg     60

gcaacagctg gtgctcagca aattggtact tataccgctg aaacccatcc ctctttgagc    120

tggtctactt gcaaatcggg tggtagctgc accacaaact ccggtgccat tacgttggat    180

gccaactggc gttgggtcca tggtgtcaat accagcacca actgctacac tggcaacact    240

tggaataccg ccatctgcga cactgatgca tcctgtgccc aggactgtgc tcttgatggt    300

gctgactact ctggcacgta cggtatcact acctccggca actcattgcg cctgaacttc    360

gttaccggtt ccaacgtcgg atctcgtacc tacctgatgg ccgataacac ccactaccaa    420

atcttcgact tgttgaacca ggagttcact ttcaccgtcg atgtctccca cctcccttgc    480

ggtttgaacg gtgccctcta cttcgtgacc atggacgccg acggtggcgt ctccaagtac    540

cccaacaaca aggccggcgc tcagtacggt gttggatact gtgactctca atgccctcgt    600

gacttgaaat tcatcgctgg tcaggccaac gttgagggct ggacgccctc ctccaacaac    660

gccaacactg gacttggcaa ccacggagct tgctgcgcag agcttgatat ctgggaggca    720

aacagcatct cagaggcttt gactcctcac ccttgcgata cacccggtct atctgtttgc    780

actactgatg cctgcggtgg tacctacagc tccgatcgtt acgccggtac ctgcgaccct    840

gatggatgtg acttcaaccc ttaccgtctt ggtgtcactg acttctacgg ctccggcaag    900

accgttgaca ccaccaaacc catcaccgtt gtgactcaat tcgtcactga cgacggcaca    960

tccaccggca ccctctccga gatcagacgt tactacgttc agaacggtgt tgtcatcccc   1020

cagccttcct ccaagatctc cggagtcagc ggaaatgtca tcaactccga cttctgcgat   1080

gctgagatct ccacctttgg cgagactgcc tccttcagca aacacggtgg cctggcaaag   1140

atgggcgctg gtatggaagc tggtatggtc ttggtcatga gtttgtggga cgactactcc   1200

gtcaacatgc tctggctcga cagcacctac cctacaaacg cgactggtac ccccggtgcc   1260

gctcgtggtt cctgccctac cacttctggg gaccctaaga ccgttgaatc acaatccggc   1320

agctcctatg tcaccttttc tgacatccgg gttggtcctt tcaactctac gttcagcggt   1380

ggttctagca ccggtggcag ctccactact accgccagcg gcaccaccac caccaaggcc   1440

tcttccacct ctacttccag cacctctacc ggcactggag tcgctgctca ctggggtcag   1500

tgtggtggcc agggttggac tggtcctacc acctgcgcta gtggaaccac atgcaccgtc   1560

gtgaaccctt actactctca atgtttgtag                                    1590


<210> 54
<211> 529
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (27)...(460)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (497)...(525)
<223> Fungal cellulose binding domain

<400> 54
Met Ser Ala Leu Asn Ser Phe Asn Met Tyr Lys Ser Ala Leu Ile Leu
1               5                   10                  15      


Gly Ser Leu Leu Ala Thr Ala Gly Ala Gln Gln Ile Gly Thr Tyr Thr
            20                  25                  30          


Ala Glu Thr His Pro Ser Leu Ser Trp Ser Thr Cys Lys Ser Gly Gly
        35                  40                  45              


Ser Cys Thr Thr Asn Ser Gly Ala Ile Thr Leu Asp Ala Asn Trp Arg
    50                  55                  60                  


Trp Val His Gly Val Asn Thr Ser Thr Asn Cys Tyr Thr Gly Asn Thr
65                  70                  75                  80  


Trp Asn Thr Ala Ile Cys Asp Thr Asp Ala Ser Cys Ala Gln Asp Cys
                85                  90                  95      


Ala Leu Asp Gly Ala Asp Tyr Ser Gly Thr Tyr Gly Ile Thr Thr Ser
            100                 105                 110         


Gly Asn Ser Leu Arg Leu Asn Phe Val Thr Gly Ser Asn Val Gly Ser
        115                 120                 125             


Arg Thr Tyr Leu Met Ala Asp Asn Thr His Tyr Gln Ile Phe Asp Leu
    130                 135                 140                 


Leu Asn Gln Glu Phe Thr Phe Thr Val Asp Val Ser His Leu Pro Cys
145                 150                 155                 160 


Gly Leu Asn Gly Ala Leu Tyr Phe Val Thr Met Asp Ala Asp Gly Gly
                165                 170                 175     


Val Ser Lys Tyr Pro Asn Asn Lys Ala Gly Ala Gln Tyr Gly Val Gly
            180                 185                 190         


Tyr Cys Asp Ser Gln Cys Pro Arg Asp Leu Lys Phe Ile Ala Gly Gln
        195                 200                 205             


Ala Asn Val Glu Gly Trp Thr Pro Ser Ser Asn Asn Ala Asn Thr Gly
    210                 215                 220                 


Leu Gly Asn His Gly Ala Cys Cys Ala Glu Leu Asp Ile Trp Glu Ala
225                 230                 235                 240 


Asn Ser Ile Ser Glu Ala Leu Thr Pro His Pro Cys Asp Thr Pro Gly
                245                 250                 255     


Leu Ser Val Cys Thr Thr Asp Ala Cys Gly Gly Thr Tyr Ser Ser Asp
            260                 265                 270         


Arg Tyr Ala Gly Thr Cys Asp Pro Asp Gly Cys Asp Phe Asn Pro Tyr
        275                 280                 285             


Arg Leu Gly Val Thr Asp Phe Tyr Gly Ser Gly Lys Thr Val Asp Thr
    290                 295                 300                 


Thr Lys Pro Ile Thr Val Val Thr Gln Phe Val Thr Asp Asp Gly Thr
305                 310                 315                 320 


Ser Thr Gly Thr Leu Ser Glu Ile Arg Arg Tyr Tyr Val Gln Asn Gly
                325                 330                 335     


Val Val Ile Pro Gln Pro Ser Ser Lys Ile Ser Gly Val Ser Gly Asn
            340                 345                 350         


Val Ile Asn Ser Asp Phe Cys Asp Ala Glu Ile Ser Thr Phe Gly Glu
        355                 360                 365             


Thr Ala Ser Phe Ser Lys His Gly Gly Leu Ala Lys Met Gly Ala Gly
    370                 375                 380                 


Met Glu Ala Gly Met Val Leu Val Met Ser Leu Trp Asp Asp Tyr Ser
385                 390                 395                 400 


Val Asn Met Leu Trp Leu Asp Ser Thr Tyr Pro Thr Asn Ala Thr Gly
                405                 410                 415     


Thr Pro Gly Ala Ala Arg Gly Ser Cys Pro Thr Thr Ser Gly Asp Pro
            420                 425                 430         


Lys Thr Val Glu Ser Gln Ser Gly Ser Ser Tyr Val Thr Phe Ser Asp
        435                 440                 445             


Ile Arg Val Gly Pro Phe Asn Ser Thr Phe Ser Gly Gly Ser Ser Thr
    450                 455                 460                 


Gly Gly Ser Ser Thr Thr Thr Ala Ser Gly Thr Thr Thr Thr Lys Ala
465                 470                 475                 480 


Ser Ser Thr Ser Thr Ser Ser Thr Ser Thr Gly Thr Gly Val Ala Ala
                485                 490                 495     


His Trp Gly Gln Cys Gly Gly Gln Gly Trp Thr Gly Pro Thr Thr Cys
            500                 505                 510         


Ala Ser Gly Thr Thr Cys Thr Val Val Asn Pro Tyr Tyr Ser Gln Cys
        515                 520                 525             


Leu
    


<210> 55
<211> 1356
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 55
atgtatcgaa ctcttgcaat tgcctcctcc attctcgccg tggcccaagg ccagctggct     60

ggcacccaga cgaccgaaac ccatcctggt atgacatggc agcaatgcac tgccaagggt    120

agctgcacta ccaagaatgg caagattgtg ctggactcca attggcgctg gcttcacaca    180

aagacgggat acaccaattg ttacaccgac aacaaatggg actcgtccat ttgcgccgac    240

aacaaggctt gcgcaacggc ctgcgcgctc gatggcgcag attacaaggg cacctacgga    300

atccaggcta gcggaaactc cctgaagttg accttcgtca ctaagggatc gtattccacc    360

aacattggct cccgtactta tatgatgaag gacgactcca catacgagat gttcaagttc    420

aacaaccaag agtttacgtt tgatgttgac ctttctaacc ttccatgcgg cttgaacggc    480

gctctatact ttgtgtccat ggacgccgac ggcggcatga agaagtactc gaccaacaag    540

gctggcgcca agtacggtac tggttactgc gatgctcagt gcccacgtga tctcaagttt    600

atcaatggcg agggtaacat agagaattgg cagccatctt ccaacgacgc caatgctggc    660

gtcggtgggc acggctcatg ctgtgccgag atggacatct gggaggccaa ctccgtctct    720

gccgcctata ctccccactc ttgctccacc atcgagcaaa gccgatgcga tggcgattcc    780

tgcggtggaa catactccgc cgagcgttat gcaggcgtgt gcgatcccga tggctgcgat    840

ttcaactcgt accgtatggg cgacaagacc ttttacggca agggcaagac cgtcgacacg    900

agcaagaaat tcacggtcgt tacccagttc atcggtagcg gtgccaatat ggagatcaag    960

cgcttctacg ttcaaaacgg caaggtcatt cccaattcga tgtctcaaat tcctggtgtc   1020

gagggcaatt ccatcactac caagttttgc gatcagcaga aggaagtatt tggagacagg   1080

tacaccttca aggaaaaggg cggcatggcg ggcatggcat cggctctgtc taaaggcatg   1140

gtcttggtca tgtcgctgtg ggatgaccat aactccaaca tgctctggct cgactccacc   1200

ttccccaccg acaaggaccc aagcgttccc ggtattggcc gtggagaatg cgacatcaca   1260

tctggcgtgc ctgccgatgt cgagtccaag tctgcttcgg cttccgtgac ctactccaac   1320

attcgctacg gtcccatcaa ctccaccttt ggttag                             1356


<210> 56
<211> 451
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(20)

<220> 
<221> DOMAIN
<222> (19)...(451)
<223> Glycosyl hydrolase family 7

<400> 56
Met Tyr Arg Thr Leu Ala Ile Ala Ser Ser Ile Leu Ala Val Ala Gln
1               5                   10                  15      


Gly Gln Leu Ala Gly Thr Gln Thr Thr Glu Thr His Pro Gly Met Thr
            20                  25                  30          


Trp Gln Gln Cys Thr Ala Lys Gly Ser Cys Thr Thr Lys Asn Gly Lys
        35                  40                  45              


Ile Val Leu Asp Ser Asn Trp Arg Trp Leu His Thr Lys Thr Gly Tyr
    50                  55                  60                  


Thr Asn Cys Tyr Thr Asp Asn Lys Trp Asp Ser Ser Ile Cys Ala Asp
65                  70                  75                  80  


Asn Lys Ala Cys Ala Thr Ala Cys Ala Leu Asp Gly Ala Asp Tyr Lys
                85                  90                  95      


Gly Thr Tyr Gly Ile Gln Ala Ser Gly Asn Ser Leu Lys Leu Thr Phe
            100                 105                 110         


Val Thr Lys Gly Ser Tyr Ser Thr Asn Ile Gly Ser Arg Thr Tyr Met
        115                 120                 125             


Met Lys Asp Asp Ser Thr Tyr Glu Met Phe Lys Phe Asn Asn Gln Glu
    130                 135                 140                 


Phe Thr Phe Asp Val Asp Leu Ser Asn Leu Pro Cys Gly Leu Asn Gly
145                 150                 155                 160 


Ala Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Met Lys Lys Tyr
                165                 170                 175     


Ser Thr Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ala
            180                 185                 190         


Gln Cys Pro Arg Asp Leu Lys Phe Ile Asn Gly Glu Gly Asn Ile Glu
        195                 200                 205             


Asn Trp Gln Pro Ser Ser Asn Asp Ala Asn Ala Gly Val Gly Gly His
    210                 215                 220                 


Gly Ser Cys Cys Ala Glu Met Asp Ile Trp Glu Ala Asn Ser Val Ser
225                 230                 235                 240 


Ala Ala Tyr Thr Pro His Ser Cys Ser Thr Ile Glu Gln Ser Arg Cys
                245                 250                 255     


Asp Gly Asp Ser Cys Gly Gly Thr Tyr Ser Ala Glu Arg Tyr Ala Gly
            260                 265                 270         


Val Cys Asp Pro Asp Gly Cys Asp Phe Asn Ser Tyr Arg Met Gly Asp
        275                 280                 285             


Lys Thr Phe Tyr Gly Lys Gly Lys Thr Val Asp Thr Ser Lys Lys Phe
    290                 295                 300                 


Thr Val Val Thr Gln Phe Ile Gly Ser Gly Ala Asn Met Glu Ile Lys
305                 310                 315                 320 


Arg Phe Tyr Val Gln Asn Gly Lys Val Ile Pro Asn Ser Met Ser Gln
                325                 330                 335     


Ile Pro Gly Val Glu Gly Asn Ser Ile Thr Thr Lys Phe Cys Asp Gln
            340                 345                 350         


Gln Lys Glu Val Phe Gly Asp Arg Tyr Thr Phe Lys Glu Lys Gly Gly
        355                 360                 365             


Met Ala Gly Met Ala Ser Ala Leu Ser Lys Gly Met Val Leu Val Met
    370                 375                 380                 


Ser Leu Trp Asp Asp His Asn Ser Asn Met Leu Trp Leu Asp Ser Thr
385                 390                 395                 400 


Phe Pro Thr Asp Lys Asp Pro Ser Val Pro Gly Ile Gly Arg Gly Glu
                405                 410                 415     


Cys Asp Ile Thr Ser Gly Val Pro Ala Asp Val Glu Ser Lys Ser Ala
            420                 425                 430         


Ser Ala Ser Val Thr Tyr Ser Asn Ile Arg Tyr Gly Pro Ile Asn Ser
        435                 440                 445             


Thr Phe Gly
    450     


<210> 57
<211> 1977
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 57
atgacaacca gggtcgggcg ttgcgcccaa gctaagcttc tgctcggctt ttgcgccttg     60

gcgctggcct cctgccagac ggcgacgacc ggcaccgttc cggtcgtccc cgtcggtccg    120

gtcacgcatg tcgccgccag cgatacgctc aatgagcggg tccggtcgat cgtcgcaagc    180

atgacgctgg aacagaagat cgggcaaatg acgcagccgg atattcggtc cgtcacgccc    240

gacgacgttc ggcgttacta tatcgggtcc atcctcaacg gcggcggggc atggccgggc    300

atgaacatgc atgccacggt ggaggattgg ctcaagctgt cggactcatt ctaccgggcg    360

tcgatggcga cggacatggc ggtcaagatc cccgttatct ggggcaccga cgcggttcac    420

ggccacaaca atgtctatgg cgcaacgctg tttccgcaca atgtcggtct aggcgccgcg    480

cacgaccccg agctcatggt ccggatcggc cgcgcgaccg ccgagcaggt gcgcgcgacc    540

ggcataacct gggccttcgc accgacgctt gcggtcgtcc agaacccgcg ttggggccgc    600

agctacgaaa gctacagctc cgatccgcaa ctggtgcgcg cctatggcga agccatggtc    660

cgcgggctcc agggccagct tgggagcccg acgtccgtgc ttgcgaccgc gaagcattgg    720

atcggcgacg gcggcacctt ccacggcaag gatcagggcg aaacccgcac cagcgaagat    780

gaactgctga cggtccaggg cgcgggttat gccggggcac tggcagccaa tgtgcggacg    840

gtgatggtca gctactcgag cttcaccgat accgcgaccg gcaaggcatg gggcaagatg    900

cacggcaacg cgcatctgat cgacggagtg ctgaagcaaa agctgggttt cgacggcctt    960

gtcgtgagtg actggaacgg aatcgagcaa gtgccgggct gcaccaaatg gcactgcccg   1020

gaggcagtca acgccggaat cgacatggtc atggtgccgg atgactggaa gcagttcatc   1080

gcggcgacgc tcgacgatgt gcgtgccggc cgaatcccga tgagcaggat cgacgacgcc   1140

gtctcgcgca tcgttagggt caagcttcaa tccgggctgt tcgaaagctc gcccgcccgg   1200

gcccacaccg atgccgctgt gctgcattcg ccggccgtcg aggagctcgc gcgggaggcg   1260

gttcggaaat cgctcgtgct gttgaagaac gagggcggaa tcctgccgct tcgcccgacc   1320

ggaaagatcc tggtcgtcgg caagggcgcg gacaacctgc cgatgcaggc cggcggatgg   1380

tcgctgacct ggcagggcga caatagcgca accgccgact atccgaacgc cgacactctg   1440

ctgtccgcct tgcgcaagtc gctgggcgcc agccgcgtcg actatagcgc cgatggatcg   1500

gcaaaggtcg aaggctatag cgccgtcatc atggtcgccg cggaagaccc atatgccgaa   1560

gggaagggtg acatcgcctt cccggcaccg cttcgccaca gcgcccgtta ccccgccgac   1620

ctccaggccc tgaagcgcat cagcgggagc ggcgtcccgg tggtgacctt gctgttctcg   1680

ggccggccgt taccggtcaa tgacttgatc aaccggtccg acgcgttcgt ggctgcatgg   1740

ctgccaggaa ccgaaggcga agggattgcc gacatgctgg tcgcgccatc ggctcgtcgc   1800

gcgccttacg actttgtcgg caggcttccg ttcgactggc cggcgacaga ttgcctggga   1860

ctcggcgaca agccgctatt tgcgcgcggt tacgggctgg cgttgagcga ccgaaggaag   1920

ttgggacgat tgcccgagac ccccgtaccg gtggcttgcc cggccgatag ccgctag      1977


<210> 58
<211> 658
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(30)

<220> 
<221> DOMAIN
<222> (125)...(352)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (425)...(637)
<223> Glycosyl hydrolase family 3 C terminal domain

<400> 58
Met Thr Thr Arg Val Gly Arg Cys Ala Gln Ala Lys Leu Leu Leu Gly
1               5                   10                  15      


Phe Cys Ala Leu Ala Leu Ala Ser Cys Gln Thr Ala Thr Thr Gly Thr
            20                  25                  30          


Val Pro Val Val Pro Val Gly Pro Val Thr His Val Ala Ala Ser Asp
        35                  40                  45              


Thr Leu Asn Glu Arg Val Arg Ser Ile Val Ala Ser Met Thr Leu Glu
    50                  55                  60                  


Gln Lys Ile Gly Gln Met Thr Gln Pro Asp Ile Arg Ser Val Thr Pro
65                  70                  75                  80  


Asp Asp Val Arg Arg Tyr Tyr Ile Gly Ser Ile Leu Asn Gly Gly Gly
                85                  90                  95      


Ala Trp Pro Gly Met Asn Met His Ala Thr Val Glu Asp Trp Leu Lys
            100                 105                 110         


Leu Ser Asp Ser Phe Tyr Arg Ala Ser Met Ala Thr Asp Met Ala Val
        115                 120                 125             


Lys Ile Pro Val Ile Trp Gly Thr Asp Ala Val His Gly His Asn Asn
    130                 135                 140                 


Val Tyr Gly Ala Thr Leu Phe Pro His Asn Val Gly Leu Gly Ala Ala
145                 150                 155                 160 


His Asp Pro Glu Leu Met Val Arg Ile Gly Arg Ala Thr Ala Glu Gln
                165                 170                 175     


Val Arg Ala Thr Gly Ile Thr Trp Ala Phe Ala Pro Thr Leu Ala Val
            180                 185                 190         


Val Gln Asn Pro Arg Trp Gly Arg Ser Tyr Glu Ser Tyr Ser Ser Asp
        195                 200                 205             


Pro Gln Leu Val Arg Ala Tyr Gly Glu Ala Met Val Arg Gly Leu Gln
    210                 215                 220                 


Gly Gln Leu Gly Ser Pro Thr Ser Val Leu Ala Thr Ala Lys His Trp
225                 230                 235                 240 


Ile Gly Asp Gly Gly Thr Phe His Gly Lys Asp Gln Gly Glu Thr Arg
                245                 250                 255     


Thr Ser Glu Asp Glu Leu Leu Thr Val Gln Gly Ala Gly Tyr Ala Gly
            260                 265                 270         


Ala Leu Ala Ala Asn Val Arg Thr Val Met Val Ser Tyr Ser Ser Phe
        275                 280                 285             


Thr Asp Thr Ala Thr Gly Lys Ala Trp Gly Lys Met His Gly Asn Ala
    290                 295                 300                 


His Leu Ile Asp Gly Val Leu Lys Gln Lys Leu Gly Phe Asp Gly Leu
305                 310                 315                 320 


Val Val Ser Asp Trp Asn Gly Ile Glu Gln Val Pro Gly Cys Thr Lys
                325                 330                 335     


Trp His Cys Pro Glu Ala Val Asn Ala Gly Ile Asp Met Val Met Val
            340                 345                 350         


Pro Asp Asp Trp Lys Gln Phe Ile Ala Ala Thr Leu Asp Asp Val Arg
        355                 360                 365             


Ala Gly Arg Ile Pro Met Ser Arg Ile Asp Asp Ala Val Ser Arg Ile
    370                 375                 380                 


Val Arg Val Lys Leu Gln Ser Gly Leu Phe Glu Ser Ser Pro Ala Arg
385                 390                 395                 400 


Ala His Thr Asp Ala Ala Val Leu His Ser Pro Ala Val Glu Glu Leu
                405                 410                 415     


Ala Arg Glu Ala Val Arg Lys Ser Leu Val Leu Leu Lys Asn Glu Gly
            420                 425                 430         


Gly Ile Leu Pro Leu Arg Pro Thr Gly Lys Ile Leu Val Val Gly Lys
        435                 440                 445             


Gly Ala Asp Asn Leu Pro Met Gln Ala Gly Gly Trp Ser Leu Thr Trp
    450                 455                 460                 


Gln Gly Asp Asn Ser Ala Thr Ala Asp Tyr Pro Asn Ala Asp Thr Leu
465                 470                 475                 480 


Leu Ser Ala Leu Arg Lys Ser Leu Gly Ala Ser Arg Val Asp Tyr Ser
                485                 490                 495     


Ala Asp Gly Ser Ala Lys Val Glu Gly Tyr Ser Ala Val Ile Met Val
            500                 505                 510         


Ala Ala Glu Asp Pro Tyr Ala Glu Gly Lys Gly Asp Ile Ala Phe Pro
        515                 520                 525             


Ala Pro Leu Arg His Ser Ala Arg Tyr Pro Ala Asp Leu Gln Ala Leu
    530                 535                 540                 


Lys Arg Ile Ser Gly Ser Gly Val Pro Val Val Thr Leu Leu Phe Ser
545                 550                 555                 560 


Gly Arg Pro Leu Pro Val Asn Asp Leu Ile Asn Arg Ser Asp Ala Phe
                565                 570                 575     


Val Ala Ala Trp Leu Pro Gly Thr Glu Gly Glu Gly Ile Ala Asp Met
            580                 585                 590         


Leu Val Ala Pro Ser Ala Arg Arg Ala Pro Tyr Asp Phe Val Gly Arg
        595                 600                 605             


Leu Pro Phe Asp Trp Pro Ala Thr Asp Cys Leu Gly Leu Gly Asp Lys
    610                 615                 620                 


Pro Leu Phe Ala Arg Gly Tyr Gly Leu Ala Leu Ser Asp Arg Arg Lys
625                 630                 635                 640 


Leu Gly Arg Leu Pro Glu Thr Pro Val Pro Val Ala Cys Pro Ala Asp
                645                 650                 655     


Ser Arg
        


<210> 59
<211> 1425
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 59
atggccggac gaacgcttat aacactggcc tctttggcca cgttggcggc ttgtgcccct     60

gccgaagcaa gacagaactg tgcaacactt tggggacaat gtggtggcca aaactggact    120

ggtgcaactt gttgtgcctc cggcagctcc tgtgtagctc agaaccctta ttattcccag    180

tgtcttcctc agacatccgc aacttctgcg tcttccacaa cgcgtgcttc ctccaccact    240

tcacaagcaa gctcgactaa gaccagtgcc agtgtaactt ccactaccaa ggttgccagc    300

accaccacga ctgccccccc agtaggttcc ggtactgcga catggagcgg caacccgttt    360

tccggagtca acctttggcc aaacaactac tatgcttccg aagttagcag cttggcaatt    420

ccaagcttga ctggagccat ggccacagcc gcagcagcag ttgccaaggt gccttcgttc    480

atgtggctag acactttgtc caagacaccc cttatggaac agaccctttc agatattcgt    540

gctgcaaaca aggcaggcgg taattacgct gggcagttcg ttgtctacga cttgccagat    600

agagactgcg ccgccgcggc aagtaacgga gagtactcaa tcgcaaatgg aggcgtagcc    660

aactataaga actatattga tactatccgc ggcatcgtca ctacctactc ggacgttcga    720

attctcctag tcattgaacc tgattctctt gccaacctgg tgaccaacct aaatgttgcc    780

aagtgctcca acgcccaagc cgcgtatctg gagtgcgtca actacgccgt tacaaagctc    840

aacctcccca atgttgccat gtacctggat gcaggccatg caggctggct aggctggccc    900

gcaaaccaag accctgcggc tcaactcttc gccaatgtat acaagaacgc cggctcgccc    960

agctcgctgc gtggattggc caccaatgtg gccaactaca atgcatggga catatcttcg   1020

gcgccaccat atacccaggg caacgccgtt tacgacgaga agctgtacat tcatgccatg   1080

ggcccgctgc tggccaacca tggttggtcc aatgcttatt tcatcactga tcagggtcgc   1140

tccggtaagc agcccacggg tcaagcccaa tggggtgact ggtgcaacgc cattggcact   1200

ggatttggca ttcgcccctc tgcaaacact ggcgacacgc tgctcgatgc ctttgtgtgg   1260

gtgaagccgg gcggcgagtc tgatggcaca agcaatagca gcgccacgcg ctacgattat   1320

cactgcggac agtcggattc cttgcagcct gccccggaag ctggtacctg gttccaggct   1380

tatttcgttc aactcctgac caacgcgaac ccttcgttca tgtaa                   1425


<210> 60
<211> 474
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (30)...(58)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (127)...(440)
<223> Glycosyl hydrolases family 6

<400> 60
Met Ala Gly Arg Thr Leu Ile Thr Leu Ala Ser Leu Ala Thr Leu Ala
1               5                   10                  15      


Ala Cys Ala Pro Ala Glu Ala Arg Gln Asn Cys Ala Thr Leu Trp Gly
            20                  25                  30          


Gln Cys Gly Gly Gln Asn Trp Thr Gly Ala Thr Cys Cys Ala Ser Gly
        35                  40                  45              


Ser Ser Cys Val Ala Gln Asn Pro Tyr Tyr Ser Gln Cys Leu Pro Gln
    50                  55                  60                  


Thr Ser Ala Thr Ser Ala Ser Ser Thr Thr Arg Ala Ser Ser Thr Thr
65                  70                  75                  80  


Ser Gln Ala Ser Ser Thr Lys Thr Ser Ala Ser Val Thr Ser Thr Thr
                85                  90                  95      


Lys Val Ala Ser Thr Thr Thr Thr Ala Pro Pro Val Gly Ser Gly Thr
            100                 105                 110         


Ala Thr Trp Ser Gly Asn Pro Phe Ser Gly Val Asn Leu Trp Pro Asn
        115                 120                 125             


Asn Tyr Tyr Ala Ser Glu Val Ser Ser Leu Ala Ile Pro Ser Leu Thr
    130                 135                 140                 


Gly Ala Met Ala Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe
145                 150                 155                 160 


Met Trp Leu Asp Thr Leu Ser Lys Thr Pro Leu Met Glu Gln Thr Leu
                165                 170                 175     


Ser Asp Ile Arg Ala Ala Asn Lys Ala Gly Gly Asn Tyr Ala Gly Gln
            180                 185                 190         


Phe Val Val Tyr Asp Leu Pro Asp Arg Asp Cys Ala Ala Ala Ala Ser
        195                 200                 205             


Asn Gly Glu Tyr Ser Ile Ala Asn Gly Gly Val Ala Asn Tyr Lys Asn
    210                 215                 220                 


Tyr Ile Asp Thr Ile Arg Gly Ile Val Thr Thr Tyr Ser Asp Val Arg
225                 230                 235                 240 


Ile Leu Leu Val Ile Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn
                245                 250                 255     


Leu Asn Val Ala Lys Cys Ser Asn Ala Gln Ala Ala Tyr Leu Glu Cys
            260                 265                 270         


Val Asn Tyr Ala Val Thr Lys Leu Asn Leu Pro Asn Val Ala Met Tyr
        275                 280                 285             


Leu Asp Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gln Asp
    290                 295                 300                 


Pro Ala Ala Gln Leu Phe Ala Asn Val Tyr Lys Asn Ala Gly Ser Pro
305                 310                 315                 320 


Ser Ser Leu Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Ala Trp
                325                 330                 335     


Asp Ile Ser Ser Ala Pro Pro Tyr Thr Gln Gly Asn Ala Val Tyr Asp
            340                 345                 350         


Glu Lys Leu Tyr Ile His Ala Met Gly Pro Leu Leu Ala Asn His Gly
        355                 360                 365             


Trp Ser Asn Ala Tyr Phe Ile Thr Asp Gln Gly Arg Ser Gly Lys Gln
    370                 375                 380                 


Pro Thr Gly Gln Ala Gln Trp Gly Asp Trp Cys Asn Ala Ile Gly Thr
385                 390                 395                 400 


Gly Phe Gly Ile Arg Pro Ser Ala Asn Thr Gly Asp Thr Leu Leu Asp
                405                 410                 415     


Ala Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr Ser Asn
            420                 425                 430         


Ser Ser Ala Thr Arg Tyr Asp Tyr His Cys Gly Gln Ser Asp Ser Leu
        435                 440                 445             


Gln Pro Ala Pro Glu Ala Gly Thr Trp Phe Gln Ala Tyr Phe Val Gln
    450                 455                 460                 


Leu Leu Thr Asn Ala Asn Pro Ser Phe Met
465                 470                 


<210> 61
<211> 1074
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 61
atgaataaag aaagattgcg cggagtgaat ttgggcggat ggttcagtca ggttgactgc     60

atcgaagaaa aggatcctgt cggttttccg ggggtcgtag gtcacatcaa gacgttcctt    120

ggaacaagcg atttcaagcg catccacgat gcggggttca accacgttcg cttgccggtt    180

gactacttta atttatttaa aggcgacgac ctcaagccgg acgaagaaat ctttgcgctc    240

ctggacaagg cgctcaagga catccaggat gcagatctgg acgtgattct tgaccttcac    300

aagtgcccgg gtcacgattt ccacctcgcc agcaaccacg aacaagcttt ctttgccgat    360

gcgaacgccc gcaaggacac gcgcaagatc tgggctttca tggccgaacg ctacagctcc    420

atgccgcgcg tgatgatgga acttttgaat gaaccggctg caagcgattc caaggtttgg    480

gataaagtca aggacgaaat cttctgggaa atccgcaagc acgctccgaa gaacactatc    540

gtcgtaggcg ccaacaagtg gaacagcgcc agggaattcg aattcttgac accgctcgat    600

gacgacaacg ccatctacag cttccatacc tacacgccag tgacgtttac gcaccagggc    660

gccgcatgga tcgacgaccc gttcttcaag attgaacgcc cgtggccagg cgactacgcc    720

gcccccgaag caggcggcac gacacgtttg aatgtggaat atggcaagtg ggacaaggcc    780

cagttgcagg ccagcatcca gaacgccctc gatttccgcg ccaagtacga cttaccggta    840

agctgcaacg agttcggcgt ttacgtacaa gttccccgca aatatcagct tgcctggatg    900

cgcgacttcc tcgacatcct ccgcgacgcc gacgtgggtt acagctactg gaactacaag    960

aatctggact tcggccttgt ttcgaagggc gaatcgctcc acaacagtct agagcagtac   1020

aacaaccccg aacgcctcga ccgcgaactc atggaaatga ttgctaaggg gtaa         1074


<210> 62
<211> 357
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (17)...(327)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 62
Met Asn Lys Glu Arg Leu Arg Gly Val Asn Leu Gly Gly Trp Phe Ser
1               5                   10                  15      


Gln Val Asp Cys Ile Glu Glu Lys Asp Pro Val Gly Phe Pro Gly Val
            20                  25                  30          


Val Gly His Ile Lys Thr Phe Leu Gly Thr Ser Asp Phe Lys Arg Ile
        35                  40                  45              


His Asp Ala Gly Phe Asn His Val Arg Leu Pro Val Asp Tyr Phe Asn
    50                  55                  60                  


Leu Phe Lys Gly Asp Asp Leu Lys Pro Asp Glu Glu Ile Phe Ala Leu
65                  70                  75                  80  


Leu Asp Lys Ala Leu Lys Asp Ile Gln Asp Ala Asp Leu Asp Val Ile
                85                  90                  95      


Leu Asp Leu His Lys Cys Pro Gly His Asp Phe His Leu Ala Ser Asn
            100                 105                 110         


His Glu Gln Ala Phe Phe Ala Asp Ala Asn Ala Arg Lys Asp Thr Arg
        115                 120                 125             


Lys Ile Trp Ala Phe Met Ala Glu Arg Tyr Ser Ser Met Pro Arg Val
    130                 135                 140                 


Met Met Glu Leu Leu Asn Glu Pro Ala Ala Ser Asp Ser Lys Val Trp
145                 150                 155                 160 


Asp Lys Val Lys Asp Glu Ile Phe Trp Glu Ile Arg Lys His Ala Pro
                165                 170                 175     


Lys Asn Thr Ile Val Val Gly Ala Asn Lys Trp Asn Ser Ala Arg Glu
            180                 185                 190         


Phe Glu Phe Leu Thr Pro Leu Asp Asp Asp Asn Ala Ile Tyr Ser Phe
        195                 200                 205             


His Thr Tyr Thr Pro Val Thr Phe Thr His Gln Gly Ala Ala Trp Ile
    210                 215                 220                 


Asp Asp Pro Phe Phe Lys Ile Glu Arg Pro Trp Pro Gly Asp Tyr Ala
225                 230                 235                 240 


Ala Pro Glu Ala Gly Gly Thr Thr Arg Leu Asn Val Glu Tyr Gly Lys
                245                 250                 255     


Trp Asp Lys Ala Gln Leu Gln Ala Ser Ile Gln Asn Ala Leu Asp Phe
            260                 265                 270         


Arg Ala Lys Tyr Asp Leu Pro Val Ser Cys Asn Glu Phe Gly Val Tyr
        275                 280                 285             


Val Gln Val Pro Arg Lys Tyr Gln Leu Ala Trp Met Arg Asp Phe Leu
    290                 295                 300                 


Asp Ile Leu Arg Asp Ala Asp Val Gly Tyr Ser Tyr Trp Asn Tyr Lys
305                 310                 315                 320 


Asn Leu Asp Phe Gly Leu Val Ser Lys Gly Glu Ser Leu His Asn Ser
                325                 330                 335     


Leu Glu Gln Tyr Asn Asn Pro Glu Arg Leu Asp Arg Glu Leu Met Glu
            340                 345                 350         


Met Ile Ala Lys Gly
        355         


<210> 63
<211> 1191
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 63
atgaagggtc tctacactgc cttggtggct tccgccatca gtggcgctct cgctgctccc     60

agccccgttg aacaaggtcc catcactgct cgcgccgctg cggcagcatg cgccactccc    120

gtcactctct ccggcaaccc gttcgcctcg cgatccatct atgccaacaa ggagtactcc    180

aaggaggtca tcgccgcggc cgcttccatg accgatagcg ccttggctgc taaggcctcc    240

aaggtcgccc aggtcggaac cttcctgtgg attgacactc gtgcccgtat ctcagtcatc    300

gaggacaacc ttaaggacgt tccttgcaac cagatcgccg ccatggtcat ctacgatctc    360

ccaggccgtg attgtgctgc caaggcttcc aacggtgaac ttgccgctgg tgatatcact    420

acctacaagt ccgagtacat tgaccccatt gtcgccatct tcaagaagta tcccaacact    480

gccattgctc tcgtcatcga gcccgattcc cttcccaact tggtcaccaa tgccgacaag    540

caggcctgca aggactccgc ctctggatac aaggcgggtg tcgcgtatgc tctcaagtcc    600

cttaaccttc ccaacatcgc catgtacatt gatgctggcc acggtggctg gttgggctgg    660

aacgacaacc tcaagcccgg agccaagatg ctcgctagcg tatacaagga cgccggctct    720

cccaagcaag tccgcggctt cgccaccaac gttgccggct ggaacgcctg ggacctgtcc    780

cccggcgagt tctccaaggc caccgatgcc cagtggaaca agtgccagaa tgagaagctc    840

tacgtgcagg ccttctctcc cgagctcaag agcgccggca tgccatccca ggccattgtc    900

gacactggcc gtaacgccgt cactggcctc cgcaaggagt ggggtgactg gtgcaacgtc    960

aacggcgccg gcttcggtgt ccgccccacg agcagcacgg gcagctcgct cgttgattct   1020

ttcgtctggg tcaagcctgg tggcgagtct gatggtacta gcgatactag cgctacccgc   1080

tatgactctt tctgcggcaa ggacgatgcc tacaagcctt ctcccgaggc tgggcaatgg   1140

aaccaggagt actttgagat gctgctcacg aatgctaagc cttctttcta a            1191


<210> 64
<211> 396
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (55)...(363)
<223> Glycosyl hydrolases family 6

<400> 64
Met Lys Gly Leu Tyr Thr Ala Leu Val Ala Ser Ala Ile Ser Gly Ala
1               5                   10                  15      


Leu Ala Ala Pro Ser Pro Val Glu Gln Gly Pro Ile Thr Ala Arg Ala
            20                  25                  30          


Ala Ala Ala Ala Cys Ala Thr Pro Val Thr Leu Ser Gly Asn Pro Phe
        35                  40                  45              


Ala Ser Arg Ser Ile Tyr Ala Asn Lys Glu Tyr Ser Lys Glu Val Ile
    50                  55                  60                  


Ala Ala Ala Ala Ser Met Thr Asp Ser Ala Leu Ala Ala Lys Ala Ser
65                  70                  75                  80  


Lys Val Ala Gln Val Gly Thr Phe Leu Trp Ile Asp Thr Arg Ala Arg
                85                  90                  95      


Ile Ser Val Ile Glu Asp Asn Leu Lys Asp Val Pro Cys Asn Gln Ile
            100                 105                 110         


Ala Ala Met Val Ile Tyr Asp Leu Pro Gly Arg Asp Cys Ala Ala Lys
        115                 120                 125             


Ala Ser Asn Gly Glu Leu Ala Ala Gly Asp Ile Thr Thr Tyr Lys Ser
    130                 135                 140                 


Glu Tyr Ile Asp Pro Ile Val Ala Ile Phe Lys Lys Tyr Pro Asn Thr
145                 150                 155                 160 


Ala Ile Ala Leu Val Ile Glu Pro Asp Ser Leu Pro Asn Leu Val Thr
                165                 170                 175     


Asn Ala Asp Lys Gln Ala Cys Lys Asp Ser Ala Ser Gly Tyr Lys Ala
            180                 185                 190         


Gly Val Ala Tyr Ala Leu Lys Ser Leu Asn Leu Pro Asn Ile Ala Met
        195                 200                 205             


Tyr Ile Asp Ala Gly His Gly Gly Trp Leu Gly Trp Asn Asp Asn Leu
    210                 215                 220                 


Lys Pro Gly Ala Lys Met Leu Ala Ser Val Tyr Lys Asp Ala Gly Ser
225                 230                 235                 240 


Pro Lys Gln Val Arg Gly Phe Ala Thr Asn Val Ala Gly Trp Asn Ala
                245                 250                 255     


Trp Asp Leu Ser Pro Gly Glu Phe Ser Lys Ala Thr Asp Ala Gln Trp
            260                 265                 270         


Asn Lys Cys Gln Asn Glu Lys Leu Tyr Val Gln Ala Phe Ser Pro Glu
        275                 280                 285             


Leu Lys Ser Ala Gly Met Pro Ser Gln Ala Ile Val Asp Thr Gly Arg
    290                 295                 300                 


Asn Ala Val Thr Gly Leu Arg Lys Glu Trp Gly Asp Trp Cys Asn Val
305                 310                 315                 320 


Asn Gly Ala Gly Phe Gly Val Arg Pro Thr Ser Ser Thr Gly Ser Ser
                325                 330                 335     


Leu Val Asp Ser Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly
            340                 345                 350         


Thr Ser Asp Thr Ser Ala Thr Arg Tyr Asp Ser Phe Cys Gly Lys Asp
        355                 360                 365             


Asp Ala Tyr Lys Pro Ser Pro Glu Ala Gly Gln Trp Asn Gln Glu Tyr
    370                 375                 380                 


Phe Glu Met Leu Leu Thr Asn Ala Lys Pro Ser Phe
385                 390                 395     


<210> 65
<211> 1386
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 65
atggccgtca aaaacatctt gttggctgct gccgcgctca gcgcctccgt cgccgccacc     60

ccgttggagc cgcgtcagtc atgcggaggc tcttgggcgc aatgcggcgg cattggattc    120

tcgggcccga cctgctgtgt gtccggaaac acgtgtacgt accagaacga ctggtactcc    180

cagtgccttc cgggcggacc gaccaccagc tcggccgtgc gttccacctc gactacctcc    240

aagcagcagg gtaccacgtc cactgccacc acccacgcat cggccaccac taccgcaccc    300

gtgaactcgg gcaacccctt cagcggcgtc cagatgtggg ccaacgacta ctacgcctct    360

gagatctctg ccagcgccat cccctccctc actggcgcca tggcgaccaa ggccgccgcg    420

gttgccaagg ttcccacctt ccagtggctc gacaccgccg acaaggtacc gacgctgatg    480

gcggacacgc tctccaagat ccgcgcggcc aacaagaatg cggccacccc ctatgctgga    540

ttgtttgtgg tgtacgatct cccggaccgt gattgcgcgg cggcggcgtc caacggcgag    600

tacagcatcg ccaatggagg tatcgccaac tacaaggcgt acatcgatgc catcaagagc    660

cagctcacca catactcgga tgtcaagaac atcctggtca tcgagcccga cagccttgcg    720

aacctcgtga ccaacatgaa cgtgaccaag tgcgccaatg cccagtctgc ctacctcgag    780

tgcacaaact acgcgctgaa gcagctcaac ctgcccaacg ttgccatgta cctggatgct    840

ggacacgccg gatggctcgg ctggagcgcc aacctgtcgc ccgccgccca gctgtttgcg    900

tcggtctaca agaacgccag cagcccgtct caggtgcgcg gactggccac caacgtcgcc    960

aactacaacg cctggagcct cagctctcca ccgtcctaca cgtcaggcaa ctccaactac   1020

gacgagaagc actacgtgca ggcgatcgcg ccactcctcg cccagcaagg attcaacgcc   1080

cacttcatca ccgaccaggg tcgctccgga aagcagccga cgggccagag ccagtggggt   1140

gactggtgca acgctgtggg caccggattc ggcacccgcc cgaccaccaa cacgggcctc   1200

gacgtccagg acgccttcgt ttgggtcaag cccggcggtg aatgcgacgg cacttccaac   1260

accggcgccg cccgctacga cttccactgc ggccagagcg acgctctgca gcccgcgccg   1320

gaggccggca cgtggttcga gaagtacttt gagcagctgc tgaccaacgc gaacccggcc   1380

ttttga                                                              1386


<210> 66
<211> 461
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (114)...(428)
<223> Glycosyl hydrolases family 6

<220> 
<221> DOMAIN
<222> (31)...(59)
<223> Fungal cellulose binding domain

<400> 66
Met Ala Val Lys Asn Ile Leu Leu Ala Ala Ala Ala Leu Ser Ala Ser
1               5                   10                  15      


Val Ala Ala Thr Pro Leu Glu Pro Arg Gln Ser Cys Gly Gly Ser Trp
            20                  25                  30          


Ala Gln Cys Gly Gly Ile Gly Phe Ser Gly Pro Thr Cys Cys Val Ser
        35                  40                  45              


Gly Asn Thr Cys Thr Tyr Gln Asn Asp Trp Tyr Ser Gln Cys Leu Pro
    50                  55                  60                  


Gly Gly Pro Thr Thr Ser Ser Ala Val Arg Ser Thr Ser Thr Thr Ser
65                  70                  75                  80  


Lys Gln Gln Gly Thr Thr Ser Thr Ala Thr Thr His Ala Ser Ala Thr
                85                  90                  95      


Thr Thr Ala Pro Val Asn Ser Gly Asn Pro Phe Ser Gly Val Gln Met
            100                 105                 110         


Trp Ala Asn Asp Tyr Tyr Ala Ser Glu Ile Ser Ala Ser Ala Ile Pro
        115                 120                 125             


Ser Leu Thr Gly Ala Met Ala Thr Lys Ala Ala Ala Val Ala Lys Val
    130                 135                 140                 


Pro Thr Phe Gln Trp Leu Asp Thr Ala Asp Lys Val Pro Thr Leu Met
145                 150                 155                 160 


Ala Asp Thr Leu Ser Lys Ile Arg Ala Ala Asn Lys Asn Ala Ala Thr
                165                 170                 175     


Pro Tyr Ala Gly Leu Phe Val Val Tyr Asp Leu Pro Asp Arg Asp Cys
            180                 185                 190         


Ala Ala Ala Ala Ser Asn Gly Glu Tyr Ser Ile Ala Asn Gly Gly Ile
        195                 200                 205             


Ala Asn Tyr Lys Ala Tyr Ile Asp Ala Ile Lys Ser Gln Leu Thr Thr
    210                 215                 220                 


Tyr Ser Asp Val Lys Asn Ile Leu Val Ile Glu Pro Asp Ser Leu Ala
225                 230                 235                 240 


Asn Leu Val Thr Asn Met Asn Val Thr Lys Cys Ala Asn Ala Gln Ser
                245                 250                 255     


Ala Tyr Leu Glu Cys Thr Asn Tyr Ala Leu Lys Gln Leu Asn Leu Pro
            260                 265                 270         


Asn Val Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp Leu Gly Trp
        275                 280                 285             


Ser Ala Asn Leu Ser Pro Ala Ala Gln Leu Phe Ala Ser Val Tyr Lys
    290                 295                 300                 


Asn Ala Ser Ser Pro Ser Gln Val Arg Gly Leu Ala Thr Asn Val Ala
305                 310                 315                 320 


Asn Tyr Asn Ala Trp Ser Leu Ser Ser Pro Pro Ser Tyr Thr Ser Gly
                325                 330                 335     


Asn Ser Asn Tyr Asp Glu Lys His Tyr Val Gln Ala Ile Ala Pro Leu
            340                 345                 350         


Leu Ala Gln Gln Gly Phe Asn Ala His Phe Ile Thr Asp Gln Gly Arg
        355                 360                 365             


Ser Gly Lys Gln Pro Thr Gly Gln Ser Gln Trp Gly Asp Trp Cys Asn
    370                 375                 380                 


Ala Val Gly Thr Gly Phe Gly Thr Arg Pro Thr Thr Asn Thr Gly Leu
385                 390                 395                 400 


Asp Val Gln Asp Ala Phe Val Trp Val Lys Pro Gly Gly Glu Cys Asp
                405                 410                 415     


Gly Thr Ser Asn Thr Gly Ala Ala Arg Tyr Asp Phe His Cys Gly Gln
            420                 425                 430         


Ser Asp Ala Leu Gln Pro Ala Pro Glu Ala Gly Thr Trp Phe Glu Lys
        435                 440                 445             


Tyr Phe Glu Gln Leu Leu Thr Asn Ala Asn Pro Ala Phe
    450                 455                 460     


<210> 67
<211> 1380
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 67
atgaagagtg caacactatt tgcccttgcg gcgactgcgc aggcccaggt cgcgctctat     60

ggacaatgtg gcggaatcaa ttacagtgga tccacgacct gtgcttccgg ttcctactgc    120

tcgaagatca atgactacta ttcgcaatgt ctcccaggct ctggaaacgg caatggtgca    180

acaagcacca cgttgtccac tgtagctagg cctacaacct cagcgcggcc aaccagtgga    240

gccgccacca gttctggatc tgcgggaacc agcactgccc catcgaccga ggcctcgggc    300

aatcccttgg ctggaaagca gttttatgcg aatccgtact acgcttcaga gatttcgagt    360

ctcgcagttc caacgctttc agcaaagggc agtgcctcgt gggcagccaa ggcaaccgac    420

gttgctaagg ttggaacctt tgtttggctc gacactcgcg ccaaggtcga caccatcgat    480

acatacgcga aagatgttca ggccaagaac gcagccggtg ccaacctcat gctgccgttg    540

gtagtctacg accttcccga aagagattgc gctgctcttg cctccaacgg cgagttgtcc    600

ctggccaaca acggggccgc tctgtaccag ggctacattg actctattgc cgctaagatc    660

aaggcatacc ctgacgtctt cttcgtcctc gtcgttgagc ccgatagctt ggctaacctg    720

gtcaccaacc tgaatgtgca gaagtgctcg aatgcggcgt ctgcttacaa gacattgaca    780

caatatgcaa tcaagacgct caacttgaag aacgttgcca tgtacctcga tgctgggcac    840

gctgggtggc tcgggtggcc cgcgaacatt gagcctgcgg ccaagctgtt tggtgatctc    900

tacaccgctg caggaaagcc tgccgctgtc cgaggtctgg tcacaaatgt ggccaactat    960

aacgcttggt caatttccac gtgcccctcg tatacccaag gatcccagac ttgcgatgag   1020

aagacataca ttaacaatct ggctccgcta cttcgtgctc agggtttccc ggcccacttc   1080

atgatggata ccagccgcaa cggtgtccag cccaccaagc agcaagcgtg gggagactgg   1140

tgcaacgtca ttggagctgg atttggtatc cgcccttcca cttccacccc cgacccgctc   1200

cttgatgcct ttgcgtgggt gaagccagga ggtgagtgcg atggaacgag caactccact   1260

gccgttcgct acgatgcgca ctgtggctac gccgatgctc ttcagcctgc tcccgaggct   1320

ggcacctggt tccaagcgta ctttgagcag ctacttgtca atgccaatcc caagttctag   1380


<210> 68
<211> 459
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(15)

<220> 
<221> DOMAIN
<222> (19)...(47)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (110)...(426)
<223> Glycosyl hydrolases family 6

<400> 68
Met Lys Ser Ala Thr Leu Phe Ala Leu Ala Ala Thr Ala Gln Ala Gln
1               5                   10                  15      


Val Ala Leu Tyr Gly Gln Cys Gly Gly Ile Asn Tyr Ser Gly Ser Thr
            20                  25                  30          


Thr Cys Ala Ser Gly Ser Tyr Cys Ser Lys Ile Asn Asp Tyr Tyr Ser
        35                  40                  45              


Gln Cys Leu Pro Gly Ser Gly Asn Gly Asn Gly Ala Thr Ser Thr Thr
    50                  55                  60                  


Leu Ser Thr Val Ala Arg Pro Thr Thr Ser Ala Arg Pro Thr Ser Gly
65                  70                  75                  80  


Ala Ala Thr Ser Ser Gly Ser Ala Gly Thr Ser Thr Ala Pro Ser Thr
                85                  90                  95      


Glu Ala Ser Gly Asn Pro Leu Ala Gly Lys Gln Phe Tyr Ala Asn Pro
            100                 105                 110         


Tyr Tyr Ala Ser Glu Ile Ser Ser Leu Ala Val Pro Thr Leu Ser Ala
        115                 120                 125             


Lys Gly Ser Ala Ser Trp Ala Ala Lys Ala Thr Asp Val Ala Lys Val
    130                 135                 140                 


Gly Thr Phe Val Trp Leu Asp Thr Arg Ala Lys Val Asp Thr Ile Asp
145                 150                 155                 160 


Thr Tyr Ala Lys Asp Val Gln Ala Lys Asn Ala Ala Gly Ala Asn Leu
                165                 170                 175     


Met Leu Pro Leu Val Val Tyr Asp Leu Pro Glu Arg Asp Cys Ala Ala
            180                 185                 190         


Leu Ala Ser Asn Gly Glu Leu Ser Leu Ala Asn Asn Gly Ala Ala Leu
        195                 200                 205             


Tyr Gln Gly Tyr Ile Asp Ser Ile Ala Ala Lys Ile Lys Ala Tyr Pro
    210                 215                 220                 


Asp Val Phe Phe Val Leu Val Val Glu Pro Asp Ser Leu Ala Asn Leu
225                 230                 235                 240 


Val Thr Asn Leu Asn Val Gln Lys Cys Ser Asn Ala Ala Ser Ala Tyr
                245                 250                 255     


Lys Thr Leu Thr Gln Tyr Ala Ile Lys Thr Leu Asn Leu Lys Asn Val
            260                 265                 270         


Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala
        275                 280                 285             


Asn Ile Glu Pro Ala Ala Lys Leu Phe Gly Asp Leu Tyr Thr Ala Ala
    290                 295                 300                 


Gly Lys Pro Ala Ala Val Arg Gly Leu Val Thr Asn Val Ala Asn Tyr
305                 310                 315                 320 


Asn Ala Trp Ser Ile Ser Thr Cys Pro Ser Tyr Thr Gln Gly Ser Gln
                325                 330                 335     


Thr Cys Asp Glu Lys Thr Tyr Ile Asn Asn Leu Ala Pro Leu Leu Arg
            340                 345                 350         


Ala Gln Gly Phe Pro Ala His Phe Met Met Asp Thr Ser Arg Asn Gly
        355                 360                 365             


Val Gln Pro Thr Lys Gln Gln Ala Trp Gly Asp Trp Cys Asn Val Ile
    370                 375                 380                 


Gly Ala Gly Phe Gly Ile Arg Pro Ser Thr Ser Thr Pro Asp Pro Leu
385                 390                 395                 400 


Leu Asp Ala Phe Ala Trp Val Lys Pro Gly Gly Glu Cys Asp Gly Thr
                405                 410                 415     


Ser Asn Ser Thr Ala Val Arg Tyr Asp Ala His Cys Gly Tyr Ala Asp
            420                 425                 430         


Ala Leu Gln Pro Ala Pro Glu Ala Gly Thr Trp Phe Gln Ala Tyr Phe
        435                 440                 445             


Glu Gln Leu Leu Val Asn Ala Asn Pro Lys Phe
    450                 455                 


<210> 69
<211> 1353
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 69
atgacgaaat cgaccgaaga atcgaccttc cccgaagact tcctctgggg cgccgcgacg     60

tccgcgtatc agatcgaagg ctcgccactc gcagacggcg ccggcccgag catctggcat    120

cgcttcgtcc gcacgcccgg gcgcgtgcac aacggcgaca ccggcgacgt cgcgtgcgac    180

cactacaacc gctaccgcga ggacgtcgcg ctcatgaagg agctcggcct caacacgtat    240

cgcttcagca tttcctggag ccgcatcctt cctgagggca ccggccgcgt gaatcaggga    300

ggtctcgact tctaccgccg gctcgtcgac gagctgctcg cagcgggcat cgcacctaac    360

gcgacgctct accactggga cctccccgcc gcgctcgatg accgcggcgg ctggctcaat    420

cgcgacgtcg ccgactggtt cgccgaatac gcggacgtca tgttccgcgc gctcgacgac    480

cgggtgaaga tgtgggcgac gctcaacgag ccgtgggtcg tcaccgacgg cggctacctg    540

cacggcccgc tcgcgcccgg acatcgcaac gtctgggagg cgccacgcgc gacgcacaac    600

ctgatgcgct cgcacgcgaa ggccgtcctc gcctatcgcg ccaccggaaa gcatcagatc    660

ggcacggtcg tgaacctcga gccgaagtac gccgcatccg actcacccga agacgccgcc    720

gccgtcgcac gcgccgacgc gtacatgaac cgccagtacc tcgacccgat cctcctcggc    780

cgttacccgc aggagctcat cgacggcttc ggcgaggcgt ggcccgacat cccctcctcc    840

gacttcgacg agctcacgac gcccatcgac ttcctcggca tcaactacta caagcgcggg    900

atcacgaagg cggacgagtc agtcctcatc gagcgcgcca cgcgcgtcga caaccctcgc    960

ggaacgacga cagaagtcgg ctgggaggtc tacgcagacg gccttacgaa gatcctgacg   1020

tgggtccgcg acaactacgg cgacctcccg ctctacatca ccgagaacgg cgcagcgttc   1080

tacgacccac cggtcgcatt caatggtcgc gtcgaagacc cgctgcgcgt cgactacctc   1140

cgcgaccaca tccgcgccgt gcgcgaagcc atgaaccagg gcgtgaacgt ccgcggctac   1200

tacgtctggt ccttcctcga caacttcgaa tgggccctcg gctacgccaa gcgcttcggc   1260

atcgtccacg tggactacga gactctcgtg cgcacgccga agtcgagcgc gcggtattac   1320

gcggacgtga tcaaggcaag acgggcgatc tga                                1353


<210> 70
<211> 450
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (6)...(449)
<223> Glycosyl hydrolase family 1

<400> 70
Met Thr Lys Ser Thr Glu Glu Ser Thr Phe Pro Glu Asp Phe Leu Trp
1               5                   10                  15      


Gly Ala Ala Thr Ser Ala Tyr Gln Ile Glu Gly Ser Pro Leu Ala Asp
            20                  25                  30          


Gly Ala Gly Pro Ser Ile Trp His Arg Phe Val Arg Thr Pro Gly Arg
        35                  40                  45              


Val His Asn Gly Asp Thr Gly Asp Val Ala Cys Asp His Tyr Asn Arg
    50                  55                  60                  


Tyr Arg Glu Asp Val Ala Leu Met Lys Glu Leu Gly Leu Asn Thr Tyr
65                  70                  75                  80  


Arg Phe Ser Ile Ser Trp Ser Arg Ile Leu Pro Glu Gly Thr Gly Arg
                85                  90                  95      


Val Asn Gln Gly Gly Leu Asp Phe Tyr Arg Arg Leu Val Asp Glu Leu
            100                 105                 110         


Leu Ala Ala Gly Ile Ala Pro Asn Ala Thr Leu Tyr His Trp Asp Leu
        115                 120                 125             


Pro Ala Ala Leu Asp Asp Arg Gly Gly Trp Leu Asn Arg Asp Val Ala
    130                 135                 140                 


Asp Trp Phe Ala Glu Tyr Ala Asp Val Met Phe Arg Ala Leu Asp Asp
145                 150                 155                 160 


Arg Val Lys Met Trp Ala Thr Leu Asn Glu Pro Trp Val Val Thr Asp
                165                 170                 175     


Gly Gly Tyr Leu His Gly Pro Leu Ala Pro Gly His Arg Asn Val Trp
            180                 185                 190         


Glu Ala Pro Arg Ala Thr His Asn Leu Met Arg Ser His Ala Lys Ala
        195                 200                 205             


Val Leu Ala Tyr Arg Ala Thr Gly Lys His Gln Ile Gly Thr Val Val
    210                 215                 220                 


Asn Leu Glu Pro Lys Tyr Ala Ala Ser Asp Ser Pro Glu Asp Ala Ala
225                 230                 235                 240 


Ala Val Ala Arg Ala Asp Ala Tyr Met Asn Arg Gln Tyr Leu Asp Pro
                245                 250                 255     


Ile Leu Leu Gly Arg Tyr Pro Gln Glu Leu Ile Asp Gly Phe Gly Glu
            260                 265                 270         


Ala Trp Pro Asp Ile Pro Ser Ser Asp Phe Asp Glu Leu Thr Thr Pro
        275                 280                 285             


Ile Asp Phe Leu Gly Ile Asn Tyr Tyr Lys Arg Gly Ile Thr Lys Ala
    290                 295                 300                 


Asp Glu Ser Val Leu Ile Glu Arg Ala Thr Arg Val Asp Asn Pro Arg
305                 310                 315                 320 


Gly Thr Thr Thr Glu Val Gly Trp Glu Val Tyr Ala Asp Gly Leu Thr
                325                 330                 335     


Lys Ile Leu Thr Trp Val Arg Asp Asn Tyr Gly Asp Leu Pro Leu Tyr
            340                 345                 350         


Ile Thr Glu Asn Gly Ala Ala Phe Tyr Asp Pro Pro Val Ala Phe Asn
        355                 360                 365             


Gly Arg Val Glu Asp Pro Leu Arg Val Asp Tyr Leu Arg Asp His Ile
    370                 375                 380                 


Arg Ala Val Arg Glu Ala Met Asn Gln Gly Val Asn Val Arg Gly Tyr
385                 390                 395                 400 


Tyr Val Trp Ser Phe Leu Asp Asn Phe Glu Trp Ala Leu Gly Tyr Ala
                405                 410                 415     


Lys Arg Phe Gly Ile Val His Val Asp Tyr Glu Thr Leu Val Arg Thr
            420                 425                 430         


Pro Lys Ser Ser Ala Arg Tyr Tyr Ala Asp Val Ile Lys Ala Arg Arg
        435                 440                 445             


Ala Ile
    450 


<210> 71
<211> 1371
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 71
atgaagactg caactttgtt ggctcttgca gctactgcac aagcgcaggt tgcaacctgg     60

ggacaatgcg gtggtatcaa ctatagcggg tccacgacct gcgcctctgg caactactgc    120

tccaagatca atgactatta ttcgcaatgt cttccgggca cgggtggagc aggcaccacg    180

ttgtccacag tggccgtacc taccaccgga acttccacta aacctgccac gggagcacca    240

acatctggat ccgcaggtac cacaactgca ccagcatccc aggcctcggg taatccattg    300

gccggcaaga cgttttatgc gaatccctac tatgcttcgg agatttcgag tctggcagtt    360

ccatcactgt cggcgaaagg cagtgctact tgggcagcaa aagccaccga cgtcgcgaag    420

attggaactt ttgtttggct tgatacccgc gccaaggtcc caactatcgc aacatatgcg    480

aaagatgtcc aggcacagaa tgctgccgga gctaatctca tgctgccact ggtagtctat    540

gatctccccg aaagagattg cgctgctctt gcctccaatg gcgaactgtc cctcgcgaac    600

aacggcgccg cgttgtatca gggctacatc gacgacatcg ctacccaaat caaggcgttc    660

ccagatgtct tttttgtgct cgtcattgag cccgatagct tggcaaattt ggtcacaaac    720

ttgaacgtgc aaaagtgctc gaacgcagca tccgcataca aaactctgac aacatacgcg    780

atcaagacgc tcaacttgaa gaacgtcgcc atgtacatgg acgctggtca cgctggatgg    840

cttggatggc ctgcgaacat taagcctgcc gcccagcttt tcggtcaact ctacagtgat    900

gcaggaaagc ccgctgctct ccgcggcctg gtgaccaacg tggcgaatta caatgcttgg    960

tccatttcca cctgcccttc atacacgcaa ggaagccaga cgtgcgatga gaagacttat   1020

atcaacaact tggcaccatt gcttacagcg gagggtttcc cagctcactt catggttgac   1080

actagccgta acggcgttca gccaaccaag cagcaggcgt ggggagactg gtgcaatgtc   1140

attggaactg gcttcggtat tcgccctacc actgtcactc ccgatccact cttggacgcc   1200

tttgcgtggg tcaagccagg tggcgagtgc gatggaacga gcaactccac tgccgttcgc   1260

tatgatgcac actgcggata cagtgatgct cttcagcctg cacccgaagc cggtacctgg   1320

ttcgaggcgt acttcgagca gcttcttgtc aatgccaacc ccaagttcta g            1371


<210> 72
<211> 456
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(15)

<220> 
<221> DOMAIN
<222> (107)...(423)
<223> Glycosyl hydrolases family 6

<220> 
<221> DOMAIN
<222> (19)...(47)
<223> Fungal cellulose binding domain

<400> 72
Met Lys Thr Ala Thr Leu Leu Ala Leu Ala Ala Thr Ala Gln Ala Gln
1               5                   10                  15      


Val Ala Thr Trp Gly Gln Cys Gly Gly Ile Asn Tyr Ser Gly Ser Thr
            20                  25                  30          


Thr Cys Ala Ser Gly Asn Tyr Cys Ser Lys Ile Asn Asp Tyr Tyr Ser
        35                  40                  45              


Gln Cys Leu Pro Gly Thr Gly Gly Ala Gly Thr Thr Leu Ser Thr Val
    50                  55                  60                  


Ala Val Pro Thr Thr Gly Thr Ser Thr Lys Pro Ala Thr Gly Ala Pro
65                  70                  75                  80  


Thr Ser Gly Ser Ala Gly Thr Thr Thr Ala Pro Ala Ser Gln Ala Ser
                85                  90                  95      


Gly Asn Pro Leu Ala Gly Lys Thr Phe Tyr Ala Asn Pro Tyr Tyr Ala
            100                 105                 110         


Ser Glu Ile Ser Ser Leu Ala Val Pro Ser Leu Ser Ala Lys Gly Ser
        115                 120                 125             


Ala Thr Trp Ala Ala Lys Ala Thr Asp Val Ala Lys Ile Gly Thr Phe
    130                 135                 140                 


Val Trp Leu Asp Thr Arg Ala Lys Val Pro Thr Ile Ala Thr Tyr Ala
145                 150                 155                 160 


Lys Asp Val Gln Ala Gln Asn Ala Ala Gly Ala Asn Leu Met Leu Pro
                165                 170                 175     


Leu Val Val Tyr Asp Leu Pro Glu Arg Asp Cys Ala Ala Leu Ala Ser
            180                 185                 190         


Asn Gly Glu Leu Ser Leu Ala Asn Asn Gly Ala Ala Leu Tyr Gln Gly
        195                 200                 205             


Tyr Ile Asp Asp Ile Ala Thr Gln Ile Lys Ala Phe Pro Asp Val Phe
    210                 215                 220                 


Phe Val Leu Val Ile Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn
225                 230                 235                 240 


Leu Asn Val Gln Lys Cys Ser Asn Ala Ala Ser Ala Tyr Lys Thr Leu
                245                 250                 255     


Thr Thr Tyr Ala Ile Lys Thr Leu Asn Leu Lys Asn Val Ala Met Tyr
            260                 265                 270         


Met Asp Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala Asn Ile Lys
        275                 280                 285             


Pro Ala Ala Gln Leu Phe Gly Gln Leu Tyr Ser Asp Ala Gly Lys Pro
    290                 295                 300                 


Ala Ala Leu Arg Gly Leu Val Thr Asn Val Ala Asn Tyr Asn Ala Trp
305                 310                 315                 320 


Ser Ile Ser Thr Cys Pro Ser Tyr Thr Gln Gly Ser Gln Thr Cys Asp
                325                 330                 335     


Glu Lys Thr Tyr Ile Asn Asn Leu Ala Pro Leu Leu Thr Ala Glu Gly
            340                 345                 350         


Phe Pro Ala His Phe Met Val Asp Thr Ser Arg Asn Gly Val Gln Pro
        355                 360                 365             


Thr Lys Gln Gln Ala Trp Gly Asp Trp Cys Asn Val Ile Gly Thr Gly
    370                 375                 380                 


Phe Gly Ile Arg Pro Thr Thr Val Thr Pro Asp Pro Leu Leu Asp Ala
385                 390                 395                 400 


Phe Ala Trp Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asn Ser
                405                 410                 415     


Thr Ala Val Arg Tyr Asp Ala His Cys Gly Tyr Ser Asp Ala Leu Gln
            420                 425                 430         


Pro Ala Pro Glu Ala Gly Thr Trp Phe Glu Ala Tyr Phe Glu Gln Leu
        435                 440                 445             


Leu Val Asn Ala Asn Pro Lys Phe
    450                 455     


<210> 73
<211> 1365
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 73
atgctttctc gcacactctt cctagcatct ctcttgtcca cgtccctggt cgctggccag     60

ggcgttggct cctaccagac cgagactcat cccaagatga gttggcagca gtgttctgga    120

accggtggaa cgagttgtaa gacggtccaa ggcgaggtag tagtcgatgc caactggcgc    180

tggctacaca tcaagggtga ctacaagaac tgctacgacg gcaacacatg ggacaaagcc    240

acttgtggca caaacgccaa ttgcaccacc aactgcgtcg tagagggcgc cgactactct    300

ggaacatatg ggattacggc tggaggtacc gatctgacgc tcaagttcgt gaccaagggc    360

caatattcga cgaatattgg ttcgcgtacg tatcttatga aggactcgag cacttatcag    420

accttcaacc tgatcggtaa cgaattcacg tttgacgtcg acgtgagtca attgccttgc    480

gggctcaatg gagccctgta cttcgtcgtc atggacccca agggccaggg cactgctggt    540

gccaagtatg ggactggata ctgcgatggt cagtgcccac gagacttgaa attcatcaat    600

ggaaaggcca atgcggaggg ctggcaacct tcgtcgaatg acaagaacgc cggagtcgga    660

gccaggggag cctgctgtgc cgagatggat gtctgggagg ccaactctat atccactgct    720

ctcacccctc attcctgcga tactgttacc ttctccgagt gcagcggaga caattgcggt    780

ggcacttact cgagcactcg ctatgctgga ccgtgcgacc cgaatggctg cgatttcaac    840

ccctaccgct taggtgtcac cgacttctac ggcaagggca agaccgttga cacaagcaaa    900

cccttcactg ttgtgactca gttcctgggt tctggatcga ccttgtcaga gatcaaacgt    960

ttctacgtcc agggaggcac cgtgattccg aaccctcagc caaagactgc tggcatcacg   1020

ggcaactcta tcacccaaga atggtgtgac gctgagaaca aagccaataa agaagacgtg   1080

tatccattca agacgcatgg aggaatgaaa tctatggcat cagcaatggc gaaaggaatg   1140

gtattagtaa tgtcgctctg ggatgaccat tatgcaaata tgctttggct ggacagcaca   1200

tatccaaccg accaaaccgg accagggact gcacgtggag actgccccac gagctcaggc   1260

gtgccagcag atgttgaatc gaagaacgcg aacgcccagg tgaagtatgg taacatcaag   1320

ttcggcccga ttggctccac gttcaagcag ccgtccgggt cctga                   1365


<210> 74
<211> 454
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (21)...(450)
<223> Glycosyl hydrolase family 7

<400> 74
Met Leu Ser Arg Thr Leu Phe Leu Ala Ser Leu Leu Ser Thr Ser Leu
1               5                   10                  15      


Val Ala Gly Gln Gly Val Gly Ser Tyr Gln Thr Glu Thr His Pro Lys
            20                  25                  30          


Met Ser Trp Gln Gln Cys Ser Gly Thr Gly Gly Thr Ser Cys Lys Thr
        35                  40                  45              


Val Gln Gly Glu Val Val Val Asp Ala Asn Trp Arg Trp Leu His Ile
    50                  55                  60                  


Lys Gly Asp Tyr Lys Asn Cys Tyr Asp Gly Asn Thr Trp Asp Lys Ala
65                  70                  75                  80  


Thr Cys Gly Thr Asn Ala Asn Cys Thr Thr Asn Cys Val Val Glu Gly
                85                  90                  95      


Ala Asp Tyr Ser Gly Thr Tyr Gly Ile Thr Ala Gly Gly Thr Asp Leu
            100                 105                 110         


Thr Leu Lys Phe Val Thr Lys Gly Gln Tyr Ser Thr Asn Ile Gly Ser
        115                 120                 125             


Arg Thr Tyr Leu Met Lys Asp Ser Ser Thr Tyr Gln Thr Phe Asn Leu
    130                 135                 140                 


Ile Gly Asn Glu Phe Thr Phe Asp Val Asp Val Ser Gln Leu Pro Cys
145                 150                 155                 160 


Gly Leu Asn Gly Ala Leu Tyr Phe Val Val Met Asp Pro Lys Gly Gln
                165                 170                 175     


Gly Thr Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Gly Gln Cys
            180                 185                 190         


Pro Arg Asp Leu Lys Phe Ile Asn Gly Lys Ala Asn Ala Glu Gly Trp
        195                 200                 205             


Gln Pro Ser Ser Asn Asp Lys Asn Ala Gly Val Gly Ala Arg Gly Ala
    210                 215                 220                 


Cys Cys Ala Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Thr Ala
225                 230                 235                 240 


Leu Thr Pro His Ser Cys Asp Thr Val Thr Phe Ser Glu Cys Ser Gly
                245                 250                 255     


Asp Asn Cys Gly Gly Thr Tyr Ser Ser Thr Arg Tyr Ala Gly Pro Cys
            260                 265                 270         


Asp Pro Asn Gly Cys Asp Phe Asn Pro Tyr Arg Leu Gly Val Thr Asp
        275                 280                 285             


Phe Tyr Gly Lys Gly Lys Thr Val Asp Thr Ser Lys Pro Phe Thr Val
    290                 295                 300                 


Val Thr Gln Phe Leu Gly Ser Gly Ser Thr Leu Ser Glu Ile Lys Arg
305                 310                 315                 320 


Phe Tyr Val Gln Gly Gly Thr Val Ile Pro Asn Pro Gln Pro Lys Thr
                325                 330                 335     


Ala Gly Ile Thr Gly Asn Ser Ile Thr Gln Glu Trp Cys Asp Ala Glu
            340                 345                 350         


Asn Lys Ala Asn Lys Glu Asp Val Tyr Pro Phe Lys Thr His Gly Gly
        355                 360                 365             


Met Lys Ser Met Ala Ser Ala Met Ala Lys Gly Met Val Leu Val Met
    370                 375                 380                 


Ser Leu Trp Asp Asp His Tyr Ala Asn Met Leu Trp Leu Asp Ser Thr
385                 390                 395                 400 


Tyr Pro Thr Asp Gln Thr Gly Pro Gly Thr Ala Arg Gly Asp Cys Pro
                405                 410                 415     


Thr Ser Ser Gly Val Pro Ala Asp Val Glu Ser Lys Asn Ala Asn Ala
            420                 425                 430         


Gln Val Lys Tyr Gly Asn Ile Lys Phe Gly Pro Ile Gly Ser Thr Phe
        435                 440                 445             


Lys Gln Pro Ser Gly Ser
    450                 


<210> 75
<211> 1383
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 75
gtgggatccg ttgcgactga tgttgacgcc gtgccgggcc cgcccgtgcc ctccgccctg     60

aagctgcctg catcgttcaa gtggggcgtc tccaccgctg cctatcagat cgagggggcc    120

gtcgcagagg atgggcgagg acccagcatc tgggatctgt tcggccgaca ggccggccgt    180

attgccaatg gcgacaccgg agatgtcgca tgtgaccact accatcgcta ccgggaagac    240

gtggcgctga tgcggcgcct aggcacgcag gtctaccgct tttcggtggc ctggccgcgg    300

gtgttgccgc aaggacgcgg ccaggccaac agcctcggcc tcgacttcta tgatcgtctg    360

atcgatgaac tgctgcgcag cggcatagag ccgtggttgt gcctgtacca ttgggatctg    420

ccgcaggcgt tggatgatct cggtggatgg cagaatcgcg atatcgctct ctggttcgcc    480

gactatgccg ccctgattgc gcgccgctat ggcgaccggg ttcggcattt tgcgactttc    540

aacgaaccca acgtgtgcac gctgttcggc tacggcatgg gctggaatgc gcccggtatc    600

gccaaccggc ggagtttcct gcaggcggct caccacgtga atctcgcgca tggcgaagcc    660

gtccgtacat tgagagcgtt ggccccggga gctcttcttg gcgccatcta caatcggcag    720

atctgcattc cgataagcgc ggcgccggag gatgccgtgg cggccaatat actggatgct    780

tgctggaacc gactttatgc cgacccgcaa tgcctcgccg agtatccccc cgaactggct    840

gaaagccttc tggaattctc gcgggcgggc gatatggcgc ggatcgccca gccgatcgac    900

tggttcggcc tgaaccacta ctgcccgatc tatgcgcgcg ccgatcgcgg tccgcttggt    960

tttgcctggg ccgatgcgcc ggtcgatggc cccttgaccg gcgtcggctg gcgcatcgac   1020

ccggaagcct tccgcaacga aataatcgcg gcgcaccggc gctacaagct tccgatctac   1080

gtcacggaga acggctacgg agcacacgag acactggatg aagcgggtgg ggtgaacgac   1140

ggcgggcgca tcgcctatct tgcaacctat ctcagggccc tggaggaagc cgtcgcttcg   1200

ggcgccgatg tgcggggtta tttcctctgg tcgcttctgg acaatctcga atggggtgcc   1260

ggctttgcga gccgttttgg aatcgtcttc gtcgactacg cgacccagcg gcgtgtgccc   1320

aaagcgtcat ttgattggtt tggcaagttg atccgagcgc agcaggacac ccatcacgcg   1380

tga                                                                 1383


<210> 76
<211> 460
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (18)...(456)
<223> Glycosyl hydrolase family 1

<400> 76
Met Gly Ser Val Ala Thr Asp Val Asp Ala Val Pro Gly Pro Pro Val
1               5                   10                  15      


Pro Ser Ala Leu Lys Leu Pro Ala Ser Phe Lys Trp Gly Val Ser Thr
            20                  25                  30          


Ala Ala Tyr Gln Ile Glu Gly Ala Val Ala Glu Asp Gly Arg Gly Pro
        35                  40                  45              


Ser Ile Trp Asp Leu Phe Gly Arg Gln Ala Gly Arg Ile Ala Asn Gly
    50                  55                  60                  


Asp Thr Gly Asp Val Ala Cys Asp His Tyr His Arg Tyr Arg Glu Asp
65                  70                  75                  80  


Val Ala Leu Met Arg Arg Leu Gly Thr Gln Val Tyr Arg Phe Ser Val
                85                  90                  95      


Ala Trp Pro Arg Val Leu Pro Gln Gly Arg Gly Gln Ala Asn Ser Leu
            100                 105                 110         


Gly Leu Asp Phe Tyr Asp Arg Leu Ile Asp Glu Leu Leu Arg Ser Gly
        115                 120                 125             


Ile Glu Pro Trp Leu Cys Leu Tyr His Trp Asp Leu Pro Gln Ala Leu
    130                 135                 140                 


Asp Asp Leu Gly Gly Trp Gln Asn Arg Asp Ile Ala Leu Trp Phe Ala
145                 150                 155                 160 


Asp Tyr Ala Ala Leu Ile Ala Arg Arg Tyr Gly Asp Arg Val Arg His
                165                 170                 175     


Phe Ala Thr Phe Asn Glu Pro Asn Val Cys Thr Leu Phe Gly Tyr Gly
            180                 185                 190         


Met Gly Trp Asn Ala Pro Gly Ile Ala Asn Arg Arg Ser Phe Leu Gln
        195                 200                 205             


Ala Ala His His Val Asn Leu Ala His Gly Glu Ala Val Arg Thr Leu
    210                 215                 220                 


Arg Ala Leu Ala Pro Gly Ala Leu Leu Gly Ala Ile Tyr Asn Arg Gln
225                 230                 235                 240 


Ile Cys Ile Pro Ile Ser Ala Ala Pro Glu Asp Ala Val Ala Ala Asn
                245                 250                 255     


Ile Leu Asp Ala Cys Trp Asn Arg Leu Tyr Ala Asp Pro Gln Cys Leu
            260                 265                 270         


Ala Glu Tyr Pro Pro Glu Leu Ala Glu Ser Leu Leu Glu Phe Ser Arg
        275                 280                 285             


Ala Gly Asp Met Ala Arg Ile Ala Gln Pro Ile Asp Trp Phe Gly Leu
    290                 295                 300                 


Asn His Tyr Cys Pro Ile Tyr Ala Arg Ala Asp Arg Gly Pro Leu Gly
305                 310                 315                 320 


Phe Ala Trp Ala Asp Ala Pro Val Asp Gly Pro Leu Thr Gly Val Gly
                325                 330                 335     


Trp Arg Ile Asp Pro Glu Ala Phe Arg Asn Glu Ile Ile Ala Ala His
            340                 345                 350         


Arg Arg Tyr Lys Leu Pro Ile Tyr Val Thr Glu Asn Gly Tyr Gly Ala
        355                 360                 365             


His Glu Thr Leu Asp Glu Ala Gly Gly Val Asn Asp Gly Gly Arg Ile
    370                 375                 380                 


Ala Tyr Leu Ala Thr Tyr Leu Arg Ala Leu Glu Glu Ala Val Ala Ser
385                 390                 395                 400 


Gly Ala Asp Val Arg Gly Tyr Phe Leu Trp Ser Leu Leu Asp Asn Leu
                405                 410                 415     


Glu Trp Gly Ala Gly Phe Ala Ser Arg Phe Gly Ile Val Phe Val Asp
            420                 425                 430         


Tyr Ala Thr Gln Arg Arg Val Pro Lys Ala Ser Phe Asp Trp Phe Gly
        435                 440                 445             


Lys Leu Ile Arg Ala Gln Gln Asp Thr His His Ala
    450                 455                 460 


<210> 77
<211> 1362
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 77
atgtatcaac gcgcacttct tttctccgcc ttgatggcgg gcgtgagtgc tcagcaggtt     60

ggaactcaga agcctgaaac ccacccacca ctcgcctgga aggagtgtac ctcgtctggc    120

tgcaccagca aagatggttc cgtggtcatt gatgccaact ggcgctgggt tcactcggtc    180

gatggttaca agaactgcta cactggtaac gaatgggaca gcaccctgtg ccctgacgat    240

gctacctgcg cgaccaactg cgctgtggac ggtgcggact atgccggcac ctacggagct    300

accaccgagg gagactccct gtccatcaac ttcgttaccg gatcaaacat cggctcgcgc    360

ttctacctca tggaggatga gaacaaatac cagatgttca agctcctgaa caaggaattc    420

accttcgacg ttgatgtttc cactcttccc tgtggcctca atggtgcctt gtactttgtc    480

tccatggatg ccgacggtgg catgtccaag tatgagacca acaaggcggg tgccaagtat    540

ggtacaggtt actgtgactc tcagtgcccg cgtgacctga agttcatcaa cggaaagggt    600

aacgttgaag gctggaagcc atctgcgaac gacaagaatg ccggtgttgg accacacggt    660

tcttgctgtg ctgaaatgga tatctgggag gctaacagca tctccactgc cttgactccc    720

catccctgcg ataccaacgg ccagaccatt tgcgaaggtg acagctgcgg tggaacctac    780

tctaccacca gatacgccgg tacctgcgat cccgatggct gcgacttcaa ccccttccgc    840

atgggtaacg aatccttcta cggccccgga aagatggtgg acaccaagtc gaagatgact    900

gtcgtgaccc agttcatcac cagcgacgga accgacactg gcagcttgaa ggagatcaag    960

cgcgtctatg tccagaatgg caaggtcatt gccaactcgg cctcggacgt gagcggcatt   1020

actggcaact cgatcacctc ggaattttgc actgctcaga agaagacctt tggcgacgag   1080

gatgtcttta acaagcatgg tggtctgtct ggcatgggtg atgctctggg agaaggcatg   1140

gttctcgtga tgagcctgtg ggatgaccac aactctaaca tgctctggct cgacggcgag   1200

aagtacccaa ccgatgctgc tgcttccaag gttggcgtca gccgtggcac ctgcagcact   1260

gactctggca agccctctac tattgaatcc gagtctggtt ctgccaaggt cgttttctcc   1320

aacatcaagg ttggctccat tgggtcaacc ttttccgcat aa                      1362


<210> 78
<211> 453
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (19)...(453)
<223> Glycosyl hydrolase family 7

<400> 78
Met Tyr Gln Arg Ala Leu Leu Phe Ser Ala Leu Met Ala Gly Val Ser
1               5                   10                  15      


Ala Gln Gln Val Gly Thr Gln Lys Pro Glu Thr His Pro Pro Leu Ala
            20                  25                  30          


Trp Lys Glu Cys Thr Ser Ser Gly Cys Thr Ser Lys Asp Gly Ser Val
        35                  40                  45              


Val Ile Asp Ala Asn Trp Arg Trp Val His Ser Val Asp Gly Tyr Lys
    50                  55                  60                  


Asn Cys Tyr Thr Gly Asn Glu Trp Asp Ser Thr Leu Cys Pro Asp Asp
65                  70                  75                  80  


Ala Thr Cys Ala Thr Asn Cys Ala Val Asp Gly Ala Asp Tyr Ala Gly
                85                  90                  95      


Thr Tyr Gly Ala Thr Thr Glu Gly Asp Ser Leu Ser Ile Asn Phe Val
            100                 105                 110         


Thr Gly Ser Asn Ile Gly Ser Arg Phe Tyr Leu Met Glu Asp Glu Asn
        115                 120                 125             


Lys Tyr Gln Met Phe Lys Leu Leu Asn Lys Glu Phe Thr Phe Asp Val
    130                 135                 140                 


Asp Val Ser Thr Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val
145                 150                 155                 160 


Ser Met Asp Ala Asp Gly Gly Met Ser Lys Tyr Glu Thr Asn Lys Ala
                165                 170                 175     


Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp
            180                 185                 190         


Leu Lys Phe Ile Asn Gly Lys Gly Asn Val Glu Gly Trp Lys Pro Ser
        195                 200                 205             


Ala Asn Asp Lys Asn Ala Gly Val Gly Pro His Gly Ser Cys Cys Ala
    210                 215                 220                 


Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Thr Ala Leu Thr Pro
225                 230                 235                 240 


His Pro Cys Asp Thr Asn Gly Gln Thr Ile Cys Glu Gly Asp Ser Cys
                245                 250                 255     


Gly Gly Thr Tyr Ser Thr Thr Arg Tyr Ala Gly Thr Cys Asp Pro Asp
            260                 265                 270         


Gly Cys Asp Phe Asn Pro Phe Arg Met Gly Asn Glu Ser Phe Tyr Gly
        275                 280                 285             


Pro Gly Lys Met Val Asp Thr Lys Ser Lys Met Thr Val Val Thr Gln
    290                 295                 300                 


Phe Ile Thr Ser Asp Gly Thr Asp Thr Gly Ser Leu Lys Glu Ile Lys
305                 310                 315                 320 


Arg Val Tyr Val Gln Asn Gly Lys Val Ile Ala Asn Ser Ala Ser Asp
                325                 330                 335     


Val Ser Gly Ile Thr Gly Asn Ser Ile Thr Ser Glu Phe Cys Thr Ala
            340                 345                 350         


Gln Lys Lys Thr Phe Gly Asp Glu Asp Val Phe Asn Lys His Gly Gly
        355                 360                 365             


Leu Ser Gly Met Gly Asp Ala Leu Gly Glu Gly Met Val Leu Val Met
    370                 375                 380                 


Ser Leu Trp Asp Asp His Asn Ser Asn Met Leu Trp Leu Asp Gly Glu
385                 390                 395                 400 


Lys Tyr Pro Thr Asp Ala Ala Ala Ser Lys Val Gly Val Ser Arg Gly
                405                 410                 415     


Thr Cys Ser Thr Asp Ser Gly Lys Pro Ser Thr Ile Glu Ser Glu Ser
            420                 425                 430         


Gly Ser Ala Lys Val Val Phe Ser Asn Ile Lys Val Gly Ser Ile Gly
        435                 440                 445             


Ser Thr Phe Ser Ala
    450             


<210> 79
<211> 1518
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 79
atgaacgata actttctttg gggtgtttca caatctggtt ttcagtttga aatgggagac     60

aaactgagga gaaatataga taccagaagt gattggtggc attgggttcg tgatccgtgg    120

aatattcaaa aggagcttgt aagtggagat ctccctgaag aggggataaa caattatgag    180

ctttatgaac aggatcatgc tttagctaag aatttgggac taaatgccta tagaattgga    240

attgaatgga gtaggatttt cccctgttct actgtacata tagatgttag ttactccctt    300

gattcctatg gattaataaa ggatattaag ataacaaaaa acacactgga ggcactggat    360

gaaattgcaa ataagaggga agtggagtat tacagacgag taatcatgaa tcttaggaat    420

aagggattca aagtcatagt caacttaaat catttcacac tccctatatg gctccatgat    480

ccaatagaag cacgagaaag agcattaaca aacaaaaaaa taggatgggt gagtgaacat    540

tctgtaatag aatttactaa gtttgttgct tatattgcat ataagtttgg agatattgta    600

gatatgtgga gtacgtttaa tgaacctatg gtagttgttg aacttggtta tctagcaccc    660

tattcgggat ttcctcctgg agttttgaat ccagaggctg cgagattagt tatattacat    720

atgataaatg cccatgcgag agcatatgac gctatcaaaa agtttgataa gacaaaagct    780

gacaaagact ctaaagaacc agctgaagta gggattatct ataataacat tggtgtttca    840

tacccatata ccaacaattc aaaggacatc acagcagctg aaaaaagcaa tttctttcac    900

agtgggttat ttttaacagc gataaacaaa ggaaaactta acattgaatt cgatggggaa    960

acattaatca atgtaaaaca tttaaagaga aatgactgga ttggattaaa ctactacaca   1020

agagaagttg ttagatattc tgaaccaaaa ttcccaagta tctccttaat atcctttgaa   1080

ggtgttcctg actatggata tgcctgtcaa ccaggatcac tgtcaaaaga tgggaatcct   1140

gtaagtgatt ttggatggga aatttatcct aaaggcatat acgactcaat tgaggctgcc   1200

agtgaatatg ggaaaccaat ctatgtaaca gaaaatggta ttgcagattc aaaagatatt   1260

ttaaggccgt actatattgt ttcccatatt gcggaaattg agagagcaat agaaaatggg   1320

ttcgatgtta atggatattt tcattgggca ttaacagata actatgaatg gccaatgggt   1380

tacagaatgc gttttggttt atatgaagtt gatttaataa ccaagaaaag aaagccaaga   1440

gtaaaaagtg ttgaaactta caaagatatt attgccaata atggactaac cgaaaagttg   1500

cgtgaagaat atctttaa                                                 1518


<210> 80
<211> 505
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(496)
<223> Glycosyl hydrolase family 1

<400> 80
Met Asn Asp Asn Phe Leu Trp Gly Val Ser Gln Ser Gly Phe Gln Phe
1               5                   10                  15      


Glu Met Gly Asp Lys Leu Arg Arg Asn Ile Asp Thr Arg Ser Asp Trp
            20                  25                  30          


Trp His Trp Val Arg Asp Pro Trp Asn Ile Gln Lys Glu Leu Val Ser
        35                  40                  45              


Gly Asp Leu Pro Glu Glu Gly Ile Asn Asn Tyr Glu Leu Tyr Glu Gln
    50                  55                  60                  


Asp His Ala Leu Ala Lys Asn Leu Gly Leu Asn Ala Tyr Arg Ile Gly
65                  70                  75                  80  


Ile Glu Trp Ser Arg Ile Phe Pro Cys Ser Thr Val His Ile Asp Val
                85                  90                  95      


Ser Tyr Ser Leu Asp Ser Tyr Gly Leu Ile Lys Asp Ile Lys Ile Thr
            100                 105                 110         


Lys Asn Thr Leu Glu Ala Leu Asp Glu Ile Ala Asn Lys Arg Glu Val
        115                 120                 125             


Glu Tyr Tyr Arg Arg Val Ile Met Asn Leu Arg Asn Lys Gly Phe Lys
    130                 135                 140                 


Val Ile Val Asn Leu Asn His Phe Thr Leu Pro Ile Trp Leu His Asp
145                 150                 155                 160 


Pro Ile Glu Ala Arg Glu Arg Ala Leu Thr Asn Lys Lys Ile Gly Trp
                165                 170                 175     


Val Ser Glu His Ser Val Ile Glu Phe Thr Lys Phe Val Ala Tyr Ile
            180                 185                 190         


Ala Tyr Lys Phe Gly Asp Ile Val Asp Met Trp Ser Thr Phe Asn Glu
        195                 200                 205             


Pro Met Val Val Val Glu Leu Gly Tyr Leu Ala Pro Tyr Ser Gly Phe
    210                 215                 220                 


Pro Pro Gly Val Leu Asn Pro Glu Ala Ala Arg Leu Val Ile Leu His
225                 230                 235                 240 


Met Ile Asn Ala His Ala Arg Ala Tyr Asp Ala Ile Lys Lys Phe Asp
                245                 250                 255     


Lys Thr Lys Ala Asp Lys Asp Ser Lys Glu Pro Ala Glu Val Gly Ile
            260                 265                 270         


Ile Tyr Asn Asn Ile Gly Val Ser Tyr Pro Tyr Thr Asn Asn Ser Lys
        275                 280                 285             


Asp Ile Thr Ala Ala Glu Lys Ser Asn Phe Phe His Ser Gly Leu Phe
    290                 295                 300                 


Leu Thr Ala Ile Asn Lys Gly Lys Leu Asn Ile Glu Phe Asp Gly Glu
305                 310                 315                 320 


Thr Leu Ile Asn Val Lys His Leu Lys Arg Asn Asp Trp Ile Gly Leu
                325                 330                 335     


Asn Tyr Tyr Thr Arg Glu Val Val Arg Tyr Ser Glu Pro Lys Phe Pro
            340                 345                 350         


Ser Ile Ser Leu Ile Ser Phe Glu Gly Val Pro Asp Tyr Gly Tyr Ala
        355                 360                 365             


Cys Gln Pro Gly Ser Leu Ser Lys Asp Gly Asn Pro Val Ser Asp Phe
    370                 375                 380                 


Gly Trp Glu Ile Tyr Pro Lys Gly Ile Tyr Asp Ser Ile Glu Ala Ala
385                 390                 395                 400 


Ser Glu Tyr Gly Lys Pro Ile Tyr Val Thr Glu Asn Gly Ile Ala Asp
                405                 410                 415     


Ser Lys Asp Ile Leu Arg Pro Tyr Tyr Ile Val Ser His Ile Ala Glu
            420                 425                 430         


Ile Glu Arg Ala Ile Glu Asn Gly Phe Asp Val Asn Gly Tyr Phe His
        435                 440                 445             


Trp Ala Leu Thr Asp Asn Tyr Glu Trp Pro Met Gly Tyr Arg Met Arg
    450                 455                 460                 


Phe Gly Leu Tyr Glu Val Asp Leu Ile Thr Lys Lys Arg Lys Pro Arg
465                 470                 475                 480 


Val Lys Ser Val Glu Thr Tyr Lys Asp Ile Ile Ala Asn Asn Gly Leu
                485                 490                 495     


Thr Glu Lys Leu Arg Glu Glu Tyr Leu
            500                 505 


<210> 81
<211> 1395
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 81
atgcagagaa catcagcttg ggcactgctc cttctggcgc agattgccac tgctcagcag     60

accgtctggg gacaatgtgg tggtatcggc tactctggac cgaccagctg tgttgcagga    120

tcttcttgta gcacccagaa ctcttactac gctcaatgtc tcccaggcag tggaaacggc    180

ggtggcggtg cggcaaccac gaccacgact gctggacaaa ccaccaagac caccatggcc    240

accaccacca ccacttcaac caagacctca gctggtagtg gcggcagcac cactactgct    300

cctcctgcta gcaacagtgg caaccccttc aagggatacc agccttacgt gaacccgtac    360

tacgcttccg aggttcagag cctggctatt ccctctctgg cagcctctct ggcgcccaag    420

gccagcgcgg tggccaaggt cccatccttc gtttggctgg acactgctgc taaggtccct    480

actatgggca cttacttggc agacatcaag gccaagaacg cggctggtgc taacccaccc    540

attgccggta tctttgtcgt ttacgatctt cctgaccgtg actgcgctgc tcttgccagt    600

aacggcgagt actccatcgc caacggcggt gttgccaact acaagaagta cattgactcg    660

atccgcgctc agcttctcaa gtaccctgat gtgcacacca tcctggtcat cgaacccgac    720

agtctcgcca acctggtcac caacatgaac gtcgccaaat gctcgggtgc tcacgacgcc    780

tacctggagt gcactgacta tgcactcaag cagctcaact tgcccaacgt tgccatgtac    840

cttgatgccg gacacgctgg ctggcttgga tggcccgcca acattggacc cgctgccgac    900

ctcttcgcca gtgtgtacaa gaatgccggc tctcccgccg ccgtccgtgg attggccacc    960

aacgttgcca actacaacgc ctggtccatc tccacctgcc catcttacac tcagggtgac   1020

cagaactgtg acgagaagcg ctacatcaac gccctcgctc ctctcctccg cgcgaacggc   1080

ttcgacgccc acttcatcat ggacacctcc cgtaacggtg ttcagcccac taagcaacaa   1140

gcctggggtg actggtgcaa cgtcattggc actggcttcg gtaccccctt caccaccgac   1200

actggtgatg ctcttcagga cgctttcatc tgggtcaagc ccggtggtga gtgtgacggt   1260

acctcggaca catcctctcc tcgctacgac gcccactgcg gatacagcga tgccctcaag   1320

ccggcccccg aggctggaac ttggttccaa gcctacttcg agcagctgct cgtcaacgcc   1380

aacccaagct tctaa                                                    1395


<210> 82
<211> 464
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (117)...(431)
<223> Glycosyl hydrolases family 6

<220> 
<221> DOMAIN
<222> (22)...(50)
<223> Fungal cellulose binding domain

<400> 82
Met Gln Arg Thr Ser Ala Trp Ala Leu Leu Leu Leu Ala Gln Ile Ala
1               5                   10                  15      


Thr Ala Gln Gln Thr Val Trp Gly Gln Cys Gly Gly Ile Gly Tyr Ser
            20                  25                  30          


Gly Pro Thr Ser Cys Val Ala Gly Ser Ser Cys Ser Thr Gln Asn Ser
        35                  40                  45              


Tyr Tyr Ala Gln Cys Leu Pro Gly Ser Gly Asn Gly Gly Gly Gly Ala
    50                  55                  60                  


Ala Thr Thr Thr Thr Thr Ala Gly Gln Thr Thr Lys Thr Thr Met Ala
65                  70                  75                  80  


Thr Thr Thr Thr Thr Ser Thr Lys Thr Ser Ala Gly Ser Gly Gly Ser
                85                  90                  95      


Thr Thr Thr Ala Pro Pro Ala Ser Asn Ser Gly Asn Pro Phe Lys Gly
            100                 105                 110         


Tyr Gln Pro Tyr Val Asn Pro Tyr Tyr Ala Ser Glu Val Gln Ser Leu
        115                 120                 125             


Ala Ile Pro Ser Leu Ala Ala Ser Leu Ala Pro Lys Ala Ser Ala Val
    130                 135                 140                 


Ala Lys Val Pro Ser Phe Val Trp Leu Asp Thr Ala Ala Lys Val Pro
145                 150                 155                 160 


Thr Met Gly Thr Tyr Leu Ala Asp Ile Lys Ala Lys Asn Ala Ala Gly
                165                 170                 175     


Ala Asn Pro Pro Ile Ala Gly Ile Phe Val Val Tyr Asp Leu Pro Asp
            180                 185                 190         


Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Tyr Ser Ile Ala Asn
        195                 200                 205             


Gly Gly Val Ala Asn Tyr Lys Lys Tyr Ile Asp Ser Ile Arg Ala Gln
    210                 215                 220                 


Leu Leu Lys Tyr Pro Asp Val His Thr Ile Leu Val Ile Glu Pro Asp
225                 230                 235                 240 


Ser Leu Ala Asn Leu Val Thr Asn Met Asn Val Ala Lys Cys Ser Gly
                245                 250                 255     


Ala His Asp Ala Tyr Leu Glu Cys Thr Asp Tyr Ala Leu Lys Gln Leu
            260                 265                 270         


Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp
        275                 280                 285             


Leu Gly Trp Pro Ala Asn Ile Gly Pro Ala Ala Asp Leu Phe Ala Ser
    290                 295                 300                 


Val Tyr Lys Asn Ala Gly Ser Pro Ala Ala Val Arg Gly Leu Ala Thr
305                 310                 315                 320 


Asn Val Ala Asn Tyr Asn Ala Trp Ser Ile Ser Thr Cys Pro Ser Tyr
                325                 330                 335     


Thr Gln Gly Asp Gln Asn Cys Asp Glu Lys Arg Tyr Ile Asn Ala Leu
            340                 345                 350         


Ala Pro Leu Leu Arg Ala Asn Gly Phe Asp Ala His Phe Ile Met Asp
        355                 360                 365             


Thr Ser Arg Asn Gly Val Gln Pro Thr Lys Gln Gln Ala Trp Gly Asp
    370                 375                 380                 


Trp Cys Asn Val Ile Gly Thr Gly Phe Gly Thr Pro Phe Thr Thr Asp
385                 390                 395                 400 


Thr Gly Asp Ala Leu Gln Asp Ala Phe Ile Trp Val Lys Pro Gly Gly
                405                 410                 415     


Glu Cys Asp Gly Thr Ser Asp Thr Ser Ser Pro Arg Tyr Asp Ala His
            420                 425                 430         


Cys Gly Tyr Ser Asp Ala Leu Lys Pro Ala Pro Glu Ala Gly Thr Trp
        435                 440                 445             


Phe Gln Ala Tyr Phe Glu Gln Leu Leu Val Asn Ala Asn Pro Ser Phe
    450                 455                 460                 


<210> 83
<211> 1365
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 83
atgtatcgac gcgcagtcct tttctccgcc ctggcggcgg cagcccatgc tcagcaggtt     60

ggcacatcga agcccgagac ccatccctct ttgacctggc aaaagtgcac tgccaagggc    120

agttgcactg accagaaggg ttctgttgtc atcgatgcca actggcgctg gcttcactct    180

gtcgatgggt acaccaactg ctacactgga aacgagtggg atgcgactct ctgccctgat    240

gacaagactt gcgccacaaa ctgtgctctg gagggcgctg actacgccgc cacctacggt    300

gcgaccaccg atggcaatgc cctcactttg agctttgtca ctggagccaa cgttggctct    360

cgtctgttct tgatggagga cgatagtacc taccagattt tcaagctgaa gaaccaggag    420

ttcacattcg atgtggacac ctccgctctg ccctgcggac tcaatggagc tctgtacttt    480

gtgtccatgg atgccgatgg tggcatggct aagtacgatg gcaacaaggc aggcgccaag    540

tacggaactg gctactgtga ctctcagtgc cctcgtgatc tgaagttcat caacggccag    600

gccaacgtgg acggctgggt gccttccgag aacgacaaga acgccggtgt tggcggccac    660

ggatcttgct gccctgagat ggatatctgg gaagccaata gcatttccac ggcctacacg    720

cctcaccctt gcgagagccc cgaacagacc atgtgcgagg gcgacaagtg tggcggaacc    780

tactcctcta ctcgctatgc aggaacctgc gatcccgatg gatgcgattt caactccttc    840

cgcatgggta acgagacctt cttcggccct ggcaagaccg tcgacaccaa gtccaagatg    900

actgttgtta ctcagttcat caccagtgac ggcaccgata ctggcgccct cagcgagatc    960

aagcgcatct acgtgcagga tggaaaggtt atcgccaact ctgcctcgga agttagcggt   1020

gtcaccggaa actctatcac ctcggacttc tgcacagccc agaagaaagc cttcggcgat   1080

gaggatgtct tcgcccagaa aggtggtctg gctggcatgg gcgagggttt ggaccaggga   1140

atggttttgg tcatgagctt gtgggatgac cactatgcta acatgctctg gttggacggc   1200

gaggtctacc ccactgatgc ctctgcctcc gatcctggtg ctgctcgcgg tacatgcgcc   1260

accacctccg gcgacccagc gaccattgag ggtgagtcgg gctcggccaa ggtgacctac   1320

tccaacatca aggttggacc tattggctct acctacgctt cctaa                   1365


<210> 84
<211> 454
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(17)

<220> 
<221> DOMAIN
<222> (19)...(454)
<223> Glycosyl hydrolase family 7

<400> 84
Met Tyr Arg Arg Ala Val Leu Phe Ser Ala Leu Ala Ala Ala Ala His
1               5                   10                  15      


Ala Gln Gln Val Gly Thr Ser Lys Pro Glu Thr His Pro Ser Leu Thr
            20                  25                  30          


Trp Gln Lys Cys Thr Ala Lys Gly Ser Cys Thr Asp Gln Lys Gly Ser
        35                  40                  45              


Val Val Ile Asp Ala Asn Trp Arg Trp Leu His Ser Val Asp Gly Tyr
    50                  55                  60                  


Thr Asn Cys Tyr Thr Gly Asn Glu Trp Asp Ala Thr Leu Cys Pro Asp
65                  70                  75                  80  


Asp Lys Thr Cys Ala Thr Asn Cys Ala Leu Glu Gly Ala Asp Tyr Ala
                85                  90                  95      


Ala Thr Tyr Gly Ala Thr Thr Asp Gly Asn Ala Leu Thr Leu Ser Phe
            100                 105                 110         


Val Thr Gly Ala Asn Val Gly Ser Arg Leu Phe Leu Met Glu Asp Asp
        115                 120                 125             


Ser Thr Tyr Gln Ile Phe Lys Leu Lys Asn Gln Glu Phe Thr Phe Asp
    130                 135                 140                 


Val Asp Thr Ser Ala Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe
145                 150                 155                 160 


Val Ser Met Asp Ala Asp Gly Gly Met Ala Lys Tyr Asp Gly Asn Lys
                165                 170                 175     


Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg
            180                 185                 190         


Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Asp Gly Trp Val Pro
        195                 200                 205             


Ser Glu Asn Asp Lys Asn Ala Gly Val Gly Gly His Gly Ser Cys Cys
    210                 215                 220                 


Pro Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Thr Ala Tyr Thr
225                 230                 235                 240 


Pro His Pro Cys Glu Ser Pro Glu Gln Thr Met Cys Glu Gly Asp Lys
                245                 250                 255     


Cys Gly Gly Thr Tyr Ser Ser Thr Arg Tyr Ala Gly Thr Cys Asp Pro
            260                 265                 270         


Asp Gly Cys Asp Phe Asn Ser Phe Arg Met Gly Asn Glu Thr Phe Phe
        275                 280                 285             


Gly Pro Gly Lys Thr Val Asp Thr Lys Ser Lys Met Thr Val Val Thr
    290                 295                 300                 


Gln Phe Ile Thr Ser Asp Gly Thr Asp Thr Gly Ala Leu Ser Glu Ile
305                 310                 315                 320 


Lys Arg Ile Tyr Val Gln Asp Gly Lys Val Ile Ala Asn Ser Ala Ser
                325                 330                 335     


Glu Val Ser Gly Val Thr Gly Asn Ser Ile Thr Ser Asp Phe Cys Thr
            340                 345                 350         


Ala Gln Lys Lys Ala Phe Gly Asp Glu Asp Val Phe Ala Gln Lys Gly
        355                 360                 365             


Gly Leu Ala Gly Met Gly Glu Gly Leu Asp Gln Gly Met Val Leu Val
    370                 375                 380                 


Met Ser Leu Trp Asp Asp His Tyr Ala Asn Met Leu Trp Leu Asp Gly
385                 390                 395                 400 


Glu Val Tyr Pro Thr Asp Ala Ser Ala Ser Asp Pro Gly Ala Ala Arg
                405                 410                 415     


Gly Thr Cys Ala Thr Thr Ser Gly Asp Pro Ala Thr Ile Glu Gly Glu
            420                 425                 430         


Ser Gly Ser Ala Lys Val Thr Tyr Ser Asn Ile Lys Val Gly Pro Ile
        435                 440                 445             


Gly Ser Thr Tyr Ala Ser
    450                 


<210> 85
<211> 1170
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 85
atgagaaaaa acattttaat gctggccgta gctatgattg cggcaatgtg cgtaaccacg     60

tcgtgcggaa acaaagccca gaaacaggac gaaacgcagg ctggacaagt gaacaacttc    120

cgcattaagc gcggtaccaa catcagtcac tggctgtcgc agagcgagca gcgtggcgag    180

gcccgtcgcc tacacattca ggaggacgac tttgcacgtc tggaggagtt aggattcgac    240

tttgtacgta ttcccatcga cgaggtacag ttctgggacg aggacggcaa gcagctgccc    300

gaggcttggg gattgctgaa caacgcactc gattgggcta agaagcacaa cctgcgcgct    360

attgtggatc tgcacattat ccgctcacac tactttaacg cagcaaacga ggacgataaa    420

gctgctaaca ccctgtttac ttcagaggag tcgcagcagg gactgcttaa cctgtggaag    480

cagctgtcgg acaccttgaa gaaccgcagc aacgactggg tggcttacga gtttatgaac    540

gagcctgtag cacccgagca cgagcagtgg aacctgctgg tagccaaggt acacaaggcc    600

ctgcgcgaac tggagccaca gcgtacactg gtgattggta gtaacatgtg gcagggacac    660

gagaccatga agtacctgaa ggtgcccgag ggcgacaaga atatcatctt gagtttccac    720

tactacaacc ccatgattct gacacactat ggtgcttggt ggacaccact gggcaagtat    780

cagggcaagg tgaactatcc tggtgtgctg gtatcgaagg aggattacga ggctgctcct    840

gctgagatta aggatcagct gaagccttac accgagcagg tttgggacat caacaccatt    900

cgtgcccagt ttaaggatgc catcgaggct gccaagaagt acgacctgca gctgttctgc    960

ggtgagtggg gtgtttacga gccagtggac cgcgagttgg cttacaactg gacacgcgac   1020

atgctgaccg tattcgacga gtacaacatt gcctggacta cctggtgcta cgatgccgac   1080

tttggtttct gggatcagca gcgccacacc ttcaaggacc gtccattggt tgagttgctg   1140

atgagtggca aaaaactggg agaggaatga                                    1170


<210> 86
<211> 389
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (49)...(365)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 86
Met Arg Lys Asn Ile Leu Met Leu Ala Val Ala Met Ile Ala Ala Met
1               5                   10                  15      


Cys Val Thr Thr Ser Cys Gly Asn Lys Ala Gln Lys Gln Asp Glu Thr
            20                  25                  30          


Gln Ala Gly Gln Val Asn Asn Phe Arg Ile Lys Arg Gly Thr Asn Ile
        35                  40                  45              


Ser His Trp Leu Ser Gln Ser Glu Gln Arg Gly Glu Ala Arg Arg Leu
    50                  55                  60                  


His Ile Gln Glu Asp Asp Phe Ala Arg Leu Glu Glu Leu Gly Phe Asp
65                  70                  75                  80  


Phe Val Arg Ile Pro Ile Asp Glu Val Gln Phe Trp Asp Glu Asp Gly
                85                  90                  95      


Lys Gln Leu Pro Glu Ala Trp Gly Leu Leu Asn Asn Ala Leu Asp Trp
            100                 105                 110         


Ala Lys Lys His Asn Leu Arg Ala Ile Val Asp Leu His Ile Ile Arg
        115                 120                 125             


Ser His Tyr Phe Asn Ala Ala Asn Glu Asp Asp Lys Ala Ala Asn Thr
    130                 135                 140                 


Leu Phe Thr Ser Glu Glu Ser Gln Gln Gly Leu Leu Asn Leu Trp Lys
145                 150                 155                 160 


Gln Leu Ser Asp Thr Leu Lys Asn Arg Ser Asn Asp Trp Val Ala Tyr
                165                 170                 175     


Glu Phe Met Asn Glu Pro Val Ala Pro Glu His Glu Gln Trp Asn Leu
            180                 185                 190         


Leu Val Ala Lys Val His Lys Ala Leu Arg Glu Leu Glu Pro Gln Arg
        195                 200                 205             


Thr Leu Val Ile Gly Ser Asn Met Trp Gln Gly His Glu Thr Met Lys
    210                 215                 220                 


Tyr Leu Lys Val Pro Glu Gly Asp Lys Asn Ile Ile Leu Ser Phe His
225                 230                 235                 240 


Tyr Tyr Asn Pro Met Ile Leu Thr His Tyr Gly Ala Trp Trp Thr Pro
                245                 250                 255     


Leu Gly Lys Tyr Gln Gly Lys Val Asn Tyr Pro Gly Val Leu Val Ser
            260                 265                 270         


Lys Glu Asp Tyr Glu Ala Ala Pro Ala Glu Ile Lys Asp Gln Leu Lys
        275                 280                 285             


Pro Tyr Thr Glu Gln Val Trp Asp Ile Asn Thr Ile Arg Ala Gln Phe
    290                 295                 300                 


Lys Asp Ala Ile Glu Ala Ala Lys Lys Tyr Asp Leu Gln Leu Phe Cys
305                 310                 315                 320 


Gly Glu Trp Gly Val Tyr Glu Pro Val Asp Arg Glu Leu Ala Tyr Asn
                325                 330                 335     


Trp Thr Arg Asp Met Leu Thr Val Phe Asp Glu Tyr Asn Ile Ala Trp
            340                 345                 350         


Thr Thr Trp Cys Tyr Asp Ala Asp Phe Gly Phe Trp Asp Gln Gln Arg
        355                 360                 365             


His Thr Phe Lys Asp Arg Pro Leu Val Glu Leu Leu Met Ser Gly Lys
    370                 375                 380                 


Lys Leu Gly Glu Glu
385                 


<210> 87
<211> 1344
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 87
atgcttcccc ttgttcttct ttccctgctc ggcgccgtta cggcccagca ggtcggcacc     60

aacgaacctg agacccatcc ccggatgtcc tggaagaagt gcaccggcaa cgccaactgc    120

cagactgtca acggcgaggt cgtcattgac tccaactggc gctggatcca caaggtcggc    180

ggcactgaaa actgctacga aggcaacaag tggacgggca cctgcaccag caacagcgac    240

tgtgccaaca actgcgccat ggagggtgcc aactacccca acacctacgg tgtgacgacc    300

agcggcgatg ccatgactct caagtttgtg acgcagcacc agcacggcac caacgtcggc    360

tctcgtctgt acctgatgaa ctcgcctacc cgctacgaga tgttcaacct catcaacaac    420

gagttcacct ttgacgttga cctgtccacc gtggcctgtg gtctcaacag cgccctctac    480

tttgtcgcca tggacgccga cggtggcctg ggcaagttcc cgtccaacaa ggccggtgcc    540

aagtacggta ccggatactg tgactcgcag tgcgcccgtg acctcaagtt catcggcggt    600

gagggcaact accagggttg ggtcccctcg tccagcgact cccaggctgg tattggaaac    660

atgggcgcct gctgcgccga gattgacgtc tgggagtcca actcccactc gtacgccctg    720

accccccacg cgtgctccaa caacaacttt cacatctgcc gtggcgacga gaactgtggc    780

ggcacctact cgcccgaccg cttcaagggc ctgtgcgacg ccaacggctg cgactacaac    840

ccctaccgcc tgggccgaca ggacttttac ggagccggca agcaggttga cacttccaag    900

aagttcaccg tcgtgaccca gttcaccaac aactcgctca agcagttctt tgtccagaac    960

ggccgccgca tcgacgtccc cacccccggc cacagtggcc ttcccgccag caacgaggtc   1020

aacaagaact tctgcgacaa cgtcttccgc gtctttggtg accgcaaccg ctacaacgag   1080

gtcggaggct ggactgccat gcaggatgcc ctgcgaaagc cccatgtcct ggtcatgtcc   1140

atctgggccg accactacgc caacatgctc tggctcgacg gtgtctggcc ccgcggcggc   1200

aaccccgcta ctcccggcat caagcgcggt gactgccctg ctgaaggcag ctccccgccc   1260

gaggttattg ctaaccaccc caacgctttc gtcacctggt ccaacatccg cttcggaccc   1320

atcggatcca ccactggcct gtag                                          1344


<210> 88
<211> 447
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (17)...(447)
<223> Glycosyl hydrolase family 7

<400> 88
Met Leu Pro Leu Val Leu Leu Ser Leu Leu Gly Ala Val Thr Ala Gln
1               5                   10                  15      


Gln Val Gly Thr Asn Glu Pro Glu Thr His Pro Arg Met Ser Trp Lys
            20                  25                  30          


Lys Cys Thr Gly Asn Ala Asn Cys Gln Thr Val Asn Gly Glu Val Val
        35                  40                  45              


Ile Asp Ser Asn Trp Arg Trp Ile His Lys Val Gly Gly Thr Glu Asn
    50                  55                  60                  


Cys Tyr Glu Gly Asn Lys Trp Thr Gly Thr Cys Thr Ser Asn Ser Asp
65                  70                  75                  80  


Cys Ala Asn Asn Cys Ala Met Glu Gly Ala Asn Tyr Pro Asn Thr Tyr
                85                  90                  95      


Gly Val Thr Thr Ser Gly Asp Ala Met Thr Leu Lys Phe Val Thr Gln
            100                 105                 110         


His Gln His Gly Thr Asn Val Gly Ser Arg Leu Tyr Leu Met Asn Ser
        115                 120                 125             


Pro Thr Arg Tyr Glu Met Phe Asn Leu Ile Asn Asn Glu Phe Thr Phe
    130                 135                 140                 


Asp Val Asp Leu Ser Thr Val Ala Cys Gly Leu Asn Ser Ala Leu Tyr
145                 150                 155                 160 


Phe Val Ala Met Asp Ala Asp Gly Gly Leu Gly Lys Phe Pro Ser Asn
                165                 170                 175     


Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Ala
            180                 185                 190         


Arg Asp Leu Lys Phe Ile Gly Gly Glu Gly Asn Tyr Gln Gly Trp Val
        195                 200                 205             


Pro Ser Ser Ser Asp Ser Gln Ala Gly Ile Gly Asn Met Gly Ala Cys
    210                 215                 220                 


Cys Ala Glu Ile Asp Val Trp Glu Ser Asn Ser His Ser Tyr Ala Leu
225                 230                 235                 240 


Thr Pro His Ala Cys Ser Asn Asn Asn Phe His Ile Cys Arg Gly Asp
                245                 250                 255     


Glu Asn Cys Gly Gly Thr Tyr Ser Pro Asp Arg Phe Lys Gly Leu Cys
            260                 265                 270         


Asp Ala Asn Gly Cys Asp Tyr Asn Pro Tyr Arg Leu Gly Arg Gln Asp
        275                 280                 285             


Phe Tyr Gly Ala Gly Lys Gln Val Asp Thr Ser Lys Lys Phe Thr Val
    290                 295                 300                 


Val Thr Gln Phe Thr Asn Asn Ser Leu Lys Gln Phe Phe Val Gln Asn
305                 310                 315                 320 


Gly Arg Arg Ile Asp Val Pro Thr Pro Gly His Ser Gly Leu Pro Ala
                325                 330                 335     


Ser Asn Glu Val Asn Lys Asn Phe Cys Asp Asn Val Phe Arg Val Phe
            340                 345                 350         


Gly Asp Arg Asn Arg Tyr Asn Glu Val Gly Gly Trp Thr Ala Met Gln
        355                 360                 365             


Asp Ala Leu Arg Lys Pro His Val Leu Val Met Ser Ile Trp Ala Asp
    370                 375                 380                 


His Tyr Ala Asn Met Leu Trp Leu Asp Gly Val Trp Pro Arg Gly Gly
385                 390                 395                 400 


Asn Pro Ala Thr Pro Gly Ile Lys Arg Gly Asp Cys Pro Ala Glu Gly
                405                 410                 415     


Ser Ser Pro Pro Glu Val Ile Ala Asn His Pro Asn Ala Phe Val Thr
            420                 425                 430         


Trp Ser Asn Ile Arg Phe Gly Pro Ile Gly Ser Thr Thr Gly Leu
        435                 440                 445         


<210> 89
<211> 2238
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 89
ttgaacaccg gctggcgcgg aagcttcctc gcagtcgccg cggtttcgct ggcggcgctc     60

gcgacttcat ctgtggcgca ggcgccgagc gccggcgagc gcgccgaggc cgcggtcagc    120

aagatgactg tcgaggagaa gctgaagctc gtcttcggct atttcgccag cgactgggag    180

ggcaagaagc cgcccgctga agcgcactac gggtcggccg gctacgtgcc gggaattccg    240

cggctcggca tcccaccgca atatgagacc gatgccggcg ttggagtcgc gacccagggt    300

gccgccaaga cgaagcgcga gcgcacgtcg ctgccgtcgg gaatcgcgac cgcggcgacg    360

tggaacccgg agatcgcctt ccagggcggg cgaatgatcg gatctgaggc gcgcgcgtcg    420

ggcttcaacg tcatgctcgc ggggggcgtc aacctgctgc gcgacccgcg caacgggcgc    480

aacttcgaat atggcggcga ggacccgctg ctcgccggca cgatcgtcgg ctccgagatc    540

gcgggcatcc agtccaacca gatcatctcg accacgaagc attacgcgct caacgacctc    600

gagaccggcc gcaagggcca cgacgtgcgc atcgatccag ctgccgcacg catgtcggac    660

ctgctggcat tccagctcgc gatcgaacgc ggcgatcccg gctcggtgat gtgctcgtac    720

aacaaggtgg gcggcgactt cgcctgcgag aacgactggc ttttgaacca ggtgctgaag    780

ggcgactggg gcttccgtgg ctatgtgatg agcgactggg gcgcggtgca cagcacggtc    840

aaggcggccg tcaacggcct cgaccagcaa tcgggctggc cgttcgacga caagccctat    900

ctcggctcgc tgctgaagca ggcggtcgcg tcggggcagg tgcccaaatc acggctcgac    960

gacatggctc ggcgcgtgct gtacgcgatg ttcgcgcacg gcgtcgtcga caatcccgtc   1020

accgaaggcg gagcgatcga ctatgcggcc gacgaggcag tgagccgcgc cgacgcagaa   1080

caggcgatcg tgctgctcaa gaacgaaggc aacctgctcc cgctcgatgc gcggcggatc   1140

gtcgtcatcg gcggccatgc cgacaagggc gtgctcgcgg gcagcggctc gtcactggtc   1200

tatccgcgcg gcggcaatgc ggtcccgggc ctaccgccga ccggctggcc cggaccggtg   1260

atgtatttcc cgtcatcgcc ggtgcaggcg ctgcagcggc tggtgccgaa cgcgagcgtg   1320

agcttcgtcg acggcaccga tccgactgcg gctgccgccg cggctcaggc ggcggacgtc   1380

gcgatcgtct tcggcaccca atggtcgagc gagtcgatcg acgtgccgat gaagctcgac   1440

ggcaaccagg acgcgctgat cgccgccgtc gccgcggcga acccgcggac cgtcgtcgtg   1500

ctcgaaacca atgccggggt gacgatgccc tgggccgcgc gcgtgccagc catcgtcgag   1560

gcctggtatc cgggatcggc ggggggcgag gcgatcgcca acgtcctcac cggacgcgtg   1620

aacccgtcgg gccgcctgcc ggtgaccttt tatgcgtccg aagcgcagct tccgcgcccg   1680

gcgcgtcccg gcaaggatag cgagatggac cagttcgacc tgccttacgc ggagggtgca   1740

gcggtcggct acaaatgggt cgatcgcaac aatctgcagc cgctgttccc gttcgggcac   1800

ggccttgcct acacgactta cgactatggg ccgatcagcg tcgctccgga gccgaacggc   1860

ggccttcggg tgcagttcac gctgcggaac acgggcaatc gcccgggcat ggccgtcggg   1920

caggtttacg cctcgccggc gagcggcggc tgggaggcgc ccaggcggct cgtcggcttc   1980

gccaaggtcg agctcgcgcc cggcgccgcg cagacggtcg ccgtcgatgt cgatccgcgc   2040

ctgctcgcca cgttcgacga gccggggcac agctggaaca tcgcgcccgg cacctacaat   2100

ctgatgcttg gagcctcgtc gcgcgacctg cgcagcaaca cgagcgtcac gcttccggcg   2160

ctcagccttc cggcgaactg gcggccggga caggaaggcg ctgccccagc gccgcggccc   2220

ggcgagcgag gccgataa                                                 2238


<210> 90
<211> 745
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (76)...(292)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (363)...(606)
<223> Glycosyl hydrolase family 3 C terminal domain

<400> 90
Met Asn Thr Gly Trp Arg Gly Ser Phe Leu Ala Val Ala Ala Val Ser
1               5                   10                  15      


Leu Ala Ala Leu Ala Thr Ser Ser Val Ala Gln Ala Pro Ser Ala Gly
            20                  25                  30          


Glu Arg Ala Glu Ala Ala Val Ser Lys Met Thr Val Glu Glu Lys Leu
        35                  40                  45              


Lys Leu Val Phe Gly Tyr Phe Ala Ser Asp Trp Glu Gly Lys Lys Pro
    50                  55                  60                  


Pro Ala Glu Ala His Tyr Gly Ser Ala Gly Tyr Val Pro Gly Ile Pro
65                  70                  75                  80  


Arg Leu Gly Ile Pro Pro Gln Tyr Glu Thr Asp Ala Gly Val Gly Val
                85                  90                  95      


Ala Thr Gln Gly Ala Ala Lys Thr Lys Arg Glu Arg Thr Ser Leu Pro
            100                 105                 110         


Ser Gly Ile Ala Thr Ala Ala Thr Trp Asn Pro Glu Ile Ala Phe Gln
        115                 120                 125             


Gly Gly Arg Met Ile Gly Ser Glu Ala Arg Ala Ser Gly Phe Asn Val
    130                 135                 140                 


Met Leu Ala Gly Gly Val Asn Leu Leu Arg Asp Pro Arg Asn Gly Arg
145                 150                 155                 160 


Asn Phe Glu Tyr Gly Gly Glu Asp Pro Leu Leu Ala Gly Thr Ile Val
                165                 170                 175     


Gly Ser Glu Ile Ala Gly Ile Gln Ser Asn Gln Ile Ile Ser Thr Thr
            180                 185                 190         


Lys His Tyr Ala Leu Asn Asp Leu Glu Thr Gly Arg Lys Gly His Asp
        195                 200                 205             


Val Arg Ile Asp Pro Ala Ala Ala Arg Met Ser Asp Leu Leu Ala Phe
    210                 215                 220                 


Gln Leu Ala Ile Glu Arg Gly Asp Pro Gly Ser Val Met Cys Ser Tyr
225                 230                 235                 240 


Asn Lys Val Gly Gly Asp Phe Ala Cys Glu Asn Asp Trp Leu Leu Asn
                245                 250                 255     


Gln Val Leu Lys Gly Asp Trp Gly Phe Arg Gly Tyr Val Met Ser Asp
            260                 265                 270         


Trp Gly Ala Val His Ser Thr Val Lys Ala Ala Val Asn Gly Leu Asp
        275                 280                 285             


Gln Gln Ser Gly Trp Pro Phe Asp Asp Lys Pro Tyr Leu Gly Ser Leu
    290                 295                 300                 


Leu Lys Gln Ala Val Ala Ser Gly Gln Val Pro Lys Ser Arg Leu Asp
305                 310                 315                 320 


Asp Met Ala Arg Arg Val Leu Tyr Ala Met Phe Ala His Gly Val Val
                325                 330                 335     


Asp Asn Pro Val Thr Glu Gly Gly Ala Ile Asp Tyr Ala Ala Asp Glu
            340                 345                 350         


Ala Val Ser Arg Ala Asp Ala Glu Gln Ala Ile Val Leu Leu Lys Asn
        355                 360                 365             


Glu Gly Asn Leu Leu Pro Leu Asp Ala Arg Arg Ile Val Val Ile Gly
    370                 375                 380                 


Gly His Ala Asp Lys Gly Val Leu Ala Gly Ser Gly Ser Ser Leu Val
385                 390                 395                 400 


Tyr Pro Arg Gly Gly Asn Ala Val Pro Gly Leu Pro Pro Thr Gly Trp
                405                 410                 415     


Pro Gly Pro Val Met Tyr Phe Pro Ser Ser Pro Val Gln Ala Leu Gln
            420                 425                 430         


Arg Leu Val Pro Asn Ala Ser Val Ser Phe Val Asp Gly Thr Asp Pro
        435                 440                 445             


Thr Ala Ala Ala Ala Ala Ala Gln Ala Ala Asp Val Ala Ile Val Phe
    450                 455                 460                 


Gly Thr Gln Trp Ser Ser Glu Ser Ile Asp Val Pro Met Lys Leu Asp
465                 470                 475                 480 


Gly Asn Gln Asp Ala Leu Ile Ala Ala Val Ala Ala Ala Asn Pro Arg
                485                 490                 495     


Thr Val Val Val Leu Glu Thr Asn Ala Gly Val Thr Met Pro Trp Ala
            500                 505                 510         


Ala Arg Val Pro Ala Ile Val Glu Ala Trp Tyr Pro Gly Ser Ala Gly
        515                 520                 525             


Gly Glu Ala Ile Ala Asn Val Leu Thr Gly Arg Val Asn Pro Ser Gly
    530                 535                 540                 


Arg Leu Pro Val Thr Phe Tyr Ala Ser Glu Ala Gln Leu Pro Arg Pro
545                 550                 555                 560 


Ala Arg Pro Gly Lys Asp Ser Glu Met Asp Gln Phe Asp Leu Pro Tyr
                565                 570                 575     


Ala Glu Gly Ala Ala Val Gly Tyr Lys Trp Val Asp Arg Asn Asn Leu
            580                 585                 590         


Gln Pro Leu Phe Pro Phe Gly His Gly Leu Ala Tyr Thr Thr Tyr Asp
        595                 600                 605             


Tyr Gly Pro Ile Ser Val Ala Pro Glu Pro Asn Gly Gly Leu Arg Val
    610                 615                 620                 


Gln Phe Thr Leu Arg Asn Thr Gly Asn Arg Pro Gly Met Ala Val Gly
625                 630                 635                 640 


Gln Val Tyr Ala Ser Pro Ala Ser Gly Gly Trp Glu Ala Pro Arg Arg
                645                 650                 655     


Leu Val Gly Phe Ala Lys Val Glu Leu Ala Pro Gly Ala Ala Gln Thr
            660                 665                 670         


Val Ala Val Asp Val Asp Pro Arg Leu Leu Ala Thr Phe Asp Glu Pro
        675                 680                 685             


Gly His Ser Trp Asn Ile Ala Pro Gly Thr Tyr Asn Leu Met Leu Gly
    690                 695                 700                 


Ala Ser Ser Arg Asp Leu Arg Ser Asn Thr Ser Val Thr Leu Pro Ala
705                 710                 715                 720 


Leu Ser Leu Pro Ala Asn Trp Arg Pro Gly Gln Glu Gly Ala Ala Pro
                725                 730                 735     


Ala Pro Arg Pro Gly Glu Arg Gly Arg
            740                 745 


<210> 91
<211> 2637
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 91
atgactagtg gacgaaacac atgtgtgtgt ctgttgttga ttgtgctggc gatcggtctt     60

ctgtcaaagc caccggcgag cgcgcaaaat gaggcgcctt ataaagccac gctgacgatc    120

cggttggacc aaccgggagc ggtgatcaat cgcaacatct acggccagtt tgcggagcat    180

ctcggacgtt tgatctacga cgggctctgg gttggtgaag gatcgtcgat cccgaacacg    240

cgcggattgc gtaacgacgt cgttacggcg ttaaaagaat tgcatgtgcc tgtgctgcgt    300

tggcccggcg gctgttttgc cgacgagtat cactggcgtg acggcattgg accacgcgac    360

aagcgtccgc ggcggccgaa cgcgagttgg ggcggcgtcg attcgaatgc gtttggcacg    420

catgagttca tggagctgtg cgagatgttg ggcgcagacg cttatatcaa tggcaacgtc    480

ggcagcggca cgccgcagga gatgatggaa tggatcgagt acatgacttc cgattccgat    540

tcggatctcg ccaacctgcg ccgtcgcaat ggccgcgaca agccgtggaa ggtgccgtat    600

ttcgccgtcg gcaatgagac gtggggctgt ggtggaaata tgcggccgga gttttacgcc    660

gacgtgtatc gccagtacgc cacgttcatc aagaaccatt caggcaatcg cattcagaaa    720

ctcgcgagcg gtggttacga caacaattac aactggaccg aggtgctgat ggcgcaggcg    780

gcgaagcaga tcgatggcct gtcgttgcac tattacacgc tgcccaccgg caactgggac    840

aagaaaggat cggcgacgga attcggcgaa agcgagtggc acgcgacgct cgccaggacg    900

ttgcgcatcg aggagttcat tcagaagcac agcgcgatca tggacaagca cgatccgcag    960

aagcgcgtcg gtttgatggt tgacgagtgg ggcacgtggt acgaccgcga cgagggccgc   1020

gacatgggcg cgctttatca gcagaacacg ttgcgcgatg cggttgcggc cggtatcaat   1080

ctcaatatct ttcacaagta tgccgatcgc gtgcgcatgg cgaacatcgc gcagatggtg   1140

aacgtgttgc aggcgatggt gttgacggac aaagagaaaa tggtgctgac gccgacgtat   1200

cacgtttttc ggatgtatcg cgtgcatcag ggagcgacgc tgatcccggt cgaggttagt   1260

gcgccgcagt acacgctggg tggtgcgtct gtgccgtcgt tgagcgtgtc ggcttcgcgt   1320

gacggtgaag gacgggtgca tctgtcgatc gtgaatctcg atccagcgcg ggcggcggag   1380

atcgatgcga acggaccgtt cagcagtgtc aagggagaag tattgactgc gccggcggtg   1440

aatgcgctga atactttcga tcacccggat agtgtcaagc ccgtgtcttt taatggatat   1500

aaattagaag gctctaaatt aatcctgaat attccggcga aatccgtggt ggtgttggaa   1560

cttggaccac agaaacaagc aacgctcaaa gatgcattca aaaacgattt catgatcggc   1620

gcggcgctca accggcgaca gttcttcgaa gaagacgctc gcggcgcaga gatcgtgcgc   1680

atgcatttca actcgatcac gccggagaac gtgttgaagt gggggctggt ccatcccgaa   1740

ccgaacaagt acgacttcac cgctcccgat cgcttcgtcg aattcggcga gaagcacggc   1800

atgttcgtcg tcggacacac gctcgtctgg cataaccaaa cgccgcgctg ggtttttgaa   1860

gacgaaaaga aacagccgct cgatcgcgag acgttgctga aacgaatgcg cgatcacatc   1920

ttcaccgtcg tcggccgtta caagggacgc attaaaggct gggacgtagt caacgaggcg   1980

ctgaatcagg atggcacgat gcggcagtcg ccgtggttca agatcatcgg cgaggattat   2040

ctcgtcaaag cgtttgagtt tgcccacgag gccgatccag ccgccgagct ttattacaac   2100

gactacgatc tcgagctgcc ggcgaagcgc gcaggcgccg tcgaactgct gaagaaactg   2160

aaagccgcgg gtgtgtcgct tgctggtgtg ggattgcaga accacagtct catggagtgg   2220

ccgtcagccg cagatgtgga tgcgacgatc gcggcgttcg cgaatctggg tttgaaggtt   2280

cacatcacgg aactcgacgt cgacgtgctg ccgcgcacga cgaaacccgg tgcggattac   2340

gcagtcgacg tgaaggtgac gccgcagttg aacccgtatc tcgacggctt accggaggcg   2400

cgacagtcgg cgttggcgag gcgttatgcg gagctgtttc acgtgtttag aaaacatcgc   2460

gacgcgatcg agcgtgtgac gttctgggga gttgcggacg gcgattcgtg gttgaacaac   2520

tggcccatcc gcggcaggac aaactatccg ctgctcttcg atcgttccgg ccaaccgaaa   2580

ccggcgttag cgtcggtgat cgaaaccgct aattattcaa cggaacgtcg acggtga      2637

<210> 92
<211> 878
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (328)...(515)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> DOMAIN
<222> (528)...(870)
<223> Glycosyl hydrolase family 10

<220> 
<221> SITE
<222> (128)...(131)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (208)...(211)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (235)...(238)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (254)...(257)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (745)...(748)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (768)...(778)
<223> Glycosyl hydrolases family 10 active site. Prosite id = PS00591

<220> 
<221> SITE
<222> (884)...(887)
<223> N-glycosylation site. Prosite id = PS00001

<400> 92
Met Thr Ser Gly Arg Asn Thr Cys Val Cys Leu Leu Leu Ile Val Leu 
1               5                   10                  15      


Ala Ile Gly Leu Leu Ser Lys Pro Pro Ala Ser Ala Gln Asn Glu Ala 
            20                  25                  30          


Pro Tyr Lys Ala Thr Leu Thr Ile Arg Leu Asp Gln Pro Gly Ala Val 
        35                  40                  45              


Ile Asn Arg Asn Ile Tyr Gly Gln Phe Ala Glu His Leu Gly Arg Leu 
    50                  55                  60                  


Ile Tyr Asp Gly Leu Trp Val Gly Glu Gly Ser Ser Ile Pro Asn Thr 
65                  70                  75                  80  


Arg Gly Leu Arg Asn Asp Val Val Thr Ala Leu Lys Glu Leu His Val 
                85                  90                  95      


Pro Val Leu Arg Trp Pro Gly Gly Cys Phe Ala Asp Glu Tyr His Trp 
            100                 105                 110         


Arg Asp Gly Ile Gly Pro Arg Asp Lys Arg Pro Arg Arg Pro Asn Ala 
        115                 120                 125             


Ser Trp Gly Gly Val Asp Ser Asn Ala Phe Gly Thr His Glu Phe Met 
    130                 135                 140                 


Glu Leu Cys Glu Met Leu Gly Ala Asp Ala Tyr Ile Asn Gly Asn Val 
145                 150                 155                 160 


Gly Ser Gly Thr Pro Gln Glu Met Met Glu Trp Ile Glu Tyr Met Thr 
                165                 170                 175     


Ser Asp Ser Asp Ser Asp Leu Ala Asn Leu Arg Arg Arg Asn Gly Arg 
            180                 185                 190         


Asp Lys Pro Trp Lys Val Pro Tyr Phe Ala Val Gly Asn Glu Thr Trp 
        195                 200                 205             


Gly Cys Gly Gly Asn Met Arg Pro Glu Phe Tyr Ala Asp Val Tyr Arg 
    210                 215                 220                 


Gln Tyr Ala Thr Phe Ile Lys Asn His Ser Gly Asn Arg Ile Gln Lys 
225                 230                 235                 240 


Leu Ala Ser Gly Gly Tyr Asp Asn Asn Tyr Asn Trp Thr Glu Val Leu 
                245                 250                 255     


Met Ala Gln Ala Ala Lys Gln Ile Asp Gly Leu Ser Leu His Tyr Tyr 
            260                 265                 270         


Thr Leu Pro Thr Gly Asn Trp Asp Lys Lys Gly Ser Ala Thr Glu Phe 
        275                 280                 285             


Gly Glu Ser Glu Trp His Ala Thr Leu Ala Arg Thr Leu Arg Ile Glu 
    290                 295                 300                 


Glu Phe Ile Gln Lys His Ser Ala Ile Met Asp Lys His Asp Pro Gln 
305                 310                 315                 320 


Lys Arg Val Gly Leu Met Val Asp Glu Trp Gly Thr Trp Tyr Asp Arg 
                325                 330                 335     


Asp Glu Gly Arg Asp Met Gly Ala Leu Tyr Gln Gln Asn Thr Leu Arg 
            340                 345                 350         


Asp Ala Val Ala Ala Gly Ile Asn Leu Asn Ile Phe His Lys Tyr Ala 
        355                 360                 365             


Asp Arg Val Arg Met Ala Asn Ile Ala Gln Met Val Asn Val Leu Gln 
    370                 375                 380                 


Ala Met Val Leu Thr Asp Lys Glu Lys Met Val Leu Thr Pro Thr Tyr 
385                 390                 395                 400 


His Val Phe Arg Met Tyr Arg Val His Gln Gly Ala Thr Leu Ile Pro 
                405                 410                 415     


Val Glu Val Ser Ala Pro Gln Tyr Thr Leu Gly Gly Ala Ser Val Pro 
            420                 425                 430         


Ser Leu Ser Val Ser Ala Ser Arg Asp Gly Glu Gly Arg Val His Leu 
        435                 440                 445             


Ser Ile Val Asn Leu Asp Pro Ala Arg Ala Ala Glu Ile Asp Ala Asn 
    450                 455                 460                 


Gly Pro Phe Ser Ser Val Lys Gly Glu Val Leu Thr Ala Pro Ala Val 
465                 470                 475                 480 


Asn Ala Leu Asn Thr Phe Asp His Pro Asp Ser Val Lys Pro Val Ser 
                485                 490                 495     


Phe Asn Gly Tyr Lys Leu Glu Gly Ser Lys Leu Ile Leu Asn Ile Pro 
            500                 505                 510         


Ala Lys Ser Val Val Val Leu Glu Leu Gly Pro Gln Lys Gln Ala Thr 
        515                 520                 525             


Leu Lys Asp Ala Phe Lys Asn Asp Phe Met Ile Gly Ala Ala Leu Asn 
    530                 535                 540                 


Arg Arg Gln Phe Phe Glu Glu Asp Ala Arg Gly Ala Glu Ile Val Arg 
545                 550                 555                 560 


Met His Phe Asn Ser Ile Thr Pro Glu Asn Val Leu Lys Trp Gly Leu 
                565                 570                 575     


Val His Pro Glu Pro Asn Lys Tyr Asp Phe Thr Ala Pro Asp Arg Phe 
            580                 585                 590         


Val Glu Phe Gly Glu Lys His Gly Met Phe Val Val Gly His Thr Leu 
        595                 600                 605             


Val Trp His Asn Gln Thr Pro Arg Trp Val Phe Glu Asp Glu Lys Lys 
    610                 615                 620                 


Gln Pro Leu Asp Arg Glu Thr Leu Leu Lys Arg Met Arg Asp His Ile 
625                 630                 635                 640 


Phe Thr Val Val Gly Arg Tyr Lys Gly Arg Ile Lys Gly Trp Asp Val 
                645                 650                 655     


Val Asn Glu Ala Leu Asn Gln Asp Gly Thr Met Arg Gln Ser Pro Trp 
            660                 665                 670         


Phe Lys Ile Ile Gly Glu Asp Tyr Leu Val Lys Ala Phe Glu Phe Ala 
        675                 680                 685             


His Glu Ala Asp Pro Ala Ala Glu Leu Tyr Tyr Asn Asp Tyr Asp Leu 
    690                 695                 700                 


Glu Leu Pro Ala Lys Arg Ala Gly Ala Val Glu Leu Leu Lys Lys Leu 
705                 710                 715                 720 


Lys Ala Ala Gly Val Ser Leu Ala Gly Val Gly Leu Gln Asn His Ser 
                725                 730                 735     


Leu Met Glu Trp Pro Ser Ala Ala Asp Val Asp Ala Thr Ile Ala Ala 
            740                 745                 750         


Phe Ala Asn Leu Gly Leu Lys Val His Ile Thr Glu Leu Asp Val Asp 
        755                 760                 765             


Val Leu Pro Arg Thr Thr Lys Pro Gly Ala Asp Tyr Ala Val Asp Val 
    770                 775                 780                 


Lys Val Thr Pro Gln Leu Asn Pro Tyr Leu Asp Gly Leu Pro Glu Ala 
785                 790                 795                 800 


Arg Gln Ser Ala Leu Ala Arg Arg Tyr Ala Glu Leu Phe His Val Phe 
                805                 810                 815     


Arg Lys His Arg Asp Ala Ile Glu Arg Val Thr Phe Trp Gly Val Ala 
            820                 825                 830         


Asp Gly Asp Ser Trp Leu Asn Asn Trp Pro Ile Arg Gly Arg Thr Asn 
        835                 840                 845             


Tyr Pro Leu Leu Phe Asp Arg Ser Gly Gln Pro Lys Pro Ala Leu Ala 
    850                 855                 860                 


Ser Val Ile Glu Thr Ala Asn Tyr Ser Thr Glu Arg Arg Arg 
865                 870                 875             


<210> 93
<211> 1362
<212> DNA
<213> Bacteria

<400> 93
atgcttcagt ttccgaaaga ttttatttgg ggagctgcaa cttcatcgta tcaaattgaa     60

ggaacagcga ctggagaaga taaaatttac tcgatctggg atcacttttc ccgcattcct    120

ggcaaagtag cgaatgggga taatggcgat atcgcaattg atcattacaa tcgttatgtt    180

gaagacatcg cattaatgaa agcgcttcat ttgaaagcgt atcgattttc gactagttgg    240

gcgagacttt attgtgaaac gccagggaag tttaacgaaa aaggtttaga tttttataag    300

cgtcttgtac atgaattgct agagaacggt atcgagccaa tgttgaccat ttatcattgg    360

gatatgccac aagctcttca agagaaaggt ggctgggaaa atcgtgatat cgttcactac    420

ttccaagaat acgctgcttt cctttacgag aatcttgggg atgtcgtgaa aaaatggatt    480

acgcataatg agccgtgggt tgtcacctat ttaggatatg ggaatggcga acatgcccca    540

gggattcaaa actttacatc atttttaaaa gcagcacatc atgttcttct ctcacacggg    600

gaagcggtaa aagcgtttcg agcaatcggt tcgaaagatg gggaaattgg tattacgttg    660

aatttgacac ctggatatgc ggtcgatccg aaagatgaaa aagcagttga tgccgctcga    720

aaatgggacg gctttatgaa tcgttggttt ttagatcctg tatttaaggg acaatatcca    780

gcagatatgt tagaagtgta taaagattat ttaccagacg tttacaaaga gggagattta    840

caaacgattc agcaaccgat cgactttttc ggatttaact attattcaac agcaacatta    900

aaagattgga aaacaggtga ccgtgaaccg atcgtatttg aacatgtgag cacaggaaga    960

cctgtgacgg atatgaattg ggaagtgaat ccaaacggtt tgtttgattt aatggtgcga   1020

ttgaaaaaag attatggcga tattccatta tacattaccg aaaacggtgc tgcatacaaa   1080

gatcgcgtca acgaacaagg tgaagtagaa gatgatgagc gagttgctta tatacgggag   1140

catttaatcg cttgccaccg cgcgattgaa caaggcgtca atttaaaagg atattatgta   1200

tggtcgctgt tcgataattt tgagtgggca tttggatatg ataagcgctt tgggattgta   1260

tacgtggatt atgaaacgct agagcgcatc ccgaaaaaga gtgcattatg gtacaaggaa   1320

acgattataa acaacggatt gcaagtagac aatgacaaat aa                      1362

<210> 94
<211> 453
<212> PRT
<213> Bacteria

<220> 
<221> DOMAIN
<222> (1)...(447)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (8)...(22)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (186)...(189)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (355)...(363)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 94
Met Leu Gln Phe Pro Lys Asp Phe Ile Trp Gly Ala Ala Thr Ser Ser 
1               5                   10                  15      


Tyr Gln Ile Glu Gly Thr Ala Thr Gly Glu Asp Lys Ile Tyr Ser Ile 
            20                  25                  30          


Trp Asp His Phe Ser Arg Ile Pro Gly Lys Val Ala Asn Gly Asp Asn 
        35                  40                  45              


Gly Asp Ile Ala Ile Asp His Tyr Asn Arg Tyr Val Glu Asp Ile Ala 
    50                  55                  60                  


Leu Met Lys Ala Leu His Leu Lys Ala Tyr Arg Phe Ser Thr Ser Trp 
65                  70                  75                  80  


Ala Arg Leu Tyr Cys Glu Thr Pro Gly Lys Phe Asn Glu Lys Gly Leu 
                85                  90                  95      


Asp Phe Tyr Lys Arg Leu Val His Glu Leu Leu Glu Asn Gly Ile Glu 
            100                 105                 110         


Pro Met Leu Thr Ile Tyr His Trp Asp Met Pro Gln Ala Leu Gln Glu 
        115                 120                 125             


Lys Gly Gly Trp Glu Asn Arg Asp Ile Val His Tyr Phe Gln Glu Tyr 
    130                 135                 140                 


Ala Ala Phe Leu Tyr Glu Asn Leu Gly Asp Val Val Lys Lys Trp Ile 
145                 150                 155                 160 


Thr His Asn Glu Pro Trp Val Val Thr Tyr Leu Gly Tyr Gly Asn Gly 
                165                 170                 175     


Glu His Ala Pro Gly Ile Gln Asn Phe Thr Ser Phe Leu Lys Ala Ala 
            180                 185                 190         


His His Val Leu Leu Ser His Gly Glu Ala Val Lys Ala Phe Arg Ala 
        195                 200                 205             


Ile Gly Ser Lys Asp Gly Glu Ile Gly Ile Thr Leu Asn Leu Thr Pro 
    210                 215                 220                 


Gly Tyr Ala Val Asp Pro Lys Asp Glu Lys Ala Val Asp Ala Ala Arg 
225                 230                 235                 240 


Lys Trp Asp Gly Phe Met Asn Arg Trp Phe Leu Asp Pro Val Phe Lys 
                245                 250                 255     


Gly Gln Tyr Pro Ala Asp Met Leu Glu Val Tyr Lys Asp Tyr Leu Pro 
            260                 265                 270         


Asp Val Tyr Lys Glu Gly Asp Leu Gln Thr Ile Gln Gln Pro Ile Asp 
        275                 280                 285             


Phe Phe Gly Phe Asn Tyr Tyr Ser Thr Ala Thr Leu Lys Asp Trp Lys 
    290                 295                 300                 


Thr Gly Asp Arg Glu Pro Ile Val Phe Glu His Val Ser Thr Gly Arg 
305                 310                 315                 320 


Pro Val Thr Asp Met Asn Trp Glu Val Asn Pro Asn Gly Leu Phe Asp 
                325                 330                 335     


Leu Met Val Arg Leu Lys Lys Asp Tyr Gly Asp Ile Pro Leu Tyr Ile 
            340                 345                 350         


Thr Glu Asn Gly Ala Ala Tyr Lys Asp Arg Val Asn Glu Gln Gly Glu 
        355                 360                 365             


Val Glu Asp Asp Glu Arg Val Ala Tyr Ile Arg Glu His Leu Ile Ala 
    370                 375                 380                 


Cys His Arg Ala Ile Glu Gln Gly Val Asn Leu Lys Gly Tyr Tyr Val 
385                 390                 395                 400 


Trp Ser Leu Phe Asp Asn Phe Glu Trp Ala Phe Gly Tyr Asp Lys Arg 
                405                 410                 415     


Phe Gly Ile Val Tyr Val Asp Tyr Glu Thr Leu Glu Arg Ile Pro Lys 
            420                 425                 430         


Lys Ser Ala Leu Trp Tyr Lys Glu Thr Ile Ile Asn Asn Gly Leu Gln 
        435                 440                 445             


Val Asp Asn Asp Lys 
    450             


<210> 95
<211> 2163
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 95
atgaaacatc acaactataa cgcgcatcat tcgccaatcg gcgctttcgg ctcattcacg     60

ctcggttttc gtggtgctca gggcggcctc ggactggagt taggcggccc ggccaatcac    120

aacatgtaca tcggagtgga agacgagcag cgcaccttcc attgccttcc cttttttggg    180

gatgctgctg caggggccga ggaagcactg cgctacgatg tggaaggcag ccaatccagt    240

gacgatccac tggccggcgc ctatgtcgga cacccagagg atgcgccatc gctgcctccg    300

gccaagttgc gtgcgctgga ccaaagcgcc atctcacggg attttcaact tacgaccgac    360

acctggacag caccggattt ctcactcacg atctattcgc cggtacgcgg cgtgcccgat    420

ccgacaacgg ctgcggaaga cgaattgaaa gccattcttg tgcccgctgt actctgcgag    480

ttgacggtgg ataactcgag tgggcagcag tctcggcgtg cgctctttgg tttcaccggg    540

aacgatcctt attggggaac gcggcgcctt gatgatgtag cgaatagtgc gttcgtgggg    600

gttggcgagg gaaatcatct ggccattgcg tcacgagatg aaggagtgac ggcggcgctg    660

ggcttcaaca tcaatggcgt tatcaacgag actttgcctg agaattacgc ctttggtctg    720

ggcaaatgcg cagttttgct ctgcgaggtg cctgccggtg aaaagcgcac gttccatatc    780

gccgtctgtt ttcatcggag cggcatcgcc accaccggtt tgaagatgcg ctattattac    840

acgcgctttt tccctgacat cgaaagcgta gccgcttatg cactggagca gttcgattct    900

ctcaaaagtg cagctctcca agacaatcaa ctagtggaga acgcgtcgct ttcagaagac    960

cagaaatgga tgttctgcca cgcggtgcgc tcgtactatg gctcaacgga gttgctggag   1020

tataacgaca atccggtgtg ggtggtcaac gaaggcgaat atcgtatgat gaacaccttc   1080

gacttgacgg tggatcatct ctactgggaa ctgcgcctga atccctgggt tgtgaaaaat   1140

cagctcgact ggtttgtgga tcgctactcg tatgaggaca aggtgcgctt tcctggtgac   1200

aaaactgagt acccatgcgg tctctccttc acgcacgata tgggcgtgac gaatgtgtgg   1260

tcgcgccccg gctattcgtc ttatgagaag cagggactca agggtgtctt ttcgtatatg   1320

acgcacgagc aactcgtcaa ctggctctgc tgcgccacgg tgtatgtgga acagaccggt   1380

gaccaggagt ggcttgaaca acggtggccg attttcaaca ggtgctttga gagtttgctc   1440

aaccgcgatc accccgatcc tgaaaagcgg cgcggcttaa tgcaactcga ttcgacccgt   1500

tgcgccggtg gtgcggagat caccacttac gacagccttg atgtctcgct ggggcagtca   1560

cgcaacaaca cctatctggg tgggaaaatc tgggcgagct atctggcact cgaaaaattg   1620

ttccgcgagc gcggcgacgt ggaaagagcg caagtggcgc atcaacaggc gcatcgcacg   1680

gcgcaaacgc tgctggagaa tgtcggcgag aatgggacga ttcccgccgt actcgaaggc   1740

agcaatcagt cgagaatcat tcccgtgatt gaggggttga tcttccccta cttcaccggg   1800

cgcaaggatg tcctcagttc cgatggcgac ttcggcgaga tgttttcggc actcaagcgc   1860

catcttgaag ccgtgctaaa acccggtatc tgtctgtttg aagatggtgg ctggaagtta   1920

agttcaacct ctgataactc gtggctgagc aaaatctacc tgtgccagtt cgtcgcgcgc   1980

cagatattgg gccgtgaacg cgatgacatt gacaagcgcg ccgatgccgc gcacgtgggc   2040

tggctgctcg atgagcgcaa tgcctacttc gcgtggagtg accagatgct ggccggcttt   2100

gcggaaggct ccaagtacta cccgcgcggt gtgaccagcg cattgtggtt gctggaaggc   2160

tga                                                                 2163

<210> 96
<211> 720
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (19)...(456)
<223> Glycosyl hydrolase family 52

<220> 
<221> SITE
<222> (167)...(170)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (197)...(200)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (232)...(235)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (318)...(321)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (530)...(533)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (579)...(582)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (590)...(593)
<223> N-glycosylation site. Prosite id = PS00001

<400> 96
Met Lys His His Asn Tyr Asn Ala His His Ser Pro Ile Gly Ala Phe 
1               5                   10                  15      


Gly Ser Phe Thr Leu Gly Phe Arg Gly Ala Gln Gly Gly Leu Gly Leu 
            20                  25                  30          


Glu Leu Gly Gly Pro Ala Asn His Asn Met Tyr Ile Gly Val Glu Asp 
        35                  40                  45              


Glu Gln Arg Thr Phe His Cys Leu Pro Phe Phe Gly Asp Ala Ala Ala 
    50                  55                  60                  


Gly Ala Glu Glu Ala Leu Arg Tyr Asp Val Glu Gly Ser Gln Ser Ser 
65                  70                  75                  80  


Asp Asp Pro Leu Ala Gly Ala Tyr Val Gly His Pro Glu Asp Ala Pro 
                85                  90                  95      


Ser Leu Pro Pro Ala Lys Leu Arg Ala Leu Asp Gln Ser Ala Ile Ser 
            100                 105                 110         


Arg Asp Phe Gln Leu Thr Thr Asp Thr Trp Thr Ala Pro Asp Phe Ser 
        115                 120                 125             


Leu Thr Ile Tyr Ser Pro Val Arg Gly Val Pro Asp Pro Thr Thr Ala 
    130                 135                 140                 


Ala Glu Asp Glu Leu Lys Ala Ile Leu Val Pro Ala Val Leu Cys Glu 
145                 150                 155                 160 


Leu Thr Val Asp Asn Ser Ser Gly Gln Gln Ser Arg Arg Ala Leu Phe 
                165                 170                 175     


Gly Phe Thr Gly Asn Asp Pro Tyr Trp Gly Thr Arg Arg Leu Asp Asp 
            180                 185                 190         


Val Ala Asn Ser Ala Phe Val Gly Val Gly Glu Gly Asn His Leu Ala 
        195                 200                 205             


Ile Ala Ser Arg Asp Glu Gly Val Thr Ala Ala Leu Gly Phe Asn Ile 
    210                 215                 220                 


Asn Gly Val Ile Asn Glu Thr Leu Pro Glu Asn Tyr Ala Phe Gly Leu 
225                 230                 235                 240 


Gly Lys Cys Ala Val Leu Leu Cys Glu Val Pro Ala Gly Glu Lys Arg 
                245                 250                 255     


Thr Phe His Ile Ala Val Cys Phe His Arg Ser Gly Ile Ala Thr Thr 
            260                 265                 270         


Gly Leu Lys Met Arg Tyr Tyr Tyr Thr Arg Phe Phe Pro Asp Ile Glu 
        275                 280                 285             


Ser Val Ala Ala Tyr Ala Leu Glu Gln Phe Asp Ser Leu Lys Ser Ala 
    290                 295                 300                 


Ala Leu Gln Asp Asn Gln Leu Val Glu Asn Ala Ser Leu Ser Glu Asp 
305                 310                 315                 320 


Gln Lys Trp Met Phe Cys His Ala Val Arg Ser Tyr Tyr Gly Ser Thr 
                325                 330                 335     


Glu Leu Leu Glu Tyr Asn Asp Asn Pro Val Trp Val Val Asn Glu Gly 
            340                 345                 350         


Glu Tyr Arg Met Met Asn Thr Phe Asp Leu Thr Val Asp His Leu Tyr 
        355                 360                 365             


Trp Glu Leu Arg Leu Asn Pro Trp Val Val Lys Asn Gln Leu Asp Trp 
    370                 375                 380                 


Phe Val Asp Arg Tyr Ser Tyr Glu Asp Lys Val Arg Phe Pro Gly Asp 
385                 390                 395                 400 


Lys Thr Glu Tyr Pro Cys Gly Leu Ser Phe Thr His Asp Met Gly Val 
                405                 410                 415     


Thr Asn Val Trp Ser Arg Pro Gly Tyr Ser Ser Tyr Glu Lys Gln Gly 
            420                 425                 430         


Leu Lys Gly Val Phe Ser Tyr Met Thr His Glu Gln Leu Val Asn Trp 
        435                 440                 445             


Leu Cys Cys Ala Thr Val Tyr Val Glu Gln Thr Gly Asp Gln Glu Trp 
    450                 455                 460                 


Leu Glu Gln Arg Trp Pro Ile Phe Asn Arg Cys Phe Glu Ser Leu Leu 
465                 470                 475                 480 


Asn Arg Asp His Pro Asp Pro Glu Lys Arg Arg Gly Leu Met Gln Leu 
                485                 490                 495     


Asp Ser Thr Arg Cys Ala Gly Gly Ala Glu Ile Thr Thr Tyr Asp Ser 
            500                 505                 510         


Leu Asp Val Ser Leu Gly Gln Ser Arg Asn Asn Thr Tyr Leu Gly Gly 
        515                 520                 525             


Lys Ile Trp Ala Ser Tyr Leu Ala Leu Glu Lys Leu Phe Arg Glu Arg 
    530                 535                 540                 


Gly Asp Val Glu Arg Ala Gln Val Ala His Gln Gln Ala His Arg Thr 
545                 550                 555                 560 


Ala Gln Thr Leu Leu Glu Asn Val Gly Glu Asn Gly Thr Ile Pro Ala 
                565                 570                 575     


Val Leu Glu Gly Ser Asn Gln Ser Arg Ile Ile Pro Val Ile Glu Gly 
            580                 585                 590         


Leu Ile Phe Pro Tyr Phe Thr Gly Arg Lys Asp Val Leu Ser Ser Asp 
        595                 600                 605             


Gly Asp Phe Gly Glu Met Phe Ser Ala Leu Lys Arg His Leu Glu Ala 
    610                 615                 620                 


Val Leu Lys Pro Gly Ile Cys Leu Phe Glu Asp Gly Gly Trp Lys Leu 
625                 630                 635                 640 


Ser Ser Thr Ser Asp Asn Ser Trp Leu Ser Lys Ile Tyr Leu Cys Gln 
                645                 650                 655     


Phe Val Ala Arg Gln Ile Leu Gly Arg Glu Arg Asp Asp Ile Asp Lys 
            660                 665                 670         


Arg Ala Asp Ala Ala His Val Gly Trp Leu Leu Asp Glu Arg Asn Ala 
        675                 680                 685             


Tyr Phe Ala Trp Ser Asp Gln Met Leu Ala Gly Phe Ala Glu Gly Ser 
    690                 695                 700                 


Lys Tyr Tyr Pro Arg Gly Val Thr Ser Ala Leu Trp Leu Leu Glu Gly 
705                 710                 715                 720 


<210> 97
<211> 1413
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 97
atgcgctata catggtcggt cgcggcggcg ctgctgccat gcgcaatcca ggctcagcaa     60

accctctatg gacaatgtgg tggtcagggc tactccggac tcaccagctg cgtggcggga    120

gcaacatgct ccaccgtaaa tgaatactac gctcagtgta cgccagcagc aggcagcgcc    180

acttccacca ccttgaagac aactacgacc accgctgggg cgacgacgac gacgactagc    240

aagacttctg cttcccagac gtctactact aaaacctcaa ccagtaccgc ctcaacaacc    300

acggctacaa ccacggccag cgcgagcggc aacccgttca gtgggtacca gctctacgtg    360

aacccctact actcctccga agtggcctcc ctggctatcc catccctcac ggggacactt    420

tcctcgctcc aggctgcagc cacagccgca gccaaggtgc cctctttcgt ctggctggac    480

gtggctgcca aggtgccgac gatggccacc tacctggccg acatcaaagc ccagaatgca    540

gcgggagcca acccccccgt cgccggccag tttgtggtct acgacctccc tgaccgcgac    600

tgcgccgcgc tggccagcaa cggcgagtac tccatcgcca acaacggtgt ggccaactac    660

aaggcctaca tcgactccat ccgcaaggtc ctggtgcagt actcggatgt gcacaccatt    720

ctggtgatcg agcccgacag tctcgccaac ctggtgacca acctcaatgt ggccaaatgt    780

gccaacgctc agagcgccta cctcgaatgc accaactatg ccctggagca gctgaacctc    840

cccaacgtgg ccatgtatct tgatgccgga cacgccggct ggctcggctg gcccgcgaac    900

cagcaaccgg ccgccaatct gtacgcgagc gtgtacaaga acgccagctc gcccgccgca    960

gtgcgcggcc tggccacgaa cgtcgccaac tacaacgcct tcaccatcgc ctcgtgcccg   1020

tcgtacaccc agggcaacag cgtctgcgac gagcagcagt acatcaacgc gatcgccccg   1080

ctcctgtcag cgcagggctt caacgcccac ttcatcgtcg acaccggccg caacggcaaa   1140

cagcccaccg gccaacaagc ctggggcgac tggtgcaacg tcatcaacac ggggttcggc   1200

gtgcgcccga ccaccaacac gggcgacgcg ctcgtcgacg ccttcgtctg ggtcaagccc   1260

ggcggcgaga gcgacggcac ctccgatagc tcggcgaccc gctacgacgc ccactgcggg   1320

tacagcgatg ccttgcagcc ggcgccggag gcggggacct ggttccaggc ctacttcgta   1380

caattgctct cgaacgccaa tccggctttc tag                                1413

<210> 98
<211> 470
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (22)...(50)
<223> Fungal cellulose binding domain

<220> 
<221> DOMAIN
<222> (120)...(437)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (26)...(53)
<223> Cellulose-binding domain, fungal type. Prosite id = PS00562

<220> 
<221> SITE
<222> (243)...(252)
<223> Glycosyl hydrolases family 6 signature 2. Prosite id = PS00656

<220> 
<221> SITE
<222> (318)...(321)
<223> N-glycosylation site. Prosite id = PS00001

<400> 98
Met Arg Tyr Thr Trp Ser Val Ala Ala Ala Leu Leu Pro Cys Ala Ile 
1               5                   10                  15      


Gln Ala Gln Gln Thr Leu Tyr Gly Gln Cys Gly Gly Gln Gly Tyr Ser 
            20                  25                  30          


Gly Leu Thr Ser Cys Val Ala Gly Ala Thr Cys Ser Thr Val Asn Glu 
        35                  40                  45              


Tyr Tyr Ala Gln Cys Thr Pro Ala Ala Gly Ser Ala Thr Ser Thr Thr 
    50                  55                  60                  


Leu Lys Thr Thr Thr Thr Thr Ala Gly Ala Thr Thr Thr Thr Thr Ser 
65                  70                  75                  80  


Lys Thr Ser Ala Ser Gln Thr Ser Thr Thr Lys Thr Ser Thr Ser Thr 
                85                  90                  95      


Ala Ser Thr Thr Thr Ala Thr Thr Thr Ala Ser Ala Ser Gly Asn Pro 
            100                 105                 110         


Phe Ser Gly Tyr Gln Leu Tyr Val Asn Pro Tyr Tyr Ser Ser Glu Val 
        115                 120                 125             


Ala Ser Leu Ala Ile Pro Ser Leu Thr Gly Thr Leu Ser Ser Leu Gln 
    130                 135                 140                 


Ala Ala Ala Thr Ala Ala Ala Lys Val Pro Ser Phe Val Trp Leu Asp 
145                 150                 155                 160 


Val Ala Ala Lys Val Pro Thr Met Ala Thr Tyr Leu Ala Asp Ile Lys 
                165                 170                 175     


Ala Gln Asn Ala Ala Gly Ala Asn Pro Pro Val Ala Gly Gln Phe Val 
            180                 185                 190         


Val Tyr Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly 
        195                 200                 205             


Glu Tyr Ser Ile Ala Asn Asn Gly Val Ala Asn Tyr Lys Ala Tyr Ile 
    210                 215                 220                 


Asp Ser Ile Arg Lys Val Leu Val Gln Tyr Ser Asp Val His Thr Ile 
225                 230                 235                 240 


Leu Val Ile Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Asn 
                245                 250                 255     


Val Ala Lys Cys Ala Asn Ala Gln Ser Ala Tyr Leu Glu Cys Thr Asn 
            260                 265                 270         


Tyr Ala Leu Glu Gln Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp 
        275                 280                 285             


Ala Gly His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gln Gln Pro Ala 
    290                 295                 300                 


Ala Asn Leu Tyr Ala Ser Val Tyr Lys Asn Ala Ser Ser Pro Ala Ala 
305                 310                 315                 320 


Val Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Ala Phe Thr Ile 
                325                 330                 335     


Ala Ser Cys Pro Ser Tyr Thr Gln Gly Asn Ser Val Cys Asp Glu Gln 
            340                 345                 350         


Gln Tyr Ile Asn Ala Ile Ala Pro Leu Leu Ser Ala Gln Gly Phe Asn 
        355                 360                 365             


Ala His Phe Ile Val Asp Thr Gly Arg Asn Gly Lys Gln Pro Thr Gly 
    370                 375                 380                 


Gln Gln Ala Trp Gly Asp Trp Cys Asn Val Ile Asn Thr Gly Phe Gly 
385                 390                 395                 400 


Val Arg Pro Thr Thr Asn Thr Gly Asp Ala Leu Val Asp Ala Phe Val 
                405                 410                 415     


Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr Ser Asp Ser Ser Ala 
            420                 425                 430         


Thr Arg Tyr Asp Ala His Cys Gly Tyr Ser Asp Ala Leu Gln Pro Ala 
        435                 440                 445             


Pro Glu Ala Gly Thr Trp Phe Gln Ala Tyr Phe Val Gln Leu Leu Ser 
    450                 455                 460                 


Asn Ala Asn Pro Ala Phe 
465                 470 


<210> 99
<211> 1041
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 99
atgatcagtc tcaaacgagt ggcggcgctc ctgtgcgtcg caggtctggg catgtctgcg     60

gcaaacgcgc agacctgcct cacgtcgagt caaaccggca ctaacaatgg cttctattat    120

tccttctgga aggacagtcc gggcacggtg aatttttgcc tgcagtccgg cggccgttac    180

acatcgaact ggagcggcat caacaactgg gtgggcggca agggatggca gaccggttca    240

cgccggaaca tcacgtactc gggcagcttc aattcaccgg gcaacggcta cctggcgctt    300

tacggatgga ccaccaatcc actcgtcgag tactacgtcg tcgatagctg ggggagctgg    360

cgtccgccgg gttcggacgg aacgttcctg gggacggtca acagcgatgg cggaacgtat    420

gacatctatc gcgcgcagcg ggtcaacgcg ccgtccatca tcggcaacgc cacgttctat    480

caatactgga gcgttcggca gtcgaagcgg gtaggtggga cgatcaccac cggaaaccac    540

ttcgacgcgt gggccagcgt gggcctgaac ctgggcactc acaactacca gatcatggcg    600

accgagggct accaaagcag cggcagctcc gacatcacgg tgagtgaagg cggtagcagc    660

agtggtggcg gaagcagcac gagcagcagc agcggcggtg gtggcaccaa gagcttcacg    720

gttcgtgcgc gcggtaccgc gggcggtgag tccatcacgc tgcgcgtgaa caaccagaac    780

gtgcagacct ggacgctggg caccagcatg acgaactaca cggcgtcgac ttcactgagc    840

ggcggcatca ccgtggtgta cacgaacgac agcggtaacc gcgacgtgca ggtggactac    900

atcgtcgtga acggccagac gcgccagtcc gaagcccaga gctacaacac cggcctttat    960

gcgaacgggc gttgcggcgg tggctccaac agcgaatgga tgcattgcaa cggcgccatc   1020

ggctacggca atacaccgta a                                             1041

<210> 100
<211> 346
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (33)...(214)
<223> Glycosyl hydrolases family 11

<220> 
<221> SITE
<222> (63)...(66)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (84)...(87)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (108)...(118)
<223> Glycosyl hydrolases family 11 active site signature 1. Prosite id = PS00776

<220> 
<221> SITE
<222> (158)...(161)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (202)...(213)
<223> Glycosyl hydrolases family 11 active site signature 2. Prosite id = PS00777

<220> 
<221> SITE
<222> (276)...(279)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (293)...(296)
<223> N-glycosylation site. Prosite id = PS00001

<400> 100
Met Ile Ser Leu Lys Arg Val Ala Ala Leu Leu Cys Val Ala Gly Leu 
1               5                   10                  15      


Gly Met Ser Ala Ala Asn Ala Gln Thr Cys Leu Thr Ser Ser Gln Thr 
            20                  25                  30          


Gly Thr Asn Asn Gly Phe Tyr Tyr Ser Phe Trp Lys Asp Ser Pro Gly 
        35                  40                  45              


Thr Val Asn Phe Cys Leu Gln Ser Gly Gly Arg Tyr Thr Ser Asn Trp 
    50                  55                  60                  


Ser Gly Ile Asn Asn Trp Val Gly Gly Lys Gly Trp Gln Thr Gly Ser 
65                  70                  75                  80  


Arg Arg Asn Ile Thr Tyr Ser Gly Ser Phe Asn Ser Pro Gly Asn Gly 
                85                  90                  95      


Tyr Leu Ala Leu Tyr Gly Trp Thr Thr Asn Pro Leu Val Glu Tyr Tyr 
            100                 105                 110         


Val Val Asp Ser Trp Gly Ser Trp Arg Pro Pro Gly Ser Asp Gly Thr 
        115                 120                 125             


Phe Leu Gly Thr Val Asn Ser Asp Gly Gly Thr Tyr Asp Ile Tyr Arg 
    130                 135                 140                 


Ala Gln Arg Val Asn Ala Pro Ser Ile Ile Gly Asn Ala Thr Phe Tyr 
145                 150                 155                 160 


Gln Tyr Trp Ser Val Arg Gln Ser Lys Arg Val Gly Gly Thr Ile Thr 
                165                 170                 175     


Thr Gly Asn His Phe Asp Ala Trp Ala Ser Val Gly Leu Asn Leu Gly 
            180                 185                 190         


Thr His Asn Tyr Gln Ile Met Ala Thr Glu Gly Tyr Gln Ser Ser Gly 
        195                 200                 205             


Ser Ser Asp Ile Thr Val Ser Glu Gly Gly Ser Ser Ser Gly Gly Gly 
    210                 215                 220                 


Ser Ser Thr Ser Ser Ser Ser Gly Gly Gly Gly Thr Lys Ser Phe Thr 
225                 230                 235                 240 


Val Arg Ala Arg Gly Thr Ala Gly Gly Glu Ser Ile Thr Leu Arg Val 
                245                 250                 255     


Asn Asn Gln Asn Val Gln Thr Trp Thr Leu Gly Thr Ser Met Thr Asn 
            260                 265                 270         


Tyr Thr Ala Ser Thr Ser Leu Ser Gly Gly Ile Thr Val Val Tyr Thr 
        275                 280                 285             


Asn Asp Ser Gly Asn Arg Asp Val Gln Val Asp Tyr Ile Val Val Asn 
    290                 295                 300                 


Gly Gln Thr Arg Gln Ser Glu Ala Gln Ser Tyr Asn Thr Gly Leu Tyr 
305                 310                 315                 320 


Ala Asn Gly Arg Cys Gly Gly Gly Ser Asn Ser Glu Trp Met His Cys 
                325                 330                 335     


Asn Gly Ala Ile Gly Tyr Gly Asn Thr Pro 
            340                 345     


<210> 101
<211> 1725
<212> DNA
<213> Clostridium thermocellum

<400> 101
gtgtggaagc ccggattgtg gaatttcctt caaatggcag atgaagccgg attgacgagg     60

gatggaaaca ctccggttcc gacacccagt ccaaagccgg ctaacacacg tattgaagcg    120

gaagattatg acggtattaa ttcttcaagt attgagataa taggtgttcc acctgaagga    180

ggcagaggaa taggttatat taccagtggt gattatctgg tatacaagag tatagacttt    240

ggaaacggag caacgtcgtt taaggccaag gttgcaaatg caaatacttc caatattgaa    300

cttagattaa acggtccgaa tggtactctc ataggcacac tctcggtaaa atccacagga    360

gattggaata catatgagga gcaaacttgc agcattagca aagtcaccgg aataaatgat    420

ttgtacttgg tattcaaagg ccctgtaaac atagactggt tcacttttgg cgttgaaagc    480

agttccacag gtctggggga tttaaatggt gacggaaata ttaactcgtc ggaccttcag    540

gcgttaaaga ggcatttgct cggtatatca ccgcttacgg gagaggctct tttaagagcg    600

gatgtaaata ggagcggcaa agtggattct actgactatt cagtgctgaa aagatatata    660

ctccgcatta ttacagagtt ccccggacaa ggtgatgtac agacacccaa tccgtctgtt    720

actccgacac aaactcctat ccccacgatt tcgggaaatg ctcttaggga ttatgcggag    780

gcaaggggaa taaaaatcgg aacatgtgtc aactatccgt tttacaacaa ttcagatcca    840

acctacaaca gcattttgca aagagaattt tcaatggttg tatgtgaaaa tgaaatgaag    900

tttgatgctt tgcagccgag acaaaacgtt tttgattttt cgaaaggaga ccagttgctt    960

gcttttgcag aaagaaacgg tatgcagatg aggggacata cgttgatttg gcacaatcaa   1020

aacccgtcat ggcttacaaa cggtaactgg aaccgggatt cgctgcttgc ggtaatgaaa   1080

aatcacatta ccactgttat gacccattac aaaggtaaaa ttgttgagtg ggatgtggca   1140

aacgaatgta tggatgattc cggcaacggc ttaagaagca gcatatggag aaatgtaatc   1200

ggtcaggact accttgacta tgctttcagg tatgcaagag aagcagatcc cgatgcactt   1260

cttttctaca atgattataa tattgaagac ttgggtccaa agtccaatgc ggtatttaac   1320

atgattaaaa gtatgaagga aagaggtgtg ccgattgacg gagtaggatt ccaatgccac   1380

tttatcaatg gaatgagccc cgagtacctt gccagcattg atcaaaatat taagagatat   1440

gcggaaatag gcgttatagt atcctttacc gaaatagata tacgcatacc tcagtcggaa   1500

aacccggcaa ctgcattcca ggtacaggca aacaactata aggaacttat gaaaatttgt   1560

ctggcaaacc ccaattgcaa tacctttgta atgtggggat tcacagataa atacacatgg   1620

attccgggaa ctttcccagg atatggcaat ccattgattt atgacagcaa ttacaatccg   1680

aaaccggcat acaatgcaat aaaggaagct cttatgggct attga                   1725

<210> 102
<211> 574
<212> PRT
<213> Clostridium thermocellum

<220> 
<221> DOMAIN
<222> (39)...(158)
<223> Carbohydrate binding module (family 6)

<220> 
<221> DOMAIN
<222> (167)...(187)
<223> Dockerin type I repeat

<220> 
<221> DOMAIN
<222> (201)...(221)
<223> Dockerin type I repeat

<220> 
<221> DOMAIN
<222> (254)...(571)
<223> Glycosyl hydrolase family 10

<220> 
<221> SITE
<222> (47)...(50)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (96)...(99)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (108)...(111)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (169)...(181)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (169)...(188)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (177)...(180)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (204)...(216)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (204)...(223)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (206)...(209)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (280)...(283)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (491)...(501)
<223> Glycosyl hydrolases family 10 active site. Prosite id = PS00591

<400> 102
Met Trp Lys Pro Gly Leu Trp Asn Phe Leu Gln Met Ala Asp Glu Ala 
1               5                   10                  15      


Gly Leu Thr Arg Asp Gly Asn Thr Pro Val Pro Thr Pro Ser Pro Lys 
            20                  25                  30          


Pro Ala Asn Thr Arg Ile Glu Ala Glu Asp Tyr Asp Gly Ile Asn Ser 
        35                  40                  45              


Ser Ser Ile Glu Ile Ile Gly Val Pro Pro Glu Gly Gly Arg Gly Ile 
    50                  55                  60                  


Gly Tyr Ile Thr Ser Gly Asp Tyr Leu Val Tyr Lys Ser Ile Asp Phe 
65                  70                  75                  80  


Gly Asn Gly Ala Thr Ser Phe Lys Ala Lys Val Ala Asn Ala Asn Thr 
                85                  90                  95      


Ser Asn Ile Glu Leu Arg Leu Asn Gly Pro Asn Gly Thr Leu Ile Gly 
            100                 105                 110         


Thr Leu Ser Val Lys Ser Thr Gly Asp Trp Asn Thr Tyr Glu Glu Gln 
        115                 120                 125             


Thr Cys Ser Ile Ser Lys Val Thr Gly Ile Asn Asp Leu Tyr Leu Val 
    130                 135                 140                 


Phe Lys Gly Pro Val Asn Ile Asp Trp Phe Thr Phe Gly Val Glu Ser 
145                 150                 155                 160 


Ser Ser Thr Gly Leu Gly Asp Leu Asn Gly Asp Gly Asn Ile Asn Ser 
                165                 170                 175     


Ser Asp Leu Gln Ala Leu Lys Arg His Leu Leu Gly Ile Ser Pro Leu 
            180                 185                 190         


Thr Gly Glu Ala Leu Leu Arg Ala Asp Val Asn Arg Ser Gly Lys Val 
        195                 200                 205             


Asp Ser Thr Asp Tyr Ser Val Leu Lys Arg Tyr Ile Leu Arg Ile Ile 
    210                 215                 220                 


Thr Glu Phe Pro Gly Gln Gly Asp Val Gln Thr Pro Asn Pro Ser Val 
225                 230                 235                 240 


Thr Pro Thr Gln Thr Pro Ile Pro Thr Ile Ser Gly Asn Ala Leu Arg 
                245                 250                 255     


Asp Tyr Ala Glu Ala Arg Gly Ile Lys Ile Gly Thr Cys Val Asn Tyr 
            260                 265                 270         


Pro Phe Tyr Asn Asn Ser Asp Pro Thr Tyr Asn Ser Ile Leu Gln Arg 
        275                 280                 285             


Glu Phe Ser Met Val Val Cys Glu Asn Glu Met Lys Phe Asp Ala Leu 
    290                 295                 300                 


Gln Pro Arg Gln Asn Val Phe Asp Phe Ser Lys Gly Asp Gln Leu Leu 
305                 310                 315                 320 


Ala Phe Ala Glu Arg Asn Gly Met Gln Met Arg Gly His Thr Leu Ile 
                325                 330                 335     


Trp His Asn Gln Asn Pro Ser Trp Leu Thr Asn Gly Asn Trp Asn Arg 
            340                 345                 350         


Asp Ser Leu Leu Ala Val Met Lys Asn His Ile Thr Thr Val Met Thr 
        355                 360                 365             


His Tyr Lys Gly Lys Ile Val Glu Trp Asp Val Ala Asn Glu Cys Met 
    370                 375                 380                 


Asp Asp Ser Gly Asn Gly Leu Arg Ser Ser Ile Trp Arg Asn Val Ile 
385                 390                 395                 400 


Gly Gln Asp Tyr Leu Asp Tyr Ala Phe Arg Tyr Ala Arg Glu Ala Asp 
                405                 410                 415     


Pro Asp Ala Leu Leu Phe Tyr Asn Asp Tyr Asn Ile Glu Asp Leu Gly 
            420                 425                 430         


Pro Lys Ser Asn Ala Val Phe Asn Met Ile Lys Ser Met Lys Glu Arg 
        435                 440                 445             


Gly Val Pro Ile Asp Gly Val Gly Phe Gln Cys His Phe Ile Asn Gly 
    450                 455                 460                 


Met Ser Pro Glu Tyr Leu Ala Ser Ile Asp Gln Asn Ile Lys Arg Tyr 
465                 470                 475                 480 


Ala Glu Ile Gly Val Ile Val Ser Phe Thr Glu Ile Asp Ile Arg Ile 
                485                 490                 495     


Pro Gln Ser Glu Asn Pro Ala Thr Ala Phe Gln Val Gln Ala Asn Asn 
            500                 505                 510         


Tyr Lys Glu Leu Met Lys Ile Cys Leu Ala Asn Pro Asn Cys Asn Thr 
        515                 520                 525             


Phe Val Met Trp Gly Phe Thr Asp Lys Tyr Thr Trp Ile Pro Gly Thr 
    530                 535                 540                 


Phe Pro Gly Tyr Gly Asn Pro Leu Ile Tyr Asp Ser Asn Tyr Asn Pro 
545                 550                 555                 560 


Lys Pro Ala Tyr Asn Ala Ile Lys Glu Ala Leu Met Gly Tyr 
                565                 570                 


<210> 103
<211> 930
<212> DNA
<213> Cochliobolus heterostrophus ATCC 48331

<400> 103
gtgtcgccca agcaggacag ccgtcaaatc cagggtatca aggacccgac gattatccag     60

aacaatggtg tataccatgt ctttgccagc acggccaagg aagcgggata caacctagtc    120

tacttcaact ttaccgactt cagcagggcc aaccaggcgc cattcttcta cctcgaccag    180

tcaggtatcg gcacaggtta ccgtgctgct cctcaagtct tctacttcgc cccccagaag    240

ctctggtacc tcatctacca aaacggcaat gcagcataca gcaccaaccc cgacatttcc    300

aacccacgag gctggaccgc cccgcaagtc ttctacccca acggaacccc ccagacgatc    360

caaaacggcc taggaacgac cggctactgg gtcgacatgt gggtaatctg cgacacggcc    420

ctctgccacc tgtactcatc cgacgacaac ggcggcctat accgcagcca aacgcccgtc    480

tcgcaattcc cacgcggcat gaacgagccc gtggtaacgc tcaaggccaa caaaaacgac    540

ctctttgaag cctcgaccgt gtacaacatt gtaaacacca gcacctacct cctcatggtc    600

gaatgcatcg gctccggcaa ctcccccggc ggcctgcgct acttccgctc ctggaccacc    660

cagtccctca ccagcgacaa gtggactccc cttgccgcat cccagcaaac ccctttcctc    720

ggcgccgcta acacccagtt ccccgccggc cgctggtccc agagcttgtc ccacggcgag    780

ctcgttcgca caaatgtaga ccagaggctc cagattcgcc cctgtgaaat gaggtacctc    840

taccagggta tcgatcctaa tgctacgggc acttacaatg ccctgccctg gaaactcgcc    900

cttgcaaccc agacaaactc caagtgttag                                     930

<210> 104
<211> 309
<212> PRT
<213> Cochliobolus heterostrophus ATCC 48331

<220> 
<221> DOMAIN
<222> (3)...(276)
<223> Glycosyl hydrolase family 62

<220> 
<221> SITE
<222> (43)...(46)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (194)...(197)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 104
Met Ser Pro Lys Gln Asp Ser Arg Gln Ile Gln Gly Ile Lys Asp Pro 
1               5                   10                  15      


Thr Ile Ile Gln Asn Asn Gly Val Tyr His Val Phe Ala Ser Thr Ala 
            20                  25                  30          


Lys Glu Ala Gly Tyr Asn Leu Val Tyr Phe Asn Phe Thr Asp Phe Ser 
        35                  40                  45              


Arg Ala Asn Gln Ala Pro Phe Phe Tyr Leu Asp Gln Ser Gly Ile Gly 
    50                  55                  60                  


Thr Gly Tyr Arg Ala Ala Pro Gln Val Phe Tyr Phe Ala Pro Gln Lys 
65                  70                  75                  80  


Leu Trp Tyr Leu Ile Tyr Gln Asn Gly Asn Ala Ala Tyr Ser Thr Asn 
                85                  90                  95      


Pro Asp Ile Ser Asn Pro Arg Gly Trp Thr Ala Pro Gln Val Phe Tyr 
            100                 105                 110         


Pro Asn Gly Thr Pro Gln Thr Ile Gln Asn Gly Leu Gly Thr Thr Gly 
        115                 120                 125             


Tyr Trp Val Asp Met Trp Val Ile Cys Asp Thr Ala Leu Cys His Leu 
    130                 135                 140                 


Tyr Ser Ser Asp Asp Asn Gly Gly Leu Tyr Arg Ser Gln Thr Pro Val 
145                 150                 155                 160 


Ser Gln Phe Pro Arg Gly Met Asn Glu Pro Val Val Thr Leu Lys Ala 
                165                 170                 175     


Asn Lys Asn Asp Leu Phe Glu Ala Ser Thr Val Tyr Asn Ile Val Asn 
            180                 185                 190         


Thr Ser Thr Tyr Leu Leu Met Val Glu Cys Ile Gly Ser Gly Asn Ser 
        195                 200                 205             


Pro Gly Gly Leu Arg Tyr Phe Arg Ser Trp Thr Thr Gln Ser Leu Thr 
    210                 215                 220                 


Ser Asp Lys Trp Thr Pro Leu Ala Ala Ser Gln Gln Thr Pro Phe Leu 
225                 230                 235                 240 


Gly Ala Ala Asn Thr Gln Phe Pro Ala Gly Arg Trp Ser Gln Ser Leu 
                245                 250                 255     


Ser His Gly Glu Leu Val Arg Thr Asn Val Asp Gln Arg Leu Gln Ile 
            260                 265                 270         


Arg Pro Cys Glu Met Arg Tyr Leu Tyr Gln Gly Ile Asp Pro Asn Ala 
        275                 280                 285             


Thr Gly Thr Tyr Asn Ala Leu Pro Trp Lys Leu Ala Leu Ala Thr Gln 
    290                 295                 300                 


Thr Asn Ser Lys Cys 
305                 


<210> 105
<211> 1365
<212> DNA
<213> Clostridium thermocellum

<400> 105
gtgaaaaaaa ttgtttcttt ggtttgtgtg cttgtgatgc tggtaagcat cttaggctcg     60

ttttcagtcg tagcggcatc accggtaaaa ggctttcagg tatcgggaac aaagcttttg    120

gatgcaagcg gaaacgagct tgtaatgagg ggcatgcgtg atatttcagc aatagatttg    180

gttaaagaaa taaaaatcgg atggaatttg ggaaatactt tggatgctcc tacagagact    240

gcctggggaa atccaaggac aaccaaggca atgatagaaa aggtaaggga aatgggcttt    300

aatgccgtca gagtgcctgt tacctgggat acgcacatcg gacctgctcc ggactataaa    360

attgacgaag catggctgaa cagagttgag gaagtggtaa actatgttct tgactgcggt    420

atgtacgcga tcataaatgt tcaccatgac aatacatgga ttatacctac atatgccaat    480

gagcaaagga gtaaagaaaa acttgtaaaa gtttgggaac aaatagcaac ccgttttaaa    540

gattatgacg accatttgtt gtttgagaca atgaacgaac cgagagaagt aggttcacct    600

atggaatgga tgggcggaac gtatgaaaac cgagatgtga taaacagatt taatttggcg    660

gttgttaata ccatcagagc aagcggcgga aataacgata aaagattcat actggttccg    720

accaatgcgg caaccggcct ggatgttgca ttaaacgacc ttgtcattcc gaacaatgac    780

agcagagtca tagtatccat acatgcttat tcaccgtatt tctttgctat ggatgtcaac    840

ggaacttcat attggggaag tgactatgac aaggcttctc ttacaagtga acttgatgct    900

atttacaaca gatttgtgaa aaacggaagg gctgtaatta tcggagaatt cggaaccatt    960

gacaagaaca acctgtcttc aagggtggct catgccgagc actatgcaag agaagcagtt   1020

tcaagaggaa ttgctgtttt ctggtgggat aacggctatt acaatccggg tgatgcagag   1080

acttatgcat tgctgaacag aaaaactctc tcatggtatt atcctgaaat tgtccaggct   1140

cttatgagag gtgccggcgt tgaaccttta gtttcaccga ctcctacacc tacattaatg   1200

ccgaccccct cgcccacggt gacagcaaat attttgtacg gtgacgtaaa cggggacgga   1260

aaaataaatt ctacagactg tacaatgcta aagagatata ttttgcgtgg catagaagaa   1320

ttcccaagtc ctagcggaat tatagccgct gacgtaaatg cggat                   1365

<210> 106
<211> 455
<212> PRT
<213> Clostridium thermocellum

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (71)...(359)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (415)...(435)
<223> Dockerin type I repeat

<220> 
<221> SITE
<222> (188)...(197)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (284)...(287)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (421)...(440)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (429)...(432)
<223> N-glycosylation site. Prosite id = PS00001

<400> 106
Met Lys Lys Ile Val Ser Leu Val Cys Val Leu Val Met Leu Val Ser 
1               5                   10                  15      


Ile Leu Gly Ser Phe Ser Val Val Ala Ala Ser Pro Val Lys Gly Phe 
            20                  25                  30          


Gln Val Ser Gly Thr Lys Leu Leu Asp Ala Ser Gly Asn Glu Leu Val 
        35                  40                  45              


Met Arg Gly Met Arg Asp Ile Ser Ala Ile Asp Leu Val Lys Glu Ile 
    50                  55                  60                  


Lys Ile Gly Trp Asn Leu Gly Asn Thr Leu Asp Ala Pro Thr Glu Thr 
65                  70                  75                  80  


Ala Trp Gly Asn Pro Arg Thr Thr Lys Ala Met Ile Glu Lys Val Arg 
                85                  90                  95      


Glu Met Gly Phe Asn Ala Val Arg Val Pro Val Thr Trp Asp Thr His 
            100                 105                 110         


Ile Gly Pro Ala Pro Asp Tyr Lys Ile Asp Glu Ala Trp Leu Asn Arg 
        115                 120                 125             


Val Glu Glu Val Val Asn Tyr Val Leu Asp Cys Gly Met Tyr Ala Ile 
    130                 135                 140                 


Ile Asn Val His His Asp Asn Thr Trp Ile Ile Pro Thr Tyr Ala Asn 
145                 150                 155                 160 


Glu Gln Arg Ser Lys Glu Lys Leu Val Lys Val Trp Glu Gln Ile Ala 
                165                 170                 175     


Thr Arg Phe Lys Asp Tyr Asp Asp His Leu Leu Phe Glu Thr Met Asn 
            180                 185                 190         


Glu Pro Arg Glu Val Gly Ser Pro Met Glu Trp Met Gly Gly Thr Tyr 
        195                 200                 205             


Glu Asn Arg Asp Val Ile Asn Arg Phe Asn Leu Ala Val Val Asn Thr 
    210                 215                 220                 


Ile Arg Ala Ser Gly Gly Asn Asn Asp Lys Arg Phe Ile Leu Val Pro 
225                 230                 235                 240 


Thr Asn Ala Ala Thr Gly Leu Asp Val Ala Leu Asn Asp Leu Val Ile 
                245                 250                 255     


Pro Asn Asn Asp Ser Arg Val Ile Val Ser Ile His Ala Tyr Ser Pro 
            260                 265                 270         


Tyr Phe Phe Ala Met Asp Val Asn Gly Thr Ser Tyr Trp Gly Ser Asp 
        275                 280                 285             


Tyr Asp Lys Ala Ser Leu Thr Ser Glu Leu Asp Ala Ile Tyr Asn Arg 
    290                 295                 300                 


Phe Val Lys Asn Gly Arg Ala Val Ile Ile Gly Glu Phe Gly Thr Ile 
305                 310                 315                 320 


Asp Lys Asn Asn Leu Ser Ser Arg Val Ala His Ala Glu His Tyr Ala 
                325                 330                 335     


Arg Glu Ala Val Ser Arg Gly Ile Ala Val Phe Trp Trp Asp Asn Gly 
            340                 345                 350         


Tyr Tyr Asn Pro Gly Asp Ala Glu Thr Tyr Ala Leu Leu Asn Arg Lys 
        355                 360                 365             


Thr Leu Ser Trp Tyr Tyr Pro Glu Ile Val Gln Ala Leu Met Arg Gly 
    370                 375                 380                 


Ala Gly Val Glu Pro Leu Val Ser Pro Thr Pro Thr Pro Thr Leu Met 
385                 390                 395                 400 


Pro Thr Pro Ser Pro Thr Val Thr Ala Asn Ile Leu Tyr Gly Asp Val 
                405                 410                 415     


Asn Gly Asp Gly Lys Ile Asn Ser Thr Asp Cys Thr Met Leu Lys Arg 
            420                 425                 430         


Tyr Ile Leu Arg Gly Ile Glu Glu Phe Pro Ser Pro Ser Gly Ile Ile 
        435                 440                 445             


Ala Ala Asp Val Asn Ala Asp 
    450                 455 


<210> 107
<211> 948
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 107
gtgactccaa caaatacgcc catccctgct ccgacattac accgcggtgt caactttggt     60

aacatgctcg aaccacccaa cgaaggtgaa tggggactgt atgtacagga ggaatatttc    120

aaccttgtaa aagaagcagg ttttgacttt gtccgtttgc ctgttagttg ggatgctcat    180

gctgagggag ccgaaccata cacgatcgat cctttatttt tctaccgcgt ggatcaagtt    240

cttgcttggg ctttggatcg aaaccttaca gtcattcttg attttcacaa ctatgacgac    300

atgatgtcca atccgtgggg aaataaggaa cgtttctttg ccatctggaa gcaagtttca    360

gaacgataca aggattaccc cgctaatctc ctattcgagt tgctcaatga gccgaattcg    420

gttctggatg cgcaactttg gaatcaatat gttggtgaag cattagccat cattcgtgat    480

acaaacccta cacgcgatgt ggtgatcggt ccgacacaat ggaattccta tggctggatt    540

tccacgttgg acgtgccaga tgatccgcat atgatcttca cttttcatta ctacgagcct    600

ttccatttca cacatcaagg cgcggagtgg gtgggcgatg aagcacaagg ctggttgggt    660

acaacctggg atgcgaccga tgagcagaaa gctgaagtca tcaataactt tgactcggtt    720

gccgattggt cgaagcgaca tggaaacgtc cgcattttgc tcggtgagtt tggcgcatac    780

tcgacagctc cgcaagactc acgcgtccgt tggacgacgt ttatccgtga gcaggcagaa    840

gctcacgggt tcgcgtgggc ttattgggaa ctggcggcag gcttcggggt gtatgatccg    900

gatgccaaag cgtggagaga agatctgttg aaggcattga tcccctga                 948

<210> 108
<211> 315
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (20)...(298)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (89)...(92)
<223> N-glycosylation site. Prosite id = PS00001

<400> 108
Met Thr Pro Thr Asn Thr Pro Ile Pro Ala Pro Thr Leu His Arg Gly 
1               5                   10                  15      


Val Asn Phe Gly Asn Met Leu Glu Pro Pro Asn Glu Gly Glu Trp Gly 
            20                  25                  30          


Leu Tyr Val Gln Glu Glu Tyr Phe Asn Leu Val Lys Glu Ala Gly Phe 
        35                  40                  45              


Asp Phe Val Arg Leu Pro Val Ser Trp Asp Ala His Ala Glu Gly Ala 
    50                  55                  60                  


Glu Pro Tyr Thr Ile Asp Pro Leu Phe Phe Tyr Arg Val Asp Gln Val 
65                  70                  75                  80  


Leu Ala Trp Ala Leu Asp Arg Asn Leu Thr Val Ile Leu Asp Phe His 
                85                  90                  95      


Asn Tyr Asp Asp Met Met Ser Asn Pro Trp Gly Asn Lys Glu Arg Phe 
            100                 105                 110         


Phe Ala Ile Trp Lys Gln Val Ser Glu Arg Tyr Lys Asp Tyr Pro Ala 
        115                 120                 125             


Asn Leu Leu Phe Glu Leu Leu Asn Glu Pro Asn Ser Val Leu Asp Ala 
    130                 135                 140                 


Gln Leu Trp Asn Gln Tyr Val Gly Glu Ala Leu Ala Ile Ile Arg Asp 
145                 150                 155                 160 


Thr Asn Pro Thr Arg Asp Val Val Ile Gly Pro Thr Gln Trp Asn Ser 
                165                 170                 175     


Tyr Gly Trp Ile Ser Thr Leu Asp Val Pro Asp Asp Pro His Met Ile 
            180                 185                 190         


Phe Thr Phe His Tyr Tyr Glu Pro Phe His Phe Thr His Gln Gly Ala 
        195                 200                 205             


Glu Trp Val Gly Asp Glu Ala Gln Gly Trp Leu Gly Thr Thr Trp Asp 
    210                 215                 220                 


Ala Thr Asp Glu Gln Lys Ala Glu Val Ile Asn Asn Phe Asp Ser Val 
225                 230                 235                 240 


Ala Asp Trp Ser Lys Arg His Gly Asn Val Arg Ile Leu Leu Gly Glu 
                245                 250                 255     


Phe Gly Ala Tyr Ser Thr Ala Pro Gln Asp Ser Arg Val Arg Trp Thr 
            260                 265                 270         


Thr Phe Ile Arg Glu Gln Ala Glu Ala His Gly Phe Ala Trp Ala Tyr 
        275                 280                 285             


Trp Glu Leu Ala Ala Gly Phe Gly Val Tyr Asp Pro Asp Ala Lys Ala 
    290                 295                 300                 


Trp Arg Glu Asp Leu Leu Lys Ala Leu Ile Pro 
305                 310                 315 


<210> 109
<211> 1401
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 109
atgagtagct ttaaagcctc tgcgatcaac cctcgcatgg cgggtgcgct aacacgctcc     60

ctctatgcgg ccgggttctc cctggctgta tccactcttt ccacacaggc ttatgccggt    120

tgccagtacg tggtcaacaa ccagtggaat aatggattcg atgccaaaat aagaataact    180

aacgacggga caaccgcgat caacggctgg aacattagct ggcgatacaa tggcgataac    240

cgcatcacca gcagttacaa cgccactttg tcaggcacca acccttactc cgctaccaac    300

ctgagctgga atggcaatat ccaacccggc caggctgttg agtttggatt ccagggcagc    360

aaaggcgccg ccgccgctga agtacctgtg attaccggca ccgcttgtgg tactacgacc    420

gcttcttctg ctgcgcgctc ttccaccaca accacttcag taccggcaac ttccagttcc    480

cgctcgtcca ccgcagtggc atcaagcgtt ggcaatgtta cccaaggtgt tgcgcctttg    540

gtggtccaag gcaacaaggt aactgccaat ggccaacccg ctaacctcgc cggcatgagc    600

ctgttctgga gcaacaccgg gtggggcggt gaaaagtatt acaacgccca agtggtgtcc    660

tggttgaaat cagactggaa agccaacctc gtacgcgttg ccatgggcgt tgaagatgcg    720

ggcggttatt tgaccgactc caccaacaag actcgtgcaa ccaccgttat tgatgcggcg    780

attgccaata acatgtatgt cattatcgac tggcataccc atcgcgccga aaataacaaa    840

gccgctgccg tagctttctt caaagaaatg gctaccaagt atggccaata caacaacgtg    900

atttacgagg tatacaacga acctctgaac gtctcctgga gtggggtcat caagccctac    960

gcaactgatg tcatcaagga aatccgtgcc atcgacccgg acaacctgat tatcgtaggg   1020

actcccaact ggtcacaaga cgtcgatgtg gccgcaaatg atcccatcac cacctacagc   1080

aacatcgcct acaccctgca cttctacgcg ggcacccaca agcaattcct gcgtgacaag   1140

gcacaaaccg cccttaaccg tggcattgcc ttgttcgtta ccgagtgggg ctcagtgaat   1200

gcggacggtg gcggtgctgt ggatgcagca gaaaccaaca cctggttgaa cttcctcaaa   1260

accaacggca ttagccatgc taactgggct ttgaatgata aagccgaagg cgcttctgct   1320

ttggttcccg gtgcgagcgc aaacggtggc tggaccagtg cccagctgac cgcatcaggc   1380

acgttgattc gcaatgcgat t                                             1401

<210> 110
<211> 467
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (39)...(136)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (184)...(439)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (72)...(75)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (88)...(91)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (101)...(104)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (174)...(177)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (252)...(255)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (304)...(313)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (314)...(317)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (348)...(351)
<223> N-glycosylation site. Prosite id = PS00001

<400> 110
Met Ser Ser Phe Lys Ala Ser Ala Ile Asn Pro Arg Met Ala Gly Ala 
1               5                   10                  15      


Leu Thr Arg Ser Leu Tyr Ala Ala Gly Phe Ser Leu Ala Val Ser Thr 
            20                  25                  30          


Leu Ser Thr Gln Ala Tyr Ala Gly Cys Gln Tyr Val Val Asn Asn Gln 
        35                  40                  45              


Trp Asn Asn Gly Phe Asp Ala Lys Ile Arg Ile Thr Asn Asp Gly Thr 
    50                  55                  60                  


Thr Ala Ile Asn Gly Trp Asn Ile Ser Trp Arg Tyr Asn Gly Asp Asn 
65                  70                  75                  80  


Arg Ile Thr Ser Ser Tyr Asn Ala Thr Leu Ser Gly Thr Asn Pro Tyr 
                85                  90                  95      


Ser Ala Thr Asn Leu Ser Trp Asn Gly Asn Ile Gln Pro Gly Gln Ala 
            100                 105                 110         


Val Glu Phe Gly Phe Gln Gly Ser Lys Gly Ala Ala Ala Ala Glu Val 
        115                 120                 125             


Pro Val Ile Thr Gly Thr Ala Cys Gly Thr Thr Thr Ala Ser Ser Ala 
    130                 135                 140                 


Ala Arg Ser Ser Thr Thr Thr Thr Ser Val Pro Ala Thr Ser Ser Ser 
145                 150                 155                 160 


Arg Ser Ser Thr Ala Val Ala Ser Ser Val Gly Asn Val Thr Gln Gly 
                165                 170                 175     


Val Ala Pro Leu Val Val Gln Gly Asn Lys Val Thr Ala Asn Gly Gln 
            180                 185                 190         


Pro Ala Asn Leu Ala Gly Met Ser Leu Phe Trp Ser Asn Thr Gly Trp 
        195                 200                 205             


Gly Gly Glu Lys Tyr Tyr Asn Ala Gln Val Val Ser Trp Leu Lys Ser 
    210                 215                 220                 


Asp Trp Lys Ala Asn Leu Val Arg Val Ala Met Gly Val Glu Asp Ala 
225                 230                 235                 240 


Gly Gly Tyr Leu Thr Asp Ser Thr Asn Lys Thr Arg Ala Thr Thr Val 
                245                 250                 255     


Ile Asp Ala Ala Ile Ala Asn Asn Met Tyr Val Ile Ile Asp Trp His 
            260                 265                 270         


Thr His Arg Ala Glu Asn Asn Lys Ala Ala Ala Val Ala Phe Phe Lys 
        275                 280                 285             


Glu Met Ala Thr Lys Tyr Gly Gln Tyr Asn Asn Val Ile Tyr Glu Val 
    290                 295                 300                 


Tyr Asn Glu Pro Leu Asn Val Ser Trp Ser Gly Val Ile Lys Pro Tyr 
305                 310                 315                 320 


Ala Thr Asp Val Ile Lys Glu Ile Arg Ala Ile Asp Pro Asp Asn Leu 
                325                 330                 335     


Ile Ile Val Gly Thr Pro Asn Trp Ser Gln Asp Val Asp Val Ala Ala 
            340                 345                 350         


Asn Asp Pro Ile Thr Thr Tyr Ser Asn Ile Ala Tyr Thr Leu His Phe 
        355                 360                 365             


Tyr Ala Gly Thr His Lys Gln Phe Leu Arg Asp Lys Ala Gln Thr Ala 
    370                 375                 380                 


Leu Asn Arg Gly Ile Ala Leu Phe Val Thr Glu Trp Gly Ser Val Asn 
385                 390                 395                 400 


Ala Asp Gly Gly Gly Ala Val Asp Ala Ala Glu Thr Asn Thr Trp Leu 
                405                 410                 415     


Asn Phe Leu Lys Thr Asn Gly Ile Ser His Ala Asn Trp Ala Leu Asn 
            420                 425                 430         


Asp Lys Ala Glu Gly Ala Ser Ala Leu Val Pro Gly Ala Ser Ala Asn 
        435                 440                 445             


Gly Gly Trp Thr Ser Ala Gln Leu Thr Ala Ser Gly Thr Leu Ile Arg 
    450                 455                 460                 


Asn Ala Ile 
465         


<210> 111
<211> 2043
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 111
atgagtagct ttaaagcctc tgcgatcaac cctcgcatgg cgggtgcgct aacacgctcc     60

ctctatgcgg ccgggttctc cctggctgta tccactcttt ccacacaggc ttatgccggt    120

tgccagtacg tggtcaacaa ccagtggaat aatggattcg atgccaaaat aagaataact    180

aacgacggga caaccgcgat caacggctgg aacattagct ggcgatacaa tggcgataac    240

cgcatcacca gcagttacaa cgccactttg tcaggcacca acccttactc cgctaccaac    300

ctgagctgga atggcaatat ccaacccggc caggctgttg agtttggatt ccagggcagc    360

aaaggcgccg ccgccgctga agtacctgtg attaccggca ccgcttgtgg tactacgacc    420

gcttcttctg ctgcgcgctc ttccaccaca accacttcag taccggcaac ttccagttcc    480

cgctcgtcca ccgcagtggc atcaagcgtt ggcaatgtta cccaaggtgt tgcgcctttg    540

gtggtccaag gcaacaaggt aactgccaat ggccaacccg ctaacctcgc cggcatgagc    600

ctgttctgga gcaacaccgg gtggggcggt gaaaagtatt acaacgccca agtggtgtcc    660

tggttgaaat cagactggaa agccaacctc gtacgcgttg ccatgggcgt tgaagatgcg    720

ggcggttatt tgaccgactc caccaacaag actcgtgcaa ccaccgttat tgatgcggcg    780

attgccaata acatgtatgt cattatcgac tggcataccc atcgcgccga aaataacaaa    840

gccgctgccg tagctttctt caaagaaatg gctaccaagt atggccaata caacaacgtg    900

atttacgagg tatacaacga acctctgaac gtctcctgga gtggggtcat caagccctac    960

gcaactgatg tcatcaagga aatccgtgcc atcgacccgg acaacctgat tatcgtaggg   1020

actcccaact ggtcacaaga cgtcgatgtg gccgcaaatg atcccatcac cacctacagc   1080

aacatcgcct acaccctgca cttctacgcg ggcacccaca agcaattcct gcgtgacaag   1140

gcacaaaccg cccttaaccg tggcattgcc ttgttcgtta ccgagtgggg ctcagtgaat   1200

gcggacggtg gcggtgctgt ggatgcagca gaaaccaaca cctggttgaa cttcctcaaa   1260

accaacggca ttagccatgc taactgggct ttgaatgata aagccgaagg cgcttctgct   1320

ttggttcccg gtgcgagcgc aaacggtggc tggaccagtg cccagctgac cgcatcaggc   1380

acgttgattc gcaatgcgat tattgccaac aacaacggca ccacctcaag tgcggcacct   1440

tccagcatcc cgcgtagttc ggtggctccg tcatcggtag ctcgttcttc cagtagttca   1500

gtaacaccaa gctcggtacc cgcgtcttca gttgcaccca gctctagcag ccgtgccgcc   1560

tcctcagtgg ccaacactgc cgcctggaac ctgaatacca ccgattctta cctgaacttc   1620

gtgaccacca agaacaccca caacgtggaa gttcatagct tcaccggctt gactggcgat   1680

atcagcagcg cgggcgtagc caccctgacc atcgacctca acagtgtgaa caccggtgta   1740

gccctgcgcg accaacgcat gaaggacctg ctgtttgaaa cagcaaccta cccgactgcg   1800

actgtcactg tcaccgtacc tactacagtg atcagcagcc tggccgttgg ccaatcagcc   1860

gctaccgata tctctgccac cctggacctg cacggtgtga ctggcactat caccaccaaa   1920

gtctctgtcc agaaactgtc caacagccgc atcctggtgc aaaccctggc gccggtattg   1980

gttaaagcgg gcgactactc gctcaccaat ggtgtggaag cactgcgtgc tgcggtagca   2040

att                                                                 2043

<210> 112
<211> 681
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (39)...(136)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (184)...(439)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (526)...(677)
<223> YceI like family

<220> 
<221> SITE
<222> (72)...(75)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (88)...(91)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (101)...(104)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (174)...(177)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (252)...(255)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (304)...(313)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (314)...(317)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (348)...(351)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (479)...(482)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (540)...(543)
<223> N-glycosylation site. Prosite id = PS00001

<400> 112
Met Ser Ser Phe Lys Ala Ser Ala Ile Asn Pro Arg Met Ala Gly Ala 
1               5                   10                  15      


Leu Thr Arg Ser Leu Tyr Ala Ala Gly Phe Ser Leu Ala Val Ser Thr 
            20                  25                  30          


Leu Ser Thr Gln Ala Tyr Ala Gly Cys Gln Tyr Val Val Asn Asn Gln 
        35                  40                  45              


Trp Asn Asn Gly Phe Asp Ala Lys Ile Arg Ile Thr Asn Asp Gly Thr 
    50                  55                  60                  


Thr Ala Ile Asn Gly Trp Asn Ile Ser Trp Arg Tyr Asn Gly Asp Asn 
65                  70                  75                  80  


Arg Ile Thr Ser Ser Tyr Asn Ala Thr Leu Ser Gly Thr Asn Pro Tyr 
                85                  90                  95      


Ser Ala Thr Asn Leu Ser Trp Asn Gly Asn Ile Gln Pro Gly Gln Ala 
            100                 105                 110         


Val Glu Phe Gly Phe Gln Gly Ser Lys Gly Ala Ala Ala Ala Glu Val 
        115                 120                 125             


Pro Val Ile Thr Gly Thr Ala Cys Gly Thr Thr Thr Ala Ser Ser Ala 
    130                 135                 140                 


Ala Arg Ser Ser Thr Thr Thr Thr Ser Val Pro Ala Thr Ser Ser Ser 
145                 150                 155                 160 


Arg Ser Ser Thr Ala Val Ala Ser Ser Val Gly Asn Val Thr Gln Gly 
                165                 170                 175     


Val Ala Pro Leu Val Val Gln Gly Asn Lys Val Thr Ala Asn Gly Gln 
            180                 185                 190         


Pro Ala Asn Leu Ala Gly Met Ser Leu Phe Trp Ser Asn Thr Gly Trp 
        195                 200                 205             


Gly Gly Glu Lys Tyr Tyr Asn Ala Gln Val Val Ser Trp Leu Lys Ser 
    210                 215                 220                 


Asp Trp Lys Ala Asn Leu Val Arg Val Ala Met Gly Val Glu Asp Ala 
225                 230                 235                 240 


Gly Gly Tyr Leu Thr Asp Ser Thr Asn Lys Thr Arg Ala Thr Thr Val 
                245                 250                 255     


Ile Asp Ala Ala Ile Ala Asn Asn Met Tyr Val Ile Ile Asp Trp His 
            260                 265                 270         


Thr His Arg Ala Glu Asn Asn Lys Ala Ala Ala Val Ala Phe Phe Lys 
        275                 280                 285             


Glu Met Ala Thr Lys Tyr Gly Gln Tyr Asn Asn Val Ile Tyr Glu Val 
    290                 295                 300                 


Tyr Asn Glu Pro Leu Asn Val Ser Trp Ser Gly Val Ile Lys Pro Tyr 
305                 310                 315                 320 


Ala Thr Asp Val Ile Lys Glu Ile Arg Ala Ile Asp Pro Asp Asn Leu 
                325                 330                 335     


Ile Ile Val Gly Thr Pro Asn Trp Ser Gln Asp Val Asp Val Ala Ala 
            340                 345                 350         


Asn Asp Pro Ile Thr Thr Tyr Ser Asn Ile Ala Tyr Thr Leu His Phe 
        355                 360                 365             


Tyr Ala Gly Thr His Lys Gln Phe Leu Arg Asp Lys Ala Gln Thr Ala 
    370                 375                 380                 


Leu Asn Arg Gly Ile Ala Leu Phe Val Thr Glu Trp Gly Ser Val Asn 
385                 390                 395                 400 


Ala Asp Gly Gly Gly Ala Val Asp Ala Ala Glu Thr Asn Thr Trp Leu 
                405                 410                 415     


Asn Phe Leu Lys Thr Asn Gly Ile Ser His Ala Asn Trp Ala Leu Asn 
            420                 425                 430         


Asp Lys Ala Glu Gly Ala Ser Ala Leu Val Pro Gly Ala Ser Ala Asn 
        435                 440                 445             


Gly Gly Trp Thr Ser Ala Gln Leu Thr Ala Ser Gly Thr Leu Ile Arg 
    450                 455                 460                 


Asn Ala Ile Ile Ala Asn Asn Asn Gly Thr Thr Ser Ser Ala Ala Pro 
465                 470                 475                 480 


Ser Ser Ile Pro Arg Ser Ser Val Ala Pro Ser Ser Val Ala Arg Ser 
                485                 490                 495     


Ser Ser Ser Ser Val Thr Pro Ser Ser Val Pro Ala Ser Ser Val Ala 
            500                 505                 510         


Pro Ser Ser Ser Ser Arg Ala Ala Ser Ser Val Ala Asn Thr Ala Ala 
        515                 520                 525             


Trp Asn Leu Asn Thr Thr Asp Ser Tyr Leu Asn Phe Val Thr Thr Lys 
    530                 535                 540                 


Asn Thr His Asn Val Glu Val His Ser Phe Thr Gly Leu Thr Gly Asp 
545                 550                 555                 560 


Ile Ser Ser Ala Gly Val Ala Thr Leu Thr Ile Asp Leu Asn Ser Val 
                565                 570                 575     


Asn Thr Gly Val Ala Leu Arg Asp Gln Arg Met Lys Asp Leu Leu Phe 
            580                 585                 590         


Glu Thr Ala Thr Tyr Pro Thr Ala Thr Val Thr Val Thr Val Pro Thr 
        595                 600                 605             


Thr Val Ile Ser Ser Leu Ala Val Gly Gln Ser Ala Ala Thr Asp Ile 
    610                 615                 620                 


Ser Ala Thr Leu Asp Leu His Gly Val Thr Gly Thr Ile Thr Thr Lys 
625                 630                 635                 640 


Val Ser Val Gln Lys Leu Ser Asn Ser Arg Ile Leu Val Gln Thr Leu 
                645                 650                 655     


Ala Pro Val Leu Val Lys Ala Gly Asp Tyr Ser Leu Thr Asn Gly Val 
            660                 665                 670         


Glu Ala Leu Arg Ala Ala Val Ala Ile 
        675                 680     


<210> 113
<211> 2661
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 113
atgagtagct ttaaagcctc tgcgatcaac cctcgcatgg caggtacgct aacgcgttcc     60

ctctatgcgg ccgggttctc cctggctgta tccactcttt ccacccaggc ctatgccggt    120

tgccagtacg tgatcgacag ccagtggaat aatgggttcg gcgccaaaat aagaataacc    180

aatgatggga caactgcgat caatggctgg aatgtgagct ggcgatacag cggcgataac    240

cgcattacca gcagttacaa cgccaccctg accggttcca acccctactc cgccactaac    300

ctgagctgga acgctaccat ccagccaaag caaaccgttg agtttggttt ccagggctcc    360

aaaggcgccg cagcggctga agtgccagtg attaccggta cgggttgtgg tactgccacc    420

tcctcggcag ccccatcatc agccccaccc gtctcatcag caccaactac cagctcacgc    480

tcctcggccg cgtccagcaa tggcactgtc caaggcgtag cacccctggt agtacagggc    540

aacaaagtaa ccgccaatgg ccaaccggcc aacctggcgg gcatgagcct gttctggagt    600

aacaccggct ggggcggcga gaaatactac aacgcccagg tagtttcctg gctgaaatcc    660

gactggaaag ccaatctgat ccgtgttgct atgggcacgg aagaagccgg tggctacctg    720

accgacgcct ccaacaaaac ccgtgctacc gccgtaattg atgcggcgat tgccaacaac    780

atgtatgtca ttatcgattg gcatacccat cacgctgaag ataacaaggc cgctgccatt    840

accttcttca aggaaatggc aaccaagtac ggcaactaca acaacgtgat ctatgaggta    900

tacaacgagc cgttgaatat ctcctggagc ggtgtactca agccatacgc gactgatgtg    960

atccgcgaaa tccgtgccat tgacccggac aacctgatta ttgtaggtac gcctaactgg   1020

tctcaggacg ttgacgtagc cgctaacgac ccgatcaccg cctacagcaa cattgcctac   1080

accctgcact tctatgccgg tacccacaaa caattcctgc gtgacaaggc gcaaaccgca   1140

ctgaatcgcg gcattgccct gtttgttacc gagtggggtg cagtgaatgc cgatggcggt   1200

ggcggtgtgg attcagccga gaccgcgacc tggttgaact tcctcaaaac caacggcatc   1260

agccatgcca actgggcatt gaatgataag gccgagggcg cttctgccct ggtgcccggc   1320

gccagtgtca acggtggctg gaccagcgca caactgaccg catccggcac cctggtgcgt   1380

aacgcgatca tcgccaacaa cggcaatgtg accagctcta ccccggctac ctcttcacgc   1440

agctcggtac cgtccagcag ctcggtcgcc ctcacctcca gctcgcgttc cagcagctcg   1500

gtggcgattg gcagcagctc gtctaccaac aacaatggcg gcggcgtact cttctccgaa   1560

acctttgaaa acggtgcagt caatacccaa ccggctggct gggaaaactt catcggctac   1620

gtacgcaaca ataacaacac cacctccggt tcggcttatg ccttgatcga tagcagcaaa   1680

gcctacagcg gtggcaagtc tatccgtttc aaaggcggtt catcgccagc gcaaatcgtg   1740

cgcgccctgc ctgctggcac ccaacgcctc tacacccgcg cttacgtgaa catgagcgta   1800

gctatgggta atgtggcggg tgacaaccac gaacacatct tcggtatcaa gaaaagcttc   1860

gatgccaata atgaaattcg tgtcggtcaa atcaaagggg tattgggtac caaccacgtt   1920

cctagcgata acatcgcgcc caaacaaagc cagtggtaca gcggcccgca aatggcagcc   1980

aacagatggt actgcgtgga aacggcttac ctggccgatg aagcctatga caccttgcat   2040

atgtgggtag atggcaatct ggttcacact gtcgattcat ctgacgactg gaacaacggc   2100

gccctgagtg ccaattggtt gagcgataac tttaacttcg tcatgttcgg cttccacagc   2160

ttcagcaacc gcaatgcgga tgtctggatg gacgacatcg tggtttctac ccaacctatc   2220

ggttgcggta ccgtaaaccc acccagcagc agctcggtcg cctcggttgt accaagctcc   2280

tccagccgca gctcggtagc accttcatca tccagcgttg ctatcaccag cagttcagta   2340

ccttcgtcct ctgttgcgcc aagctccagc agccgctcca gctcgtctgt tgccaacacc   2400

gctgcctgga acctgaacac caccgactcc tacctgaact tcgtcaccac caaaaacacc   2460

cacaacgtcg aagtgcatag cttcaccggg ttaacaggtg acatcagcag cgctggcgtt   2520

gcgaccctga ccattgacct cagcagcgtg agcaccggtg tgacactgcg tgacgagcgt   2580

atgagggatt tgttgtttga aacagcgacc tatccaacgg ccaccatcac agtgactgtg   2640

ccttctaccc tgatccacta g                                             2661

<210> 114
<211> 886
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(39)

<220> 
<221> DOMAIN
<222> (39)...(136)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (180)...(435)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (72)...(75)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (88)...(91)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (101)...(104)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (105)...(108)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (169)...(172)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (248)...(251)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (300)...(309)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (310)...(313)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (344)...(347)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (476)...(479)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (553)...(556)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (606)...(609)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (818)...(821)
<223> N-glycosylation site. Prosite id = PS00001

<400> 114
Met Ser Ser Phe Lys Ala Ser Ala Ile Asn Pro Arg Met Ala Gly Thr 
1               5                   10                  15      


Leu Thr Arg Ser Leu Tyr Ala Ala Gly Phe Ser Leu Ala Val Ser Thr 
            20                  25                  30          


Leu Ser Thr Gln Ala Tyr Ala Gly Cys Gln Tyr Val Ile Asp Ser Gln 
        35                  40                  45              


Trp Asn Asn Gly Phe Gly Ala Lys Ile Arg Ile Thr Asn Asp Gly Thr 
    50                  55                  60                  


Thr Ala Ile Asn Gly Trp Asn Val Ser Trp Arg Tyr Ser Gly Asp Asn 
65                  70                  75                  80  


Arg Ile Thr Ser Ser Tyr Asn Ala Thr Leu Thr Gly Ser Asn Pro Tyr 
                85                  90                  95      


Ser Ala Thr Asn Leu Ser Trp Asn Ala Thr Ile Gln Pro Lys Gln Thr 
            100                 105                 110         


Val Glu Phe Gly Phe Gln Gly Ser Lys Gly Ala Ala Ala Ala Glu Val 
        115                 120                 125             


Pro Val Ile Thr Gly Thr Gly Cys Gly Thr Ala Thr Ser Ser Ala Ala 
    130                 135                 140                 


Pro Ser Ser Ala Pro Pro Val Ser Ser Ala Pro Thr Thr Ser Ser Arg 
145                 150                 155                 160 


Ser Ser Ala Ala Ser Ser Asn Gly Thr Val Gln Gly Val Ala Pro Leu 
                165                 170                 175     


Val Val Gln Gly Asn Lys Val Thr Ala Asn Gly Gln Pro Ala Asn Leu 
            180                 185                 190         


Ala Gly Met Ser Leu Phe Trp Ser Asn Thr Gly Trp Gly Gly Glu Lys 
        195                 200                 205             


Tyr Tyr Asn Ala Gln Val Val Ser Trp Leu Lys Ser Asp Trp Lys Ala 
    210                 215                 220                 


Asn Leu Ile Arg Val Ala Met Gly Thr Glu Glu Ala Gly Gly Tyr Leu 
225                 230                 235                 240 


Thr Asp Ala Ser Asn Lys Thr Arg Ala Thr Ala Val Ile Asp Ala Ala 
                245                 250                 255     


Ile Ala Asn Asn Met Tyr Val Ile Ile Asp Trp His Thr His His Ala 
            260                 265                 270         


Glu Asp Asn Lys Ala Ala Ala Ile Thr Phe Phe Lys Glu Met Ala Thr 
        275                 280                 285             


Lys Tyr Gly Asn Tyr Asn Asn Val Ile Tyr Glu Val Tyr Asn Glu Pro 
    290                 295                 300                 


Leu Asn Ile Ser Trp Ser Gly Val Leu Lys Pro Tyr Ala Thr Asp Val 
305                 310                 315                 320 


Ile Arg Glu Ile Arg Ala Ile Asp Pro Asp Asn Leu Ile Ile Val Gly 
                325                 330                 335     


Thr Pro Asn Trp Ser Gln Asp Val Asp Val Ala Ala Asn Asp Pro Ile 
            340                 345                 350         


Thr Ala Tyr Ser Asn Ile Ala Tyr Thr Leu His Phe Tyr Ala Gly Thr 
        355                 360                 365             


His Lys Gln Phe Leu Arg Asp Lys Ala Gln Thr Ala Leu Asn Arg Gly 
    370                 375                 380                 


Ile Ala Leu Phe Val Thr Glu Trp Gly Ala Val Asn Ala Asp Gly Gly 
385                 390                 395                 400 


Gly Gly Val Asp Ser Ala Glu Thr Ala Thr Trp Leu Asn Phe Leu Lys 
                405                 410                 415     


Thr Asn Gly Ile Ser His Ala Asn Trp Ala Leu Asn Asp Lys Ala Glu 
            420                 425                 430         


Gly Ala Ser Ala Leu Val Pro Gly Ala Ser Val Asn Gly Gly Trp Thr 
        435                 440                 445             


Ser Ala Gln Leu Thr Ala Ser Gly Thr Leu Val Arg Asn Ala Ile Ile 
    450                 455                 460                 


Ala Asn Asn Gly Asn Val Thr Ser Ser Thr Pro Ala Thr Ser Ser Arg 
465                 470                 475                 480 


Ser Ser Val Pro Ser Ser Ser Ser Val Ala Leu Thr Ser Ser Ser Arg 
                485                 490                 495     


Ser Ser Ser Ser Val Ala Ile Gly Ser Ser Ser Ser Thr Asn Asn Asn 
            500                 505                 510         


Gly Gly Gly Val Leu Phe Ser Glu Thr Phe Glu Asn Gly Ala Val Asn 
        515                 520                 525             


Thr Gln Pro Ala Gly Trp Glu Asn Phe Ile Gly Tyr Val Arg Asn Asn 
    530                 535                 540                 


Asn Asn Thr Thr Ser Gly Ser Ala Tyr Ala Leu Ile Asp Ser Ser Lys 
545                 550                 555                 560 


Ala Tyr Ser Gly Gly Lys Ser Ile Arg Phe Lys Gly Gly Ser Ser Pro 
                565                 570                 575     


Ala Gln Ile Val Arg Ala Leu Pro Ala Gly Thr Gln Arg Leu Tyr Thr 
            580                 585                 590         


Arg Ala Tyr Val Asn Met Ser Val Ala Met Gly Asn Val Ala Gly Asp 
        595                 600                 605             


Asn His Glu His Ile Phe Gly Ile Lys Lys Ser Phe Asp Ala Asn Asn 
    610                 615                 620                 


Glu Ile Arg Val Gly Gln Ile Lys Gly Val Leu Gly Thr Asn His Val 
625                 630                 635                 640 


Pro Ser Asp Asn Ile Ala Pro Lys Gln Ser Gln Trp Tyr Ser Gly Pro 
                645                 650                 655     


Gln Met Ala Ala Asn Arg Trp Tyr Cys Val Glu Thr Ala Tyr Leu Ala 
            660                 665                 670         


Asp Glu Ala Tyr Asp Thr Leu His Met Trp Val Asp Gly Asn Leu Val 
        675                 680                 685             


His Thr Val Asp Ser Ser Asp Asp Trp Asn Asn Gly Ala Leu Ser Ala 
    690                 695                 700                 


Asn Trp Leu Ser Asp Asn Phe Asn Phe Val Met Phe Gly Phe His Ser 
705                 710                 715                 720 


Phe Ser Asn Arg Asn Ala Asp Val Trp Met Asp Asp Ile Val Val Ser 
                725                 730                 735     


Thr Gln Pro Ile Gly Cys Gly Thr Val Asn Pro Pro Ser Ser Ser Ser 
            740                 745                 750         


Val Ala Ser Val Val Pro Ser Ser Ser Ser Arg Ser Ser Val Ala Pro 
        755                 760                 765             


Ser Ser Ser Ser Val Ala Ile Thr Ser Ser Ser Val Pro Ser Ser Ser 
    770                 775                 780                 


Val Ala Pro Ser Ser Ser Ser Arg Ser Ser Ser Ser Val Ala Asn Thr 
785                 790                 795                 800 


Ala Ala Trp Asn Leu Asn Thr Thr Asp Ser Tyr Leu Asn Phe Val Thr 
                805                 810                 815     


Thr Lys Asn Thr His Asn Val Glu Val His Ser Phe Thr Gly Leu Thr 
            820                 825                 830         


Gly Asp Ile Ser Ser Ala Gly Val Ala Thr Leu Thr Ile Asp Leu Ser 
        835                 840                 845             


Ser Val Ser Thr Gly Val Thr Leu Arg Asp Glu Arg Met Arg Asp Leu 
    850                 855                 860                 


Leu Phe Glu Thr Ala Thr Tyr Pro Thr Ala Thr Ile Thr Val Thr Val 
865                 870                 875                 880 


Pro Ser Thr Leu Ile His 
                885     


<210> 115
<211> 2034
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 115
atgaggaaaa ttattttaaa gttttgtgca ctcatgatgg tagtgatttt gattgtttcc     60

attttacaaa tattacctgt atttgcccag agcataccgt atgaaaagga aaaatatcca    120

catcttcttg gcaatcaggt agttaaaaaa ccatcggttg ccggcagact gcagattatt    180

gaaaaggacg gaaaaaagta tttagctgac cagaaaggag aaataattca gcttcgtggt    240

atgagtacac atggacttca gtggtatggt gatattataa acaaaaatgc atttgaagct    300

ctttcaaaag attgggagtg caacgttata aggcttgcga tgtatgtggg tgaaggcggc    360

tatgcttcaa atccaagtat taaagaaaaa gttatagaag ggattaagct tgctattgag    420

aatgacatgt atgtaattgt tgactggcat gtattaaatc ccggtgaccc gaacgcagaa    480

atttataaag gggcaaaaga ctttttcaaa gagatagcta caagttttcc caatgactat    540

cacataatat atgaactttg caatgaacca aatccaaatg aaccgggagt agaaaatagc    600

ttggatggct ggaaaaaagt aaaggcttat gcacagccca tcataaaaat gctcagaagt    660

ttggggaatc agaacattat aattgtaggt tcgccaaact ggagtcagag acctgacttt    720

gcaattcaag accctataaa tgataaaaat gttatgtatt cagttcattt ttactctgga    780

actcacaaag ttgatggata tgtttttgaa aacatgaaaa atgcgtttga aaatggcgtg    840

ccaattttcg tgagtgaatg gggaacaagt ttggcaagcg gtgatggtgg accgtatctt    900

gatgaagcag ataagtggct tgaatattta aattcaaact atattagctg ggtgaactgg    960

tcgctgtcaa acaaaaatga gacatcagct gcttttgttc catatataaa cggtatgcat   1020

gatgccacac cacttgaccc tggtgatgat aaggtgtggg acatagaaga gcttagtatt   1080

tctggagagt atgtgagggc aaggataaaa ggaattgctt atcagccaat taagagagat   1140

aacaaaataa aagaaggaga aaatgcacct ttaggcgaaa aagtcttacc atccacgttt   1200

gaagatgaca ctcgtcaggg ctgggattgg gatggaccat ctggtgtgaa aggtcctatt   1260

actatcgaaa gtgcgaatgg ttcaaaagcg ctatctttta attttgagta tccagagaaa   1320

aaaccacaag atggctgggc aacagctgca aggcttatac ttaaagacat aaatgtagaa   1380

aggggaaata ataaatattt ggcttttgat ttttatttga aaccagatag ggcttcaaaa   1440

ggtatgattc agatattttt agctttttca cctccttcct taggttactg ggctcaggta   1500

caagacagtt ttaatattga ccttgcaaaa ctgtcaagtg caaaaaagat agaagacaga   1560

atttataagt tcaatgtatt ttttgactta gacaagatac aagataataa agtactgagt   1620

ccagacacac tcttgagaga tataatagta gtcatagcag atggcaatag cgattttaag   1680

gggaaaatgt atatagataa tgttagattt accaatatcc tttttgagga tatcaatttt   1740

gaaaatagcc tttatgatgt tatcgacaag ctttattcta aaggaatcat aaaaggaatt   1800

tcagtattta agtacttgcc agataaaacc attacaaggg ctgaatttgc tgcactttgt   1860

gtcagggcac tgaacctgaa aattgaaaaa tacgatggta gattttctga tgtgaaaagc   1920

ggcaactggt attcagatgt agtttatacg gcgtataaaa ataaattgtt tgaaataaaa   1980

gagaataaat tctttcctga aaatatttta aaaagagaag aagcagtagc tttg         2034

<210> 116
<211> 678
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (66)...(330)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (352)...(571)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> DOMAIN
<222> (575)...(617)
<223> S-layer homology domain

<220> 
<221> DOMAIN
<222> (635)...(676)
<223> S-layer homology domain

<220> 
<221> SITE
<222> (184)...(193)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (236)...(239)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (331)...(334)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (432)...(435)
<223> N-glycosylation site. Prosite id = PS00001

<400> 116
Met Arg Lys Ile Ile Leu Lys Phe Cys Ala Leu Met Met Val Val Ile 
1               5                   10                  15      


Leu Ile Val Ser Ile Leu Gln Ile Leu Pro Val Phe Ala Gln Ser Ile 
            20                  25                  30          


Pro Tyr Glu Lys Glu Lys Tyr Pro His Leu Leu Gly Asn Gln Val Val 
        35                  40                  45              


Lys Lys Pro Ser Val Ala Gly Arg Leu Gln Ile Ile Glu Lys Asp Gly 
    50                  55                  60                  


Lys Lys Tyr Leu Ala Asp Gln Lys Gly Glu Ile Ile Gln Leu Arg Gly 
65                  70                  75                  80  


Met Ser Thr His Gly Leu Gln Trp Tyr Gly Asp Ile Ile Asn Lys Asn 
                85                  90                  95      


Ala Phe Glu Ala Leu Ser Lys Asp Trp Glu Cys Asn Val Ile Arg Leu 
            100                 105                 110         


Ala Met Tyr Val Gly Glu Gly Gly Tyr Ala Ser Asn Pro Ser Ile Lys 
        115                 120                 125             


Glu Lys Val Ile Glu Gly Ile Lys Leu Ala Ile Glu Asn Asp Met Tyr 
    130                 135                 140                 


Val Ile Val Asp Trp His Val Leu Asn Pro Gly Asp Pro Asn Ala Glu 
145                 150                 155                 160 


Ile Tyr Lys Gly Ala Lys Asp Phe Phe Lys Glu Ile Ala Thr Ser Phe 
                165                 170                 175     


Pro Asn Asp Tyr His Ile Ile Tyr Glu Leu Cys Asn Glu Pro Asn Pro 
            180                 185                 190         


Asn Glu Pro Gly Val Glu Asn Ser Leu Asp Gly Trp Lys Lys Val Lys 
        195                 200                 205             


Ala Tyr Ala Gln Pro Ile Ile Lys Met Leu Arg Ser Leu Gly Asn Gln 
    210                 215                 220                 


Asn Ile Ile Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Phe 
225                 230                 235                 240 


Ala Ile Gln Asp Pro Ile Asn Asp Lys Asn Val Met Tyr Ser Val His 
                245                 250                 255     


Phe Tyr Ser Gly Thr His Lys Val Asp Gly Tyr Val Phe Glu Asn Met 
            260                 265                 270         


Lys Asn Ala Phe Glu Asn Gly Val Pro Ile Phe Val Ser Glu Trp Gly 
        275                 280                 285             


Thr Ser Leu Ala Ser Gly Asp Gly Gly Pro Tyr Leu Asp Glu Ala Asp 
    290                 295                 300                 


Lys Trp Leu Glu Tyr Leu Asn Ser Asn Tyr Ile Ser Trp Val Asn Trp 
305                 310                 315                 320 


Ser Leu Ser Asn Lys Asn Glu Thr Ser Ala Ala Phe Val Pro Tyr Ile 
                325                 330                 335     


Asn Gly Met His Asp Ala Thr Pro Leu Asp Pro Gly Asp Asp Lys Val 
            340                 345                 350         


Trp Asp Ile Glu Glu Leu Ser Ile Ser Gly Glu Tyr Val Arg Ala Arg 
        355                 360                 365             


Ile Lys Gly Ile Ala Tyr Gln Pro Ile Lys Arg Asp Asn Lys Ile Lys 
    370                 375                 380                 


Glu Gly Glu Asn Ala Pro Leu Gly Glu Lys Val Leu Pro Ser Thr Phe 
385                 390                 395                 400 


Glu Asp Asp Thr Arg Gln Gly Trp Asp Trp Asp Gly Pro Ser Gly Val 
                405                 410                 415     


Lys Gly Pro Ile Thr Ile Glu Ser Ala Asn Gly Ser Lys Ala Leu Ser 
            420                 425                 430         


Phe Asn Phe Glu Tyr Pro Glu Lys Lys Pro Gln Asp Gly Trp Ala Thr 
        435                 440                 445             


Ala Ala Arg Leu Ile Leu Lys Asp Ile Asn Val Glu Arg Gly Asn Asn 
    450                 455                 460                 


Lys Tyr Leu Ala Phe Asp Phe Tyr Leu Lys Pro Asp Arg Ala Ser Lys 
465                 470                 475                 480 


Gly Met Ile Gln Ile Phe Leu Ala Phe Ser Pro Pro Ser Leu Gly Tyr 
                485                 490                 495     


Trp Ala Gln Val Gln Asp Ser Phe Asn Ile Asp Leu Ala Lys Leu Ser 
            500                 505                 510         


Ser Ala Lys Lys Ile Glu Asp Arg Ile Tyr Lys Phe Asn Val Phe Phe 
        515                 520                 525             


Asp Leu Asp Lys Ile Gln Asp Asn Lys Val Leu Ser Pro Asp Thr Leu 
    530                 535                 540                 


Leu Arg Asp Ile Ile Val Val Ile Ala Asp Gly Asn Ser Asp Phe Lys 
545                 550                 555                 560 


Gly Lys Met Tyr Ile Asp Asn Val Arg Phe Thr Asn Ile Leu Phe Glu 
                565                 570                 575     


Asp Ile Asn Phe Glu Asn Ser Leu Tyr Asp Val Ile Asp Lys Leu Tyr 
            580                 585                 590         


Ser Lys Gly Ile Ile Lys Gly Ile Ser Val Phe Lys Tyr Leu Pro Asp 
        595                 600                 605             


Lys Thr Ile Thr Arg Ala Glu Phe Ala Ala Leu Cys Val Arg Ala Leu 
    610                 615                 620                 


Asn Leu Lys Ile Glu Lys Tyr Asp Gly Arg Phe Ser Asp Val Lys Ser 
625                 630                 635                 640 


Gly Asn Trp Tyr Ser Asp Val Val Tyr Thr Ala Tyr Lys Asn Lys Leu 
                645                 650                 655     


Phe Glu Ile Lys Glu Asn Lys Phe Phe Pro Glu Asn Ile Leu Lys Arg 
            660                 665                 670         


Glu Glu Ala Val Ala Leu 
        675             


<210> 117
<211> 2244
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 117
atgaaaaaga gacaaggttt tatcaaaaaa gggctggttt tgggcgtttc attgcttttg     60

ctggcgttga tcatgatgag cgccacatcg caaacatcgg ccagtccaca gaatccgttt    120

ttatggcctt ataatcaacc gacccacatc agctttaacg aagccgacgt gtatcaggcg    180

tggacggtct ggcgcgacgc gcagattacg gcgaacaatg cggggggaaa cggccgttac    240

cgggtcatgg gcggcgtgga tggcggcagc accgtctccg aagggcaggc ttacggcatt    300

ctcttcgctt ccatctttga cgagcaaacc ctgtttgacg gtttgttcct gttcgccaaa    360

gaccactata acgaaaacgg cgtcatggac tggcatatcg gcagccctgg tgtgcgtatt    420

ggcagcggcg gcgccaccga cgctgaagtg gacatggcgt taggattggt caacgcctgc    480

gtcaaagtgc agaaaaacgc ctggcctgcc agtccggccg gcgtcaacta ctgcgtcgaa    540

gccactaacc tgatcaacgc catttatacc tacgaggtag accatgccgg cagtgcccct    600

cctggcgggc tgcccaacaa tcagggcaac gaactgctgc ccggtgatac ctggaacatc    660

agcgaacttt atccacaagg catcatcaac ctctcctact tcccgcccgg ctatttcacc    720

gttttcggca agtttacggg caacgaagcc gcctggaacg ccgtcatcaa tcgcaactat    780

caggtggtgg acatggtgca ggccaaaccg aataactgct ccggactggt gcctaactgg    840

aataattaca acggcgacgc ccaactcgtc tcctggcaga ccaacaatta cagttggtgg    900

agctacgacg cggcccgttt cgcctggcgt attgccgtcg atcaggcctg gtacggccgt    960

tctgaggccg ccgaaaccat gaatgaaatt ggcggtttct tcagcagtac cggcttcaac   1020

aacatcggcg aacacaacat gaatgggcag cgggtgggca gtggtccctg gcctttcttt   1080

gtggcgaatg cggcggcggc ggtgtgggca gcgcccaatg caacggccgt taactgcggt   1140

acagccacag gttcgctgca ggaaagcgcc caatccgctt ataaccgggt gctgaccagc   1200

aaagacacgc ccaacagcta ctacggcaac gcctggcggc tgttctctat gctgctcatg   1260

acgggcaact tccccaactt ctacgaaatg gccgacggcg gcgtcacacc ggttcccacc   1320

ctgccgccga cccagacgcc ccacgccacc caaccgccta cggcgactgc caccacctcc   1380

tccggcggcg gcgtttgtgc cgtggattac gtcattgcca accagtgggg caatggcttc   1440

caggccaacg tcaccatcac caatcacagc gccgtgccgg tgaacggcta cactctggcc   1500

tggacccacg cgccggggca ggttgtcacc aacggctgga acgtgaccat cgcccaaagc   1560

ggcagcgccg tcagcgccag caatccggcc agttattgga acggcgtgat tggagccaac   1620

ggcggcaagg tttcttttgg tttccaggga tctctggcag gcggcagcgc ggtcgcgccc   1680

acttattttg ccttgaacgg cgctgcctgt aacggggccg tccttccgcc caccgccacc   1740

agtgcgccac ccacacaaac ggccgttccg cccaccgcca ccacgatact gcccacacaa   1800

acggccgttc cccctaccgc caccacgata ccacccacac aaacggccgt tccgcccacc   1860

gccaccagtc aaccaccgac gcaaacggcc gttcccccca ccgccacagc taccccgcca   1920

tccggggttg gcgcctgcac cgttgtctac gccatcacca acgattgggg cagcggcttc   1980

accgccaacg tcaccctcac caataaaagc agcgccgccc ttaacggctg gactctggcc   2040

tatgccttcc ccggcaatca aaccatcagc aatgcctgga acggaacggc cgttcagtca   2100

ggtaaaaacg tcagcgtcac caatgtaggt tggaacggca gcctgccgcc caacggtgtc   2160

gccagctttg gcttccaggc gagctacagc ggtagtaaca gcgctcctac cagctttacg   2220

ctgaacgggc agcgatgcga ttga                                          2244

<210> 118
<211> 747
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(34)

<220> 
<221> DOMAIN
<222> (38)...(422)
<223> Glycosyl hydrolases family 8

<220> 
<221> DOMAIN
<222> (466)...(570)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (646)...(746)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (222)...(225)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (233)...(236)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (276)...(279)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (300)...(303)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (378)...(381)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (490)...(493)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (495)...(498)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (521)...(524)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (673)...(676)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (678)...(681)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (696)...(699)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (704)...(707)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (713)...(716)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (722)...(725)
<223> N-glycosylation site. Prosite id = PS00001

<400> 118
Met Lys Lys Arg Gln Gly Phe Ile Lys Lys Gly Leu Val Leu Gly Val 
1               5                   10                  15      


Ser Leu Leu Leu Leu Ala Leu Ile Met Met Ser Ala Thr Ser Gln Thr 
            20                  25                  30          


Ser Ala Ser Pro Gln Asn Pro Phe Leu Trp Pro Tyr Asn Gln Pro Thr 
        35                  40                  45              


His Ile Ser Phe Asn Glu Ala Asp Val Tyr Gln Ala Trp Thr Val Trp 
    50                  55                  60                  


Arg Asp Ala Gln Ile Thr Ala Asn Asn Ala Gly Gly Asn Gly Arg Tyr 
65                  70                  75                  80  


Arg Val Met Gly Gly Val Asp Gly Gly Ser Thr Val Ser Glu Gly Gln 
                85                  90                  95      


Ala Tyr Gly Ile Leu Phe Ala Ser Ile Phe Asp Glu Gln Thr Leu Phe 
            100                 105                 110         


Asp Gly Leu Phe Leu Phe Ala Lys Asp His Tyr Asn Glu Asn Gly Val 
        115                 120                 125             


Met Asp Trp His Ile Gly Ser Pro Gly Val Arg Ile Gly Ser Gly Gly 
    130                 135                 140                 


Ala Thr Asp Ala Glu Val Asp Met Ala Leu Gly Leu Val Asn Ala Cys 
145                 150                 155                 160 


Val Lys Val Gln Lys Asn Ala Trp Pro Ala Ser Pro Ala Gly Val Asn 
                165                 170                 175     


Tyr Cys Val Glu Ala Thr Asn Leu Ile Asn Ala Ile Tyr Thr Tyr Glu 
            180                 185                 190         


Val Asp His Ala Gly Ser Ala Pro Pro Gly Gly Leu Pro Asn Asn Gln 
        195                 200                 205             


Gly Asn Glu Leu Leu Pro Gly Asp Thr Trp Asn Ile Ser Glu Leu Tyr 
    210                 215                 220                 


Pro Gln Gly Ile Ile Asn Leu Ser Tyr Phe Pro Pro Gly Tyr Phe Thr 
225                 230                 235                 240 


Val Phe Gly Lys Phe Thr Gly Asn Glu Ala Ala Trp Asn Ala Val Ile 
                245                 250                 255     


Asn Arg Asn Tyr Gln Val Val Asp Met Val Gln Ala Lys Pro Asn Asn 
            260                 265                 270         


Cys Ser Gly Leu Val Pro Asn Trp Asn Asn Tyr Asn Gly Asp Ala Gln 
        275                 280                 285             


Leu Val Ser Trp Gln Thr Asn Asn Tyr Ser Trp Trp Ser Tyr Asp Ala 
    290                 295                 300                 


Ala Arg Phe Ala Trp Arg Ile Ala Val Asp Gln Ala Trp Tyr Gly Arg 
305                 310                 315                 320 


Ser Glu Ala Ala Glu Thr Met Asn Glu Ile Gly Gly Phe Phe Ser Ser 
                325                 330                 335     


Thr Gly Phe Asn Asn Ile Gly Glu His Asn Met Asn Gly Gln Arg Val 
            340                 345                 350         


Gly Ser Gly Pro Trp Pro Phe Phe Val Ala Asn Ala Ala Ala Ala Val 
        355                 360                 365             


Trp Ala Ala Pro Asn Ala Thr Ala Val Asn Cys Gly Thr Ala Thr Gly 
    370                 375                 380                 


Ser Leu Gln Glu Ser Ala Gln Ser Ala Tyr Asn Arg Val Leu Thr Ser 
385                 390                 395                 400 


Lys Asp Thr Pro Asn Ser Tyr Tyr Gly Asn Ala Trp Arg Leu Phe Ser 
                405                 410                 415     


Met Leu Leu Met Thr Gly Asn Phe Pro Asn Phe Tyr Glu Met Ala Asp 
            420                 425                 430         


Gly Gly Val Thr Pro Val Pro Thr Leu Pro Pro Thr Gln Thr Pro His 
        435                 440                 445             


Ala Thr Gln Pro Pro Thr Ala Thr Ala Thr Thr Ser Ser Gly Gly Gly 
    450                 455                 460                 


Val Cys Ala Val Asp Tyr Val Ile Ala Asn Gln Trp Gly Asn Gly Phe 
465                 470                 475                 480 


Gln Ala Asn Val Thr Ile Thr Asn His Ser Ala Val Pro Val Asn Gly 
                485                 490                 495     


Tyr Thr Leu Ala Trp Thr His Ala Pro Gly Gln Val Val Thr Asn Gly 
            500                 505                 510         


Trp Asn Val Thr Ile Ala Gln Ser Gly Ser Ala Val Ser Ala Ser Asn 
        515                 520                 525             


Pro Ala Ser Tyr Trp Asn Gly Val Ile Gly Ala Asn Gly Gly Lys Val 
    530                 535                 540                 


Ser Phe Gly Phe Gln Gly Ser Leu Ala Gly Gly Ser Ala Val Ala Pro 
545                 550                 555                 560 


Thr Tyr Phe Ala Leu Asn Gly Ala Ala Cys Asn Gly Ala Val Leu Pro 
                565                 570                 575     


Pro Thr Ala Thr Ser Ala Pro Pro Thr Gln Thr Ala Val Pro Pro Thr 
            580                 585                 590         


Ala Thr Thr Ile Leu Pro Thr Gln Thr Ala Val Pro Pro Thr Ala Thr 
        595                 600                 605             


Thr Ile Pro Pro Thr Gln Thr Ala Val Pro Pro Thr Ala Thr Ser Gln 
    610                 615                 620                 


Pro Pro Thr Gln Thr Ala Val Pro Pro Thr Ala Thr Ala Thr Pro Pro 
625                 630                 635                 640 


Ser Gly Val Gly Ala Cys Thr Val Val Tyr Ala Ile Thr Asn Asp Trp 
                645                 650                 655     


Gly Ser Gly Phe Thr Ala Asn Val Thr Leu Thr Asn Lys Ser Ser Ala 
            660                 665                 670         


Ala Leu Asn Gly Trp Thr Leu Ala Tyr Ala Phe Pro Gly Asn Gln Thr 
        675                 680                 685             


Ile Ser Asn Ala Trp Asn Gly Thr Ala Val Gln Ser Gly Lys Asn Val 
    690                 695                 700                 


Ser Val Thr Asn Val Gly Trp Asn Gly Ser Leu Pro Pro Asn Gly Val 
705                 710                 715                 720 


Ala Ser Phe Gly Phe Gln Ala Ser Tyr Ser Gly Ser Asn Ser Ala Pro 
                725                 730                 735     


Thr Ser Phe Thr Leu Asn Gly Gln Arg Cys Asp 
            740                 745         


<210> 119
<211> 3450
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 119
atgtctcgta atattaggaa aagttcattc attttttctc tgctgacgat cattgtattg     60

attgccagca tgttcctgca aacccaaacg gcacaggcaa tcagcactcc ctggttgagc    120

acctctggca gattcatccg ggatccgcag ggcaataatg tcgttctgcg aggtgtctcg    180

ctggtggata ttggtgaggt gaaccttggg cggacgcgca atgtcagcca gctgattaat    240

atggcgacca atgaagccga tggctggtat gcgcgtgtag tgcgcctgcc agtctatccg    300

aatgcgattg atagttcgcc tggctggctg gcaaacccgg atgcttattt caataaccat    360

ctcaacccag ctattcagaa ctgtgtggcg cgccagatct actgcatcat cgactggcac    420

tatatcgcgg actataacaa cagcacgatt gacacaaaca cacgcgcctt ctggaactat    480

gtggcgccac gttatgctaa tactccgaat gtaatcttcg agttgtacaa tgaaccagtc    540

aaccctgata actggtcaac gtggaagcaa tgggcgcaac cctgggtcga tatcattcgc    600

tctcatgcgc cgaacaactt gatcctgatc ggtggtccgc gctggtcgca gaatctttcg    660

agcgcggcga gcagtccatt tactggcagt aatcttgtgt atgttgctca catttatccc    720

gaacacggcg gacaaagcaa ctgggattca tggttcggca atgccgcgaa ctctgttccc    780

ttctttgtca ctgagtgggg ctggatacag ggcggcgcca ccccaactaa tggcacacag    840

tctggctacg gtgttccgtt cagtaactac cttgaatcaa agggcttgag ttggaccgcc    900

tgggtctttg atcaatattg ggatcctaaa atgtgggatg agaactggaa cctgctcggc    960

ggtgaaaatt tcatgggaca attcacaaag gatttgctct tcgcacaccg caatgacagt   1020

ttgcccagta gtaccaacac accgggtgga cccactgcca cgcgtacaaa tacatcgcct   1080

ccaccaacag cgacgaatac atcagcgagt ggtggagcgc tgaaagtcca attggtaacg   1140

ggcggcaccg agaatagcca acaatctgct ttccattata agattttgaa cacgggcgcc   1200

agtgctcaat ctaatatttc tgtgcgcatt tactttacgt tggatggttc gcaagcagcc   1260

tccaaatatg ttctcgagaa atattatgac caatctggag tagcgacgat ttccggacca   1320

acccaggtat caggctcttc ttattacttt acagtgagct atggtacgac tgctcttgcc   1380

gcaggtgccg gttgggaata tcacaccgca ctccgcctga gtgattggag cgcaaatttc   1440

tcaagcgcga acgattggtg gcgtgcaact ggctctatgc cagccagtta cacggactgg   1500

cccacgatcc ctgcttatgt gaacggctcg ctagtgtggg gtagtgcacc aggtggtgga   1560

ccaaccgcta ccaatacgcc tgttacacca actgccacat tcacacgtac gaacacaccc   1620

agtggtccga cttttactcc tacacggacc aacacaccga tcacaccgac ggccacattc   1680

acacgtacga acacgcccag cgggccaacc ttcactttca cacccactgc gactgctacc   1740

cgtacgaata cccccagtgg tccaaccacg cttaaggttc aatataaagc tgcggataca   1800

aacgctggtg ataaccagat caagccgcac ttcaacattg tcaacacagg tgcaagcgcg   1860

gtacctctgg gtgaattgaa gatccgttat tggtacacgc gtgaaggaac agtggggcag   1920

actttcttct gcgattactc tgccattaca ggtgggtgtg gcaacctcag tggagcgttt   1980

gtgcaagtca gcccggctcg aacaggggca gatttctacc tggaaatcag cttcaatact   2040

gcggcaggat ccatcgcggc aggcggtcag agcggagaaa ttcaagctcg ctttgctaag   2100

accgattggt cgaattacaa cgagaccggc gattattcct ttgaccccac caagactgcc   2160

ttcgccgatt ggacaaatgt gactctctat cgcaacggag cgctcgtctg gggcactgaa   2220

cctggtggtg gcggacccac caacacacca accgctacga atacacccgg tggaccaact   2280

aatacgccaa cccgtaccaa cactccgatc actccaacct ttacaccgac tcgaacaaat   2340

actccaggcg gtccgaccaa tacaccaact cgtacaccga ctgcaacatt gacgccacct   2400

cctggaacgc atttggataa tccctttgtg ggtgcgacgt tctatagaaa tgtggattac   2460

gttgcttcag tcaacgccgc agcggattca cagaccggga cgctcgccgc gcagatgagg   2520

ttggttgcca attacccgac ctttgtctgg ctggatagca ttgacgcggt caacggcacg   2580

aacggttatc ccagaagttt ggctggtcat ttgaatgcag ctattactca gggtgccaac   2640

gccattggca ttgtcgtcta tgacctgccc aaccgcgact gctcagcctt agcctctaac   2700

ggcgaattgc tcatcgcaca aaacggactc aatcgctaca agaccgaata cattgatgcg   2760

atctacaaca cgattagcca accacaatat agtaatttgc gcatcatcat ggtcatcgag   2820

ccagattctc tgccgaacct agtcaccaac ttgagcttcg ccaaatgctc tgaagctcaa   2880

tcaacgggtg cttatgtaca gggcgttcaa tatgcccttg gcacattgcg ctcactcaat   2940

aacacttacg cctatattga tgtggcgcat gcggcgtggc tcggctggcc ttccaacttt   3000

actccatttg tgaacttgct caagacagtt ggcacaggta ttcccggcgg caacagcaaa   3060

gtggatggtt tcatcagcaa cacagccaac tacaacccag tggatgaacc attcatggat   3120

gcgaacacga tggttggtgg caatcccgtt cgctcactgc aaggttggta cgactggaat   3180

gattacattg atgaacaacc ctacatcctt gctctgcgga ctgcgctcac gacgggaacc   3240

gacgcgtacc caaccagtgt gggcgtgatc attgatactt cgcgcaatgg ctggggtggt   3300

acgaaccgtc cgactgctgc aagtacatca acagtactaa gcacctttgt gatggaaagc   3360

cgaatcgata aacgcatcca caaaggcaat tctttggatc cactagtgtc gacctgcagg   3420

cgcgcgagct ccagcttttg ttccctttag                                    3450

<210> 120
<211> 1149
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(33)

<220> 
<221> DOMAIN
<222> (44)...(311)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (593)...(677)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (816)...(1148)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (75)...(78)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (148)...(151)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (173)...(182)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (186)...(189)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (221)...(224)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (281)...(284)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (343)...(346)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (371)...(374)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (411)...(414)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (486)...(489)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (515)...(518)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (665)...(668)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (717)...(720)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (737)...(740)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (871)...(874)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (898)...(914)
<223> Glycosyl hydrolases family 6 signature 1. Prosite id = PS00655

<220> 
<221> SITE
<222> (964)...(967)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (995)...(998)
<223> N-glycosylation site. Prosite id = PS00001

<400> 120
Met Ser Arg Asn Ile Arg Lys Ser Ser Phe Ile Phe Ser Leu Leu Thr 
1               5                   10                  15      


Ile Ile Val Leu Ile Ala Ser Met Phe Leu Gln Thr Gln Thr Ala Gln 
            20                  25                  30          


Ala Ile Ser Thr Pro Trp Leu Ser Thr Ser Gly Arg Phe Ile Arg Asp 
        35                  40                  45              


Pro Gln Gly Asn Asn Val Val Leu Arg Gly Val Ser Leu Val Asp Ile 
    50                  55                  60                  


Gly Glu Val Asn Leu Gly Arg Thr Arg Asn Val Ser Gln Leu Ile Asn 
65                  70                  75                  80  


Met Ala Thr Asn Glu Ala Asp Gly Trp Tyr Ala Arg Val Val Arg Leu 
                85                  90                  95      


Pro Val Tyr Pro Asn Ala Ile Asp Ser Ser Pro Gly Trp Leu Ala Asn 
            100                 105                 110         


Pro Asp Ala Tyr Phe Asn Asn His Leu Asn Pro Ala Ile Gln Asn Cys 
        115                 120                 125             


Val Ala Arg Gln Ile Tyr Cys Ile Ile Asp Trp His Tyr Ile Ala Asp 
    130                 135                 140                 


Tyr Asn Asn Ser Thr Ile Asp Thr Asn Thr Arg Ala Phe Trp Asn Tyr 
145                 150                 155                 160 


Val Ala Pro Arg Tyr Ala Asn Thr Pro Asn Val Ile Phe Glu Leu Tyr 
                165                 170                 175     


Asn Glu Pro Val Asn Pro Asp Asn Trp Ser Thr Trp Lys Gln Trp Ala 
            180                 185                 190         


Gln Pro Trp Val Asp Ile Ile Arg Ser His Ala Pro Asn Asn Leu Ile 
        195                 200                 205             


Leu Ile Gly Gly Pro Arg Trp Ser Gln Asn Leu Ser Ser Ala Ala Ser 
    210                 215                 220                 


Ser Pro Phe Thr Gly Ser Asn Leu Val Tyr Val Ala His Ile Tyr Pro 
225                 230                 235                 240 


Glu His Gly Gly Gln Ser Asn Trp Asp Ser Trp Phe Gly Asn Ala Ala 
                245                 250                 255     


Asn Ser Val Pro Phe Phe Val Thr Glu Trp Gly Trp Ile Gln Gly Gly 
            260                 265                 270         


Ala Thr Pro Thr Asn Gly Thr Gln Ser Gly Tyr Gly Val Pro Phe Ser 
        275                 280                 285             


Asn Tyr Leu Glu Ser Lys Gly Leu Ser Trp Thr Ala Trp Val Phe Asp 
    290                 295                 300                 


Gln Tyr Trp Asp Pro Lys Met Trp Asp Glu Asn Trp Asn Leu Leu Gly 
305                 310                 315                 320 


Gly Glu Asn Phe Met Gly Gln Phe Thr Lys Asp Leu Leu Phe Ala His 
                325                 330                 335     


Arg Asn Asp Ser Leu Pro Ser Ser Thr Asn Thr Pro Gly Gly Pro Thr 
            340                 345                 350         


Ala Thr Arg Thr Asn Thr Ser Pro Pro Pro Thr Ala Thr Asn Thr Ser 
        355                 360                 365             


Ala Ser Gly Gly Ala Leu Lys Val Gln Leu Val Thr Gly Gly Thr Glu 
    370                 375                 380                 


Asn Ser Gln Gln Ser Ala Phe His Tyr Lys Ile Leu Asn Thr Gly Ala 
385                 390                 395                 400 


Ser Ala Gln Ser Asn Ile Ser Val Arg Ile Tyr Phe Thr Leu Asp Gly 
                405                 410                 415     


Ser Gln Ala Ala Ser Lys Tyr Val Leu Glu Lys Tyr Tyr Asp Gln Ser 
            420                 425                 430         


Gly Val Ala Thr Ile Ser Gly Pro Thr Gln Val Ser Gly Ser Ser Tyr 
        435                 440                 445             


Tyr Phe Thr Val Ser Tyr Gly Thr Thr Ala Leu Ala Ala Gly Ala Gly 
    450                 455                 460                 


Trp Glu Tyr His Thr Ala Leu Arg Leu Ser Asp Trp Ser Ala Asn Phe 
465                 470                 475                 480 


Ser Ser Ala Asn Asp Trp Trp Arg Ala Thr Gly Ser Met Pro Ala Ser 
                485                 490                 495     


Tyr Thr Asp Trp Pro Thr Ile Pro Ala Tyr Val Asn Gly Ser Leu Val 
            500                 505                 510         


Trp Gly Ser Ala Pro Gly Gly Gly Pro Thr Ala Thr Asn Thr Pro Val 
        515                 520                 525             


Thr Pro Thr Ala Thr Phe Thr Arg Thr Asn Thr Pro Ser Gly Pro Thr 
    530                 535                 540                 


Phe Thr Pro Thr Arg Thr Asn Thr Pro Ile Thr Pro Thr Ala Thr Phe 
545                 550                 555                 560 


Thr Arg Thr Asn Thr Pro Ser Gly Pro Thr Phe Thr Phe Thr Pro Thr 
                565                 570                 575     


Ala Thr Ala Thr Arg Thr Asn Thr Pro Ser Gly Pro Thr Thr Leu Lys 
            580                 585                 590         


Val Gln Tyr Lys Ala Ala Asp Thr Asn Ala Gly Asp Asn Gln Ile Lys 
        595                 600                 605             


Pro His Phe Asn Ile Val Asn Thr Gly Ala Ser Ala Val Pro Leu Gly 
    610                 615                 620                 


Glu Leu Lys Ile Arg Tyr Trp Tyr Thr Arg Glu Gly Thr Val Gly Gln 
625                 630                 635                 640 


Thr Phe Phe Cys Asp Tyr Ser Ala Ile Thr Gly Gly Cys Gly Asn Leu 
                645                 650                 655     


Ser Gly Ala Phe Val Gln Val Ser Pro Ala Arg Thr Gly Ala Asp Phe 
            660                 665                 670         


Tyr Leu Glu Ile Ser Phe Asn Thr Ala Ala Gly Ser Ile Ala Ala Gly 
        675                 680                 685             


Gly Gln Ser Gly Glu Ile Gln Ala Arg Phe Ala Lys Thr Asp Trp Ser 
    690                 695                 700                 


Asn Tyr Asn Glu Thr Gly Asp Tyr Ser Phe Asp Pro Thr Lys Thr Ala 
705                 710                 715                 720 


Phe Ala Asp Trp Thr Asn Val Thr Leu Tyr Arg Asn Gly Ala Leu Val 
                725                 730                 735     


Trp Gly Thr Glu Pro Gly Gly Gly Gly Pro Thr Asn Thr Pro Thr Ala 
            740                 745                 750         


Thr Asn Thr Pro Gly Gly Pro Thr Asn Thr Pro Thr Arg Thr Asn Thr 
        755                 760                 765             


Pro Ile Thr Pro Thr Phe Thr Pro Thr Arg Thr Asn Thr Pro Gly Gly 
    770                 775                 780                 


Pro Thr Asn Thr Pro Thr Arg Thr Pro Thr Ala Thr Leu Thr Pro Pro 
785                 790                 795                 800 


Pro Gly Thr His Leu Asp Asn Pro Phe Val Gly Ala Thr Phe Tyr Arg 
                805                 810                 815     


Asn Val Asp Tyr Val Ala Ser Val Asn Ala Ala Ala Asp Ser Gln Thr 
            820                 825                 830         


Gly Thr Leu Ala Ala Gln Met Arg Leu Val Ala Asn Tyr Pro Thr Phe 
        835                 840                 845             


Val Trp Leu Asp Ser Ile Asp Ala Val Asn Gly Thr Asn Gly Tyr Pro 
    850                 855                 860                 


Arg Ser Leu Ala Gly His Leu Asn Ala Ala Ile Thr Gln Gly Ala Asn 
865                 870                 875                 880 


Ala Ile Gly Ile Val Val Tyr Asp Leu Pro Asn Arg Asp Cys Ser Ala 
                885                 890                 895     


Leu Ala Ser Asn Gly Glu Leu Leu Ile Ala Gln Asn Gly Leu Asn Arg 
            900                 905                 910         


Tyr Lys Thr Glu Tyr Ile Asp Ala Ile Tyr Asn Thr Ile Ser Gln Pro 
        915                 920                 925             


Gln Tyr Ser Asn Leu Arg Ile Ile Met Val Ile Glu Pro Asp Ser Leu 
    930                 935                 940                 


Pro Asn Leu Val Thr Asn Leu Ser Phe Ala Lys Cys Ser Glu Ala Gln 
945                 950                 955                 960 


Ser Thr Gly Ala Tyr Val Gln Gly Val Gln Tyr Ala Leu Gly Thr Leu 
                965                 970                 975     


Arg Ser Leu Asn Asn Thr Tyr Ala Tyr Ile Asp Val Ala His Ala Ala 
            980                 985                 990         


Trp Leu Gly Trp Pro Ser Asn Phe  Thr Pro Phe Val Asn  Leu Leu Lys 
        995                 1000                 1005             


Thr Val  Gly Thr Gly Ile Pro  Gly Gly Asn Ser Lys  Val Asp Gly 
    1010                 1015                 1020             


Phe Ile  Ser Asn Thr Ala Asn  Tyr Asn Pro Val Asp  Glu Pro Phe 
    1025                 1030                 1035             


Met Asp  Ala Asn Thr Met Val  Gly Gly Asn Pro Val  Arg Ser Leu 
    1040                 1045                 1050             


Gln Gly  Trp Tyr Asp Trp Asn  Asp Tyr Ile Asp Glu  Gln Pro Tyr 
    1055                 1060                 1065             


Ile Leu  Ala Leu Arg Thr Ala  Leu Thr Thr Gly Thr  Asp Ala Tyr 
    1070                 1075                 1080             


Pro Thr  Ser Val Gly Val Ile  Ile Asp Thr Ser Arg  Asn Gly Trp 
    1085                 1090                 1095             


Gly Gly  Thr Asn Arg Pro Thr  Ala Ala Ser Thr Ser  Thr Val Leu 
    1100                 1105                 1110             


Ser Thr  Phe Val Met Glu Ser  Arg Ile Asp Lys Arg  Ile His Lys 
    1115                 1120                 1125             


Gly Asn  Ser Leu Asp Pro Leu  Val Ser Thr Cys Arg  Arg Ala Ser 
    1130                 1135                 1140             


Ser Ser  Phe Cys Ser Leu 
    1145                 


<210> 121
<211> 1158
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 121
gtgctgatcc ggctggcggc ggccggcgcc ctgctgctcg gcgccgtctt cgtcgcggtg     60

agcccggcgg ccgcggccac cgcctgcgac gtcacctaca ccgccaacca gtggagcacc    120

ggcttcaccg ccgacgtgcg ggtcaccaac aacggcgcgc cgatcaacgg ctgggcggtg    180

acctggacgt tcaccggcaa ccagcaggtc acctccggct ggaacgcgca gctgacccag    240

tccggggcca cggtgacggc cgcggcgccg tcctacaacc agaccctggc caccggggcc    300

tcggccggct tcggcttcca ggcgacgtac tcgggcagca acccggcgcc ggcgtccttc    360

gccctcaacg gcgtctcgtg caacggcgaa gccccaccga cgtcggcccc accgacgccg    420

agcacgccgc cgacctcgat cccgccgtct cccagcaccc caccgaccac gaccccgccg    480

ccggccggct gcaccaccgg ggtccgctgt gacggcttcg agggcaccca ggccgactgg    540

gcggtgacct acccggactg ctccggcgcg ggcaaggcgg ccttcgacac cgcagtcgcc    600

cacggcggcg gcacctcgct gcggatcgac ggggcggcgg gttactgcaa ccacgtcttc    660

atgcggaaca ccgcgctgat cccggtcggg gccaccgcgc tgttcgtgcg ctactgggtg    720

cggcacacca ccgcactgcc ggccgcgcac accacggcgg tggcgttgcg ggacgccaac    780

gacggccacc gggacctgcg cttcggcggc cagaacggcg ccctgcaatg gaaccgggcg    840

tccgacgacg ccacgctgcc ggaacagagc ccggccgggg tggcgttgtc cgcgccgctg    900

ccgacgggca cctggaactg cgtggagttc aaggtcgacc agggcaacgg cacgatgcag    960

acctggctca acggcacatc cgtgcccggc ctgctgcagg acggcgtgcc gacgcacgac   1020

atcgacggcc agtggctgaa caagacctgg cggccggccc tgaccgacct gcggctgggc   1080

tgggagagct acggcgaagg cgcagacacc ctctggtacg acgacgtcgc cgtgggcacc   1140

acccgcgtcg gctgctga                                                 1158

<210> 122
<211> 385
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (29)...(127)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (94)...(97)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (320)...(323)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (352)...(355)
<223> N-glycosylation site. Prosite id = PS00001

<400> 122
Met Leu Ile Arg Leu Ala Ala Ala Gly Ala Leu Leu Leu Gly Ala Val 
1               5                   10                  15      


Phe Val Ala Val Ser Pro Ala Ala Ala Ala Thr Ala Cys Asp Val Thr 
            20                  25                  30          


Tyr Thr Ala Asn Gln Trp Ser Thr Gly Phe Thr Ala Asp Val Arg Val 
        35                  40                  45              


Thr Asn Asn Gly Ala Pro Ile Asn Gly Trp Ala Val Thr Trp Thr Phe 
    50                  55                  60                  


Thr Gly Asn Gln Gln Val Thr Ser Gly Trp Asn Ala Gln Leu Thr Gln 
65                  70                  75                  80  


Ser Gly Ala Thr Val Thr Ala Ala Ala Pro Ser Tyr Asn Gln Thr Leu 
                85                  90                  95      


Ala Thr Gly Ala Ser Ala Gly Phe Gly Phe Gln Ala Thr Tyr Ser Gly 
            100                 105                 110         


Ser Asn Pro Ala Pro Ala Ser Phe Ala Leu Asn Gly Val Ser Cys Asn 
        115                 120                 125             


Gly Glu Ala Pro Pro Thr Ser Ala Pro Pro Thr Pro Ser Thr Pro Pro 
    130                 135                 140                 


Thr Ser Ile Pro Pro Ser Pro Ser Thr Pro Pro Thr Thr Thr Pro Pro 
145                 150                 155                 160 


Pro Ala Gly Cys Thr Thr Gly Val Arg Cys Asp Gly Phe Glu Gly Thr 
                165                 170                 175     


Gln Ala Asp Trp Ala Val Thr Tyr Pro Asp Cys Ser Gly Ala Gly Lys 
            180                 185                 190         


Ala Ala Phe Asp Thr Ala Val Ala His Gly Gly Gly Thr Ser Leu Arg 
        195                 200                 205             


Ile Asp Gly Ala Ala Gly Tyr Cys Asn His Val Phe Met Arg Asn Thr 
    210                 215                 220                 


Ala Leu Ile Pro Val Gly Ala Thr Ala Leu Phe Val Arg Tyr Trp Val 
225                 230                 235                 240 


Arg His Thr Thr Ala Leu Pro Ala Ala His Thr Thr Ala Val Ala Leu 
                245                 250                 255     


Arg Asp Ala Asn Asp Gly His Arg Asp Leu Arg Phe Gly Gly Gln Asn 
            260                 265                 270         


Gly Ala Leu Gln Trp Asn Arg Ala Ser Asp Asp Ala Thr Leu Pro Glu 
        275                 280                 285             


Gln Ser Pro Ala Gly Val Ala Leu Ser Ala Pro Leu Pro Thr Gly Thr 
    290                 295                 300                 


Trp Asn Cys Val Glu Phe Lys Val Asp Gln Gly Asn Gly Thr Met Gln 
305                 310                 315                 320 


Thr Trp Leu Asn Gly Thr Ser Val Pro Gly Leu Leu Gln Asp Gly Val 
                325                 330                 335     


Pro Thr His Asp Ile Asp Gly Gln Trp Leu Asn Lys Thr Trp Arg Pro 
            340                 345                 350         


Ala Leu Thr Asp Leu Arg Leu Gly Trp Glu Ser Tyr Gly Glu Gly Ala 
        355                 360                 365             


Asp Thr Leu Trp Tyr Asp Asp Val Ala Val Gly Thr Thr Arg Val Gly 
    370                 375                 380                 


Cys 
385 


<210> 123
<211> 1356
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 123
atgcgcttga agaccctcgc caccgccacg gcggccgccg ccgtggtcgc cggcaccgcc     60

gtgctctggc ccggctccgc ctccgccgcc gagtcaccct tctatgtcga tccgcagacc    120

ggcgccgccc gctgggtcgc cgccaacccg ggcgactcca aggccgcggt gatccgcgac    180

cggatcgcca gcgtcccgca gggccgctgg tacacccaga acaacaccgc cacggtcgcc    240

gccgaggtgg actccttcgt gggcgccgcc gccgcggccg gcaagacgcc gatcatggtc    300

gtctacaaca tccccaaccg cgactgcagc ggcgccagct ccggtggcct ggccaaccac    360

accgtctacc ggcagtggat cgaccaggtg gccgccgggc tcaagggccg cgcggccgcg    420

atcatcctgg aaccggacgt gttgccgatc atgtcgagct gcatgtcgtc cgcgcagcag    480

gaagaggtct acgcctcgat ggcgtacgcg ggcaagaagc tcaaggccgc ctcgtccgcg    540

gccaaggtct acttcgacgc cgggcactcg gcctggctgt ccccggggga catggccgcc    600

cggctggtcc gcgccgacat cgccaacagc gccgacggca tctcggtcaa cgtctccaac    660

taccgcacca ccgccgagtc gaccacctat gtccgcaacg tgctggccgc ggtcggcgtc    720

tcccggctgc gcggcgtgat cgacaccagc cgcaacggca acggcccggc cggcagcgag    780

tggtgtgacc cgggcggacg ggcgatcggc attcccagca cgaacgcggt gtccgatgcg    840

atcctggacg cgtacctgtg gatcaagctg cccggcgaag ccgacggctg catcgccggg    900

gccggtcagt tcgtcccgca gcgcgcctac gacctggccg tcgcggccgg gccgtacaca    960

ccgccgccgg cgaccacccc accgaccacc accccgccgg ccaccacccc gccggtcacg   1020

ccgccggtga ccaccccgcc gccggccggt ggctgctcgg tgtcctacac ggcgaactcc   1080

tggtcgaacg gcttcaccgc cgatgtccgg atcaccaacc gcggcgccgc gctgagctcc   1140

tggacgctga ccttcaccgt gcccgccaac gtcaccctga gcagcggctg gagcggcacc   1200

tggagccagt ccggcagcac gatcacggtc cggaacgcgg cctggaacgg cgcgctcggc   1260

agcggcgcca ccaccagcac cggcttccag gcgaccttca ccggtgcggc cccggccagc   1320

ccgaccggct tcgcgctgaa cggcacgccg tgctga                             1356

<210> 124
<211> 451
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (36)...(311)
<223> Glycosyl hydrolases family 6

<220> 
<221> DOMAIN
<222> (352)...(451)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (75)...(78)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (101)...(117)
<223> Glycosyl hydrolases family 6 signature 1. Prosite id = PS00655

<220> 
<221> SITE
<222> (120)...(123)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (220)...(223)
<223> N-glycosylation site. Prosite id = PS00001

<400> 124
Met Arg Leu Lys Thr Leu Ala Thr Ala Thr Ala Ala Ala Ala Val Val 
1               5                   10                  15      


Ala Gly Thr Ala Val Leu Trp Pro Gly Ser Ala Ser Ala Ala Glu Ser 
            20                  25                  30          


Pro Phe Tyr Val Asp Pro Gln Thr Gly Ala Ala Arg Trp Val Ala Ala 
        35                  40                  45              


Asn Pro Gly Asp Ser Lys Ala Ala Val Ile Arg Asp Arg Ile Ala Ser 
    50                  55                  60                  


Val Pro Gln Gly Arg Trp Tyr Thr Gln Asn Asn Thr Ala Thr Val Ala 
65                  70                  75                  80  


Ala Glu Val Asp Ser Phe Val Gly Ala Ala Ala Ala Ala Gly Lys Thr 
                85                  90                  95      


Pro Ile Met Val Val Tyr Asn Ile Pro Asn Arg Asp Cys Ser Gly Ala 
            100                 105                 110         


Ser Ser Gly Gly Leu Ala Asn His Thr Val Tyr Arg Gln Trp Ile Asp 
        115                 120                 125             


Gln Val Ala Ala Gly Leu Lys Gly Arg Ala Ala Ala Ile Ile Leu Glu 
    130                 135                 140                 


Pro Asp Val Leu Pro Ile Met Ser Ser Cys Met Ser Ser Ala Gln Gln 
145                 150                 155                 160 


Glu Glu Val Tyr Ala Ser Met Ala Tyr Ala Gly Lys Lys Leu Lys Ala 
                165                 170                 175     


Ala Ser Ser Ala Ala Lys Val Tyr Phe Asp Ala Gly His Ser Ala Trp 
            180                 185                 190         


Leu Ser Pro Gly Asp Met Ala Ala Arg Leu Val Arg Ala Asp Ile Ala 
        195                 200                 205             


Asn Ser Ala Asp Gly Ile Ser Val Asn Val Ser Asn Tyr Arg Thr Thr 
    210                 215                 220                 


Ala Glu Ser Thr Thr Tyr Val Arg Asn Val Leu Ala Ala Val Gly Val 
225                 230                 235                 240 


Ser Arg Leu Arg Gly Val Ile Asp Thr Ser Arg Asn Gly Asn Gly Pro 
                245                 250                 255     


Ala Gly Ser Glu Trp Cys Asp Pro Gly Gly Arg Ala Ile Gly Ile Pro 
            260                 265                 270         


Ser Thr Asn Ala Val Ser Asp Ala Ile Leu Asp Ala Tyr Leu Trp Ile 
        275                 280                 285             


Lys Leu Pro Gly Glu Ala Asp Gly Cys Ile Ala Gly Ala Gly Gln Phe 
    290                 295                 300                 


Val Pro Gln Arg Ala Tyr Asp Leu Ala Val Ala Ala Gly Pro Tyr Thr 
305                 310                 315                 320 


Pro Pro Pro Ala Thr Thr Pro Pro Thr Thr Thr Pro Pro Ala Thr Thr 
                325                 330                 335     


Pro Pro Val Thr Pro Pro Val Thr Thr Pro Pro Pro Ala Gly Gly Cys 
            340                 345                 350         


Ser Val Ser Tyr Thr Ala Asn Ser Trp Ser Asn Gly Phe Thr Ala Asp 
        355                 360                 365             


Val Arg Ile Thr Asn Arg Gly Ala Ala Leu Ser Ser Trp Thr Leu Thr 
    370                 375                 380                 


Phe Thr Val Pro Ala Asn Val Thr Leu Ser Ser Gly Trp Ser Gly Thr 
385                 390                 395                 400 


Trp Ser Gln Ser Gly Ser Thr Ile Thr Val Arg Asn Ala Ala Trp Asn 
                405                 410                 415     


Gly Ala Leu Gly Ser Gly Ala Thr Thr Ser Thr Gly Phe Gln Ala Thr 
            420                 425                 430         


Phe Thr Gly Ala Ala Pro Ala Ser Pro Thr Gly Phe Ala Leu Asn Gly 
        435                 440                 445             


Thr Pro Cys 
    450     


<210> 125
<211> 2370
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 125
atgttcaagc gcaatacggt tcgtgtgggc tgttttatcg cggtcaccgc gatttgtgca     60

atgctgcttt tccatgtgcc atccgttgtt tcggcgcgta ccaacgccgc cggtgcgggc    120

tactggcaca cgagcggcaa tcagatcctc gacgccaaca atcaacctgt ccgcatcgcg    180

ggcatcaact ggttcggtat ggaaacgagc aactatgcgc cgcacggttt gtggacgcgt    240

gattacaaat ccatgctcga ccagatcaaa caacagggct acaacacggt ccgcttgccg    300

tattcgaacc aactgtttga cgcgggcagc gttccgaatg gcattgactt ttcgaatggc    360

aaaaacgccg accttaaagg gttgaatggt ctgcaaatca tggacaagct ggtggcatac    420

gcggggcaaa agggcttgcg catcgtgctt gaccgccacc gccccgacag cggcggtcag    480

tctgaactgt ggtacacgag ccggtattcc gagcagcgtt ggattaacga ttggaaaatg    540

cttgccgggc gctatgccaa caatcccacg gtcatcggcg cagacctgca caacgagccg    600

cacggtcccg cgtgttgggg ctgcggcgac actgccaccg actggcgtct cgccgccgag    660

cgcgcaggga atgcgattca cacggtcaat tccaactggc tgatatttgt cgaaggcatc    720

ggctgtgtga atggtgattg cagctggtgg ggtggtcagt tgaaaaacgc gggtcagtat    780

cccgtacgtc tgaccgtcgc gaaccgactc gtctattccg ctcacgatta tcccgcttca    840

ctttacccgc agacgtggtt cagcgcccct aactatccga acaatttgcc gggggtgtgg    900

gacaattatt ggggctatct ccacaaacag aatatcgcgc cggtgcttgt cggggaattt    960

ggctcgaaac ttcaaacgac ctctgaccgc cagtggctca acaagctgac gcaatatctc   1020

ggcagcaacg gcatcagctg gacgttctgg agttggaatc cgaattcggg cgacacgggc   1080

ggcattctga atgatgattg gacgacgatc aacaccgaca aacaatcgta tctcgccggc   1140

ggcacggatg cgacaggtgt aacgcaccag tctatcctgt tcccgctcga tgtgggcggc   1200

acaccgcagc catccgcaac atttacgcgg accagtaccc ggacccgcac gccgaccgcg   1260

tgcggcaact gcccgaccgc aacgccgacg cggacgcgca ctttgaccgc gacgtcaacg   1320

cgcaccaata cacccagccc gaacaatgga ctcaagctgc aataccgcgt cggtgatggc   1380

acatcggcaa acgacaatca aatcaagccc caattccgaa ttgtcaacaa cggcgcatcg   1440

aacgtgccat tgagcgagct cgaaatccga tactggtaca cttccgaagg caatcaagcg   1500

caatcctact ggtgcgacta tgctacgctc aattgcgcca acatcacggg cagctttgtc   1560

aaattgcaaa acgcggtcaa tggcgcaaat ggatacttgc ggttgaaatt caccggggga   1620

caggtgaatg ccaattccaa cacgggtgag attcaaaatc gtttcaacaa gagcgattgg   1680

tccaattaca gcgagggcga cgattattca tacgacccga ccaaaacctc gttcaccgat   1740

tggaacaagg tgacgctcta tcgcaatggc gcgctggtct ggggcattga acccacaggc   1800

acaaagtcag tgactgccac gcccacgcgc acgcgcacca atgtgccggg gaaaaactct   1860

ccaactccaa cacccacgct gacggcaatg ccaactgaag ctcccgcggt cagtatgcat   1920

gttcaatatc gtaatggcga aaatcccgcc aaacccaaga acgcgcatct ccgcccgcag   1980

tttcgcctca tcaatgacgg tggtacgcaa gtctcgctca aggacgtgac gattcgttac   2040

tggttcacga ggcagggcga atccaaatac aagttttcgt gcgaggcggc cgcgctggac   2100

tgtgccaaca ttcgcggcaa ggtcgtcaat ctgccctcgg cgcgacccgg cgcgaatgcc   2160

tatctcgaag tgcgcttccg ccccgcagct gggatgcttg cacccggcgc caatacgggc   2220

gagatcgtca cgcacctggt caaaaaggac aagtcaaagc ataacgagaa tcgagactat   2280

tcgtttaaca agacccagtt tgactttgcc gaccgcgccg agattacggt ctatcggaat   2340

ggtgcgctcg tcggcggtat cgagccgtaa                                    2370

<210> 126
<211> 789
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(32)

<220> 
<221> DOMAIN
<222> (47)...(360)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (453)...(536)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (641)...(725)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (521)...(524)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (564)...(567)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (570)...(573)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (774)...(777)
<223> N-glycosylation site. Prosite id = PS00001

<400> 126
Met Phe Lys Arg Asn Thr Val Arg Val Gly Cys Phe Ile Ala Val Thr 
1               5                   10                  15      


Ala Ile Cys Ala Met Leu Leu Phe His Val Pro Ser Val Val Ser Ala 
            20                  25                  30          


Arg Thr Asn Ala Ala Gly Ala Gly Tyr Trp His Thr Ser Gly Asn Gln 
        35                  40                  45              


Ile Leu Asp Ala Asn Asn Gln Pro Val Arg Ile Ala Gly Ile Asn Trp 
    50                  55                  60                  


Phe Gly Met Glu Thr Ser Asn Tyr Ala Pro His Gly Leu Trp Thr Arg 
65                  70                  75                  80  


Asp Tyr Lys Ser Met Leu Asp Gln Ile Lys Gln Gln Gly Tyr Asn Thr 
                85                  90                  95      


Val Arg Leu Pro Tyr Ser Asn Gln Leu Phe Asp Ala Gly Ser Val Pro 
            100                 105                 110         


Asn Gly Ile Asp Phe Ser Asn Gly Lys Asn Ala Asp Leu Lys Gly Leu 
        115                 120                 125             


Asn Gly Leu Gln Ile Met Asp Lys Leu Val Ala Tyr Ala Gly Gln Lys 
    130                 135                 140                 


Gly Leu Arg Ile Val Leu Asp Arg His Arg Pro Asp Ser Gly Gly Gln 
145                 150                 155                 160 


Ser Glu Leu Trp Tyr Thr Ser Arg Tyr Ser Glu Gln Arg Trp Ile Asn 
                165                 170                 175     


Asp Trp Lys Met Leu Ala Gly Arg Tyr Ala Asn Asn Pro Thr Val Ile 
            180                 185                 190         


Gly Ala Asp Leu His Asn Glu Pro His Gly Pro Ala Cys Trp Gly Cys 
        195                 200                 205             


Gly Asp Thr Ala Thr Asp Trp Arg Leu Ala Ala Glu Arg Ala Gly Asn 
    210                 215                 220                 


Ala Ile His Thr Val Asn Ser Asn Trp Leu Ile Phe Val Glu Gly Ile 
225                 230                 235                 240 


Gly Cys Val Asn Gly Asp Cys Ser Trp Trp Gly Gly Gln Leu Lys Asn 
                245                 250                 255     


Ala Gly Gln Tyr Pro Val Arg Leu Thr Val Ala Asn Arg Leu Val Tyr 
            260                 265                 270         


Ser Ala His Asp Tyr Pro Ala Ser Leu Tyr Pro Gln Thr Trp Phe Ser 
        275                 280                 285             


Ala Pro Asn Tyr Pro Asn Asn Leu Pro Gly Val Trp Asp Asn Tyr Trp 
    290                 295                 300                 


Gly Tyr Leu His Lys Gln Asn Ile Ala Pro Val Leu Val Gly Glu Phe 
305                 310                 315                 320 


Gly Ser Lys Leu Gln Thr Thr Ser Asp Arg Gln Trp Leu Asn Lys Leu 
                325                 330                 335     


Thr Gln Tyr Leu Gly Ser Asn Gly Ile Ser Trp Thr Phe Trp Ser Trp 
            340                 345                 350         


Asn Pro Asn Ser Gly Asp Thr Gly Gly Ile Leu Asn Asp Asp Trp Thr 
        355                 360                 365             


Thr Ile Asn Thr Asp Lys Gln Ser Tyr Leu Ala Gly Gly Thr Asp Ala 
    370                 375                 380                 


Thr Gly Val Thr His Gln Ser Ile Leu Phe Pro Leu Asp Val Gly Gly 
385                 390                 395                 400 


Thr Pro Gln Pro Ser Ala Thr Phe Thr Arg Thr Ser Thr Arg Thr Arg 
                405                 410                 415     


Thr Pro Thr Ala Cys Gly Asn Cys Pro Thr Ala Thr Pro Thr Arg Thr 
            420                 425                 430         


Arg Thr Leu Thr Ala Thr Ser Thr Arg Thr Asn Thr Pro Ser Pro Asn 
        435                 440                 445             


Asn Gly Leu Lys Leu Gln Tyr Arg Val Gly Asp Gly Thr Ser Ala Asn 
    450                 455                 460                 


Asp Asn Gln Ile Lys Pro Gln Phe Arg Ile Val Asn Asn Gly Ala Ser 
465                 470                 475                 480 


Asn Val Pro Leu Ser Glu Leu Glu Ile Arg Tyr Trp Tyr Thr Ser Glu 
                485                 490                 495     


Gly Asn Gln Ala Gln Ser Tyr Trp Cys Asp Tyr Ala Thr Leu Asn Cys 
            500                 505                 510         


Ala Asn Ile Thr Gly Ser Phe Val Lys Leu Gln Asn Ala Val Asn Gly 
        515                 520                 525             


Ala Asn Gly Tyr Leu Arg Leu Lys Phe Thr Gly Gly Gln Val Asn Ala 
    530                 535                 540                 


Asn Ser Asn Thr Gly Glu Ile Gln Asn Arg Phe Asn Lys Ser Asp Trp 
545                 550                 555                 560 


Ser Asn Tyr Ser Glu Gly Asp Asp Tyr Ser Tyr Asp Pro Thr Lys Thr 
                565                 570                 575     


Ser Phe Thr Asp Trp Asn Lys Val Thr Leu Tyr Arg Asn Gly Ala Leu 
            580                 585                 590         


Val Trp Gly Ile Glu Pro Thr Gly Thr Lys Ser Val Thr Ala Thr Pro 
        595                 600                 605             


Thr Arg Thr Arg Thr Asn Val Pro Gly Lys Asn Ser Pro Thr Pro Thr 
    610                 615                 620                 


Pro Thr Leu Thr Ala Met Pro Thr Glu Ala Pro Ala Val Ser Met His 
625                 630                 635                 640 


Val Gln Tyr Arg Asn Gly Glu Asn Pro Ala Lys Pro Lys Asn Ala His 
                645                 650                 655     


Leu Arg Pro Gln Phe Arg Leu Ile Asn Asp Gly Gly Thr Gln Val Ser 
            660                 665                 670         


Leu Lys Asp Val Thr Ile Arg Tyr Trp Phe Thr Arg Gln Gly Glu Ser 
        675                 680                 685             


Lys Tyr Lys Phe Ser Cys Glu Ala Ala Ala Leu Asp Cys Ala Asn Ile 
    690                 695                 700                 


Arg Gly Lys Val Val Asn Leu Pro Ser Ala Arg Pro Gly Ala Asn Ala 
705                 710                 715                 720 


Tyr Leu Glu Val Arg Phe Arg Pro Ala Ala Gly Met Leu Ala Pro Gly 
                725                 730                 735     


Ala Asn Thr Gly Glu Ile Val Thr His Leu Val Lys Lys Asp Lys Ser 
            740                 745                 750         


Lys His Asn Glu Asn Arg Asp Tyr Ser Phe Asn Lys Thr Gln Phe Asp 
        755                 760                 765             


Phe Ala Asp Arg Ala Glu Ile Thr Val Tyr Arg Asn Gly Ala Leu Val 
    770                 775                 780                 


Gly Gly Ile Glu Pro 
785                 


<210> 127
<211> 1425
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 127
gtgtcaggag aaccgcacgt gtcccttcgt ctctcgcgcc cccgccgtcg tacagccatc     60

ctcgccgcgg tcgccgcgtg cacggtcacc gcgggtgcct ggctcgcaac cggcaccgcc    120

tcggccggca cgctgtccgg caccctctac cgcgatccca actcgtcagc gacacgctgg    180

gtggcggcca acccgaacga ctctcgtgca tcagccatcc gcgacaagat agccagccag    240

ccggcggccc gatggctggc caatttcaac atctccacca tccagtccga ggtttcgacc    300

tacatcggtg cggccaacag cgccaatcag gtcccggtct tcgccgtgta catgatcccc    360

aaccgcgact gcggtggggc cagcgccggt ggcgcgcccg acctgaacca gtaccagacc    420

tgggtctcgg cgttcgccca gggactcggc aacaggctgg tcatcatcat cctcgagacc    480

gactcgttgg cgctcaccac ctgcctggac gcgaacgccc tggccgcccg taaccaggcc    540

atcagcaccg cggtgcagac catcaagtcg cgcaacgcga acgccaaggt gtacctcgac    600

ggcggacact ccacctggaa cagcgcgtcg gacacggcca accgccttcg gaatgccgga    660

gtccagttcg ctgacggctt cttcaccaac gtctcgaact tcaactcgac cagtagcgag    720

gtgaacttcg gccggtcggt catttcggcc ctgtcctcgc tcggcatcag tggcaagcgc    780

cagatcatcg acaccagccg taacggcggc gccagtggtg actggtgcgc agacgacaac    840

actgaccgcc ggctcggaca atggccgaca ctgaacaccg gtgatggcaa tgtggacggc    900

tacctctggg tcaagccgcc cggcgaggcc gacggttgcg ccttccaggc cggcagcttc    960

cagccgcagc tcgcgttcag cctcactcag ggaataggca acccgccgac ctcggccccg   1020

ccgaccaccg ccgtgcctac caccacgcgg ccgccgacct cggcgccgcc gaccaccacg   1080

cggccgccga cctcggcccc gccgaccacc acaccaccgc ccactacgac gggcgcctgc   1140

tccgccacca tgtcgatcac caactcgtgg cccggcggtt tccaggcgaa tgtcaccgtg   1200

gcggccggca gcgcggcgat ctccgggtgg actgtccggt ggaccctgtc cagcggtcag   1260

accatcaccc aactctggaa tggggcccag acggtgagcg ggtcagccgt gacggtcagg   1320

aatctgtcct acaacggctc gctggcagcc ggtgcgaaca cctcgttcgg gttcaccgct   1380

aacggcagcg cttcgacgcc cgcgaccacc tgcacatcgc cgtag                   1425

<210> 128
<211> 474
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(42)

<220> 
<221> DOMAIN
<222> (51)...(327)
<223> Glycosyl hydrolases family 6

<220> 
<221> DOMAIN
<222> (380)...(471)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (54)...(57)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (67)...(70)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (91)...(94)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (233)...(236)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (238)...(241)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (403)...(406)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (447)...(450)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (451)...(454)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (459)...(462)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (468)...(471)
<223> N-glycosylation site. Prosite id = PS00001

<400> 128
Met Ser Gly Glu Pro His Val Ser Leu Arg Leu Ser Arg Pro Arg Arg 
1               5                   10                  15      


Arg Thr Ala Ile Leu Ala Ala Val Ala Ala Cys Thr Val Thr Ala Gly 
            20                  25                  30          


Ala Trp Leu Ala Thr Gly Thr Ala Ser Ala Gly Thr Leu Ser Gly Thr 
        35                  40                  45              


Leu Tyr Arg Asp Pro Asn Ser Ser Ala Thr Arg Trp Val Ala Ala Asn 
    50                  55                  60                  


Pro Asn Asp Ser Arg Ala Ser Ala Ile Arg Asp Lys Ile Ala Ser Gln 
65                  70                  75                  80  


Pro Ala Ala Arg Trp Leu Ala Asn Phe Asn Ile Ser Thr Ile Gln Ser 
                85                  90                  95      


Glu Val Ser Thr Tyr Ile Gly Ala Ala Asn Ser Ala Asn Gln Val Pro 
            100                 105                 110         


Val Phe Ala Val Tyr Met Ile Pro Asn Arg Asp Cys Gly Gly Ala Ser 
        115                 120                 125             


Ala Gly Gly Ala Pro Asp Leu Asn Gln Tyr Gln Thr Trp Val Ser Ala 
    130                 135                 140                 


Phe Ala Gln Gly Leu Gly Asn Arg Leu Val Ile Ile Ile Leu Glu Thr 
145                 150                 155                 160 


Asp Ser Leu Ala Leu Thr Thr Cys Leu Asp Ala Asn Ala Leu Ala Ala 
                165                 170                 175     


Arg Asn Gln Ala Ile Ser Thr Ala Val Gln Thr Ile Lys Ser Arg Asn 
            180                 185                 190         


Ala Asn Ala Lys Val Tyr Leu Asp Gly Gly His Ser Thr Trp Asn Ser 
        195                 200                 205             


Ala Ser Asp Thr Ala Asn Arg Leu Arg Asn Ala Gly Val Gln Phe Ala 
    210                 215                 220                 


Asp Gly Phe Phe Thr Asn Val Ser Asn Phe Asn Ser Thr Ser Ser Glu 
225                 230                 235                 240 


Val Asn Phe Gly Arg Ser Val Ile Ser Ala Leu Ser Ser Leu Gly Ile 
                245                 250                 255     


Ser Gly Lys Arg Gln Ile Ile Asp Thr Ser Arg Asn Gly Gly Ala Ser 
            260                 265                 270         


Gly Asp Trp Cys Ala Asp Asp Asn Thr Asp Arg Arg Leu Gly Gln Trp 
        275                 280                 285             


Pro Thr Leu Asn Thr Gly Asp Gly Asn Val Asp Gly Tyr Leu Trp Val 
    290                 295                 300                 


Lys Pro Pro Gly Glu Ala Asp Gly Cys Ala Phe Gln Ala Gly Ser Phe 
305                 310                 315                 320 


Gln Pro Gln Leu Ala Phe Ser Leu Thr Gln Gly Ile Gly Asn Pro Pro 
                325                 330                 335     


Thr Ser Ala Pro Pro Thr Thr Ala Val Pro Thr Thr Thr Arg Pro Pro 
            340                 345                 350         


Thr Ser Ala Pro Pro Thr Thr Thr Arg Pro Pro Thr Ser Ala Pro Pro 
        355                 360                 365             


Thr Thr Thr Pro Pro Pro Thr Thr Thr Gly Ala Cys Ser Ala Thr Met 
    370                 375                 380                 


Ser Ile Thr Asn Ser Trp Pro Gly Gly Phe Gln Ala Asn Val Thr Val 
385                 390                 395                 400 


Ala Ala Gly Ser Ala Ala Ile Ser Gly Trp Thr Val Arg Trp Thr Leu 
                405                 410                 415     


Ser Ser Gly Gln Thr Ile Thr Gln Leu Trp Asn Gly Ala Gln Thr Val 
            420                 425                 430         


Ser Gly Ser Ala Val Thr Val Arg Asn Leu Ser Tyr Asn Gly Ser Leu 
        435                 440                 445             


Ala Ala Gly Ala Asn Thr Ser Phe Gly Phe Thr Ala Asn Gly Ser Ala 
    450                 455                 460                 


Ser Thr Pro Ala Thr Thr Cys Thr Ser Pro 
465                 470                 


<210> 129
<211> 1080
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 129
atgaaacggt cactatgttt agttcttgtc ctactagtga ttctttcagc ttgtgttcaa     60

aattacgcaa caccaacatc tgctcccacc acccaatcca cggagactcc aatgccacca    120

actactacat ctcagcctga tccaacatcc acacccagta caaatgtcaa acttcagcgg    180

ggcgtgaaca tgggtaatat gctcgaagcg ccgaacgaag gcgattgggg actctatgta    240

caggaagaat attttgactt aatcaaagac gcaggctttg actttgtccg tttgcccgtg    300

cggtggagca cacatgcgga agccgaatcg ccctacacaa ttgattcaac tttctttgca    360

cgcgtagatg aagtagtcaa ctgggctttg gagcgaaatc tcaggatcat cgttgacttc    420

catcactacg aagaaatgat gaccgatccg tggattcaca gagatcgtta tatcggtgtt    480

tggaaacagg tcgctgagca ttaccaggat tatccatcaa atgttttatt tgaattactt    540

aacgaaccta acaatactct gaatgcccag ctttggaatc aatatttgac tgaagcattg    600

gcagttgtaa gagagtcaaa cccaactcgt gatgtagtaa tcggccctgt caattggaat    660

gcttatgatt ggctctccac actcgatgtg ccggatgatg agcatctgat tgttacattt    720

cattactatt tgccgttcca ttttacgcat caaggcgcaa aatgggttgg agatgatgct    780

caaaattggc taggcactga atggggaagt gacgaggaaa aagcggaagt cacgggtcat    840

ttcgatgcgg tggctgattg ggcacagcga catggaaatg ttcgcatatt gatgggagag    900

tttggggcgt attcaaaagg tccgcaagac tcacgcgtcc gctggacgga gtttgtgagg    960

ggcgcagcgg aaagccacgg gtttgcgtgg gcttattggg aattcgcttc aggtttcggg   1020

gtgtatgacc cggaagctaa agtctggaga gatgatttgt tgcaggcgtt gattccgtaa   1080

<210> 130
<211> 359
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (65)...(343)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (177)...(186)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (186)...(189)
<223> N-glycosylation site. Prosite id = PS00001

<400> 130
Met Lys Arg Ser Leu Cys Leu Val Leu Val Leu Leu Val Ile Leu Ser 
1               5                   10                  15      


Ala Cys Val Gln Asn Tyr Ala Thr Pro Thr Ser Ala Pro Thr Thr Gln 
            20                  25                  30          


Ser Thr Glu Thr Pro Met Pro Pro Thr Thr Thr Ser Gln Pro Asp Pro 
        35                  40                  45              


Thr Ser Thr Pro Ser Thr Asn Val Lys Leu Gln Arg Gly Val Asn Met 
    50                  55                  60                  


Gly Asn Met Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Leu Tyr Val 
65                  70                  75                  80  


Gln Glu Glu Tyr Phe Asp Leu Ile Lys Asp Ala Gly Phe Asp Phe Val 
                85                  90                  95      


Arg Leu Pro Val Arg Trp Ser Thr His Ala Glu Ala Glu Ser Pro Tyr 
            100                 105                 110         


Thr Ile Asp Ser Thr Phe Phe Ala Arg Val Asp Glu Val Val Asn Trp 
        115                 120                 125             


Ala Leu Glu Arg Asn Leu Arg Ile Ile Val Asp Phe His His Tyr Glu 
    130                 135                 140                 


Glu Met Met Thr Asp Pro Trp Ile His Arg Asp Arg Tyr Ile Gly Val 
145                 150                 155                 160 


Trp Lys Gln Val Ala Glu His Tyr Gln Asp Tyr Pro Ser Asn Val Leu 
                165                 170                 175     


Phe Glu Leu Leu Asn Glu Pro Asn Asn Thr Leu Asn Ala Gln Leu Trp 
            180                 185                 190         


Asn Gln Tyr Leu Thr Glu Ala Leu Ala Val Val Arg Glu Ser Asn Pro 
        195                 200                 205             


Thr Arg Asp Val Val Ile Gly Pro Val Asn Trp Asn Ala Tyr Asp Trp 
    210                 215                 220                 


Leu Ser Thr Leu Asp Val Pro Asp Asp Glu His Leu Ile Val Thr Phe 
225                 230                 235                 240 


His Tyr Tyr Leu Pro Phe His Phe Thr His Gln Gly Ala Lys Trp Val 
                245                 250                 255     


Gly Asp Asp Ala Gln Asn Trp Leu Gly Thr Glu Trp Gly Ser Asp Glu 
            260                 265                 270         


Glu Lys Ala Glu Val Thr Gly His Phe Asp Ala Val Ala Asp Trp Ala 
        275                 280                 285             


Gln Arg His Gly Asn Val Arg Ile Leu Met Gly Glu Phe Gly Ala Tyr 
    290                 295                 300                 


Ser Lys Gly Pro Gln Asp Ser Arg Val Arg Trp Thr Glu Phe Val Arg 
305                 310                 315                 320 


Gly Ala Ala Glu Ser His Gly Phe Ala Trp Ala Tyr Trp Glu Phe Ala 
                325                 330                 335     


Ser Gly Phe Gly Val Tyr Asp Pro Glu Ala Lys Val Trp Arg Asp Asp 
            340                 345                 350         


Leu Leu Gln Ala Leu Ile Pro 
        355                 


<210> 131
<211> 1116
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 131
gtgatgaaag ggtttcgctg gtgtgtgatg gcgatggtgg tgatggcggc gacgaacgtt     60

cgcgccgcct gcacctggcc tgcatgggag cagtttaaaa aggactatat cagcgaaggc    120

gggcgcgtca ttgatcccag tgacgcgcgc aaaatcagta cctcggaagg gcaaagctac    180

gcgctgttct ttgccctcgc ggcgaacgat cgccagacct ttgatcggct gctgggctgg    240

acacgcgaca acctcgcgca gggcgatctg agccagcatc tccccgcctg gctgtgggga    300

atgaaagaaa aagagaaaga gacctgggcg gtgatcgaca gcaattccgc ctcggatgcc    360

gatatctgga ttgcctggtc gctgctggaa gcggggcgtt tgtggaaagc gccggagtac    420

acggccaccg ggaaagcgct gctcaggcgc attgcccgtg aagaggtggt caaggtgccg    480

ggcctggggc tgatgctgct gcccggtaaa gtcggctttg ccgaagagaa atcctggcgc    540

ttcaacccga gctatctccc gccgcagctg gcgaactatt tcacccgttt tggcgcaccc    600

tggaccacgc ttcgcgaaac caatttgcgg ctgctgctgg aaaccgcgcc aaaaggcttc    660

tcacccgact gggtgcagta tcaaaaaaat aagggctggc aactgaagcc agaaaaaacg    720

tttatcggca gctacgacgc gattcgcgtc tatctctggg cgggcatgtt gcacgaccgg    780

gacccgcaga aagcgcggct gctggcacgt tttaaaccca tggcgacgct tacaacgaaa    840

aaaggcgtac cccctgagaa agtcgatgtc gccagcggta aaaccacggg caatggcccg    900

gtcggtttct ccgcctcact gctgccgttt ttacaaaacc gcgatgcaca agcggtacaa    960

cgtcagcgtg tcgccgacca ttttcccggt aatgacgcct attacagcta cgtactgacc   1020

ctgtttggac aaggatggga tcagcatcgt tttcgtttca ccgcaaaggg tgaattacac   1080

cctgactggg gccaggaatg cgcaagttct cattaa                             1116

<210> 132
<211> 371
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (3)...(349)
<223> Glycosyl hydrolases family 8

<400> 132
Met Met Lys Gly Phe Arg Trp Cys Val Met Ala Met Val Val Met Ala 
1               5                   10                  15      


Ala Thr Asn Val Arg Ala Ala Cys Thr Trp Pro Ala Trp Glu Gln Phe 
            20                  25                  30          


Lys Lys Asp Tyr Ile Ser Glu Gly Gly Arg Val Ile Asp Pro Ser Asp 
        35                  40                  45              


Ala Arg Lys Ile Ser Thr Ser Glu Gly Gln Ser Tyr Ala Leu Phe Phe 
    50                  55                  60                  


Ala Leu Ala Ala Asn Asp Arg Gln Thr Phe Asp Arg Leu Leu Gly Trp 
65                  70                  75                  80  


Thr Arg Asp Asn Leu Ala Gln Gly Asp Leu Ser Gln His Leu Pro Ala 
                85                  90                  95      


Trp Leu Trp Gly Met Lys Glu Lys Glu Lys Glu Thr Trp Ala Val Ile 
            100                 105                 110         


Asp Ser Asn Ser Ala Ser Asp Ala Asp Ile Trp Ile Ala Trp Ser Leu 
        115                 120                 125             


Leu Glu Ala Gly Arg Leu Trp Lys Ala Pro Glu Tyr Thr Ala Thr Gly 
    130                 135                 140                 


Lys Ala Leu Leu Arg Arg Ile Ala Arg Glu Glu Val Val Lys Val Pro 
145                 150                 155                 160 


Gly Leu Gly Leu Met Leu Leu Pro Gly Lys Val Gly Phe Ala Glu Glu 
                165                 170                 175     


Lys Ser Trp Arg Phe Asn Pro Ser Tyr Leu Pro Pro Gln Leu Ala Asn 
            180                 185                 190         


Tyr Phe Thr Arg Phe Gly Ala Pro Trp Thr Thr Leu Arg Glu Thr Asn 
        195                 200                 205             


Leu Arg Leu Leu Leu Glu Thr Ala Pro Lys Gly Phe Ser Pro Asp Trp 
    210                 215                 220                 


Val Gln Tyr Gln Lys Asn Lys Gly Trp Gln Leu Lys Pro Glu Lys Thr 
225                 230                 235                 240 


Phe Ile Gly Ser Tyr Asp Ala Ile Arg Val Tyr Leu Trp Ala Gly Met 
                245                 250                 255     


Leu His Asp Arg Asp Pro Gln Lys Ala Arg Leu Leu Ala Arg Phe Lys 
            260                 265                 270         


Pro Met Ala Thr Leu Thr Thr Lys Lys Gly Val Pro Pro Glu Lys Val 
        275                 280                 285             


Asp Val Ala Ser Gly Lys Thr Thr Gly Asn Gly Pro Val Gly Phe Ser 
    290                 295                 300                 


Ala Ser Leu Leu Pro Phe Leu Gln Asn Arg Asp Ala Gln Ala Val Gln 
305                 310                 315                 320 


Arg Gln Arg Val Ala Asp His Phe Pro Gly Asn Asp Ala Tyr Tyr Ser 
                325                 330                 335     


Tyr Val Leu Thr Leu Phe Gly Gln Gly Trp Asp Gln His Arg Phe Arg 
            340                 345                 350         


Phe Thr Ala Lys Gly Glu Leu His Pro Asp Trp Gly Gln Glu Cys Ala 
        355                 360                 365             


Ser Ser His 
    370     


<210> 133
<211> 1335
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 133
gtgccgaccg ggttgagggc caagccctgc ctcacgcgct ggctcgcggc cagcgcctgc     60

gcgctcgcgc cgctgctgct cggcgcgccc gcgtccgcgc ttgccgcgac ggcgaacgcg    120

aacgccaaaa cgcctgctgc gaagacgacg tcggctgccg cgccgcaagc gcctgcctgc    180

gcgtggtccg actggaccgc gttcaagggc gcgctgctgt ccgccgacgg acgcgtgatc    240

gacgccagtt cgccgcgtca ggtgacggtc tccgaaggcc agtcgtacgc gctcttcttc    300

gcgctcgtcg cgaacgaccg cgcggccttc gacaagatcc tcgcgtggac cgagaacaat    360

ctcgcccggg gcgatctcgc ggcgcatttg cccgcgtgga tctgggggcg caccgacatc    420

gacgaaaacg gcgcgccggt cgtggcttcg gctgcgtcgg ccgcgtcgtc cagccccgct    480

acgcagacgc aaaccggcac gtggggcgtg atcgacacca actccgccag cgacgccgat    540

ctctggatcg cctacacgct gctcgaagcg ggccgcctct ggaacgtgcg ccgctacacg    600

gcgatcggca cggtgatggc gcgcaacgtc ctgcgccgcg agacggccgc gctgcccggc    660

ctcggccgca cggtgctgcc cggcccggtc ggcttcacgc tcgacaagaa cacgtggcgt    720

ctgaacccga gctatgtgcc gctgcaggtc atgcgccgct tcacgctcgc gatgcccgag    780

aaagaacggc ccgagtggaa gtcgctgctc ggcagttcgt cgaagctcgt caacggcacc    840

gcgcccaagg gcttttcgcc cgactgggtc gtctaccgcg cgcggggcaa caagggcgat    900

ttcggccctg acgaacccac gcacgccgaa agcgcctata acgcgatccg cgtctatctg    960

tgggcgggca tgctcgctta cgacgacccg gcgcgcagcg cgacgctcgc gaccttcgcg   1020

ccgctctcgg ccttcgtggc cgcgcacggc tttccgcccg agcgcgtcaa cacgcagacc   1080

ggcgaacccg gcccgaacga gggcaacggc ggcttttcgg cggcggtcgc gccgtatctg   1140

tcggcactcg gccgcaccga tctgtccgat gcgcaggttc aacgcagccg cacgctcgcg   1200

caaaaatcgc cgcccggcta ttacagcagc gtgctgatgc tgttcggcct cggctatctg   1260

caaggccttt atcgcttcga tgcgcaaggc cgcgtaattc ccgcatggac ggctcaatgc   1320

ccggcagcac gatga                                                    1335

<210> 134
<211> 444
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(35)

<220> 
<221> DOMAIN
<222> (38)...(422)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (282)...(285)
<223> N-glycosylation site. Prosite id = PS00001

<400> 134
Met Pro Thr Gly Leu Arg Ala Lys Pro Cys Leu Thr Arg Trp Leu Ala 
1               5                   10                  15      


Ala Ser Ala Cys Ala Leu Ala Pro Leu Leu Leu Gly Ala Pro Ala Ser 
            20                  25                  30          


Ala Leu Ala Ala Thr Ala Asn Ala Asn Ala Lys Thr Pro Ala Ala Lys 
        35                  40                  45              


Thr Thr Ser Ala Ala Ala Pro Gln Ala Pro Ala Cys Ala Trp Ser Asp 
    50                  55                  60                  


Trp Thr Ala Phe Lys Gly Ala Leu Leu Ser Ala Asp Gly Arg Val Ile 
65                  70                  75                  80  


Asp Ala Ser Ser Pro Arg Gln Val Thr Val Ser Glu Gly Gln Ser Tyr 
                85                  90                  95      


Ala Leu Phe Phe Ala Leu Val Ala Asn Asp Arg Ala Ala Phe Asp Lys 
            100                 105                 110         


Ile Leu Ala Trp Thr Glu Asn Asn Leu Ala Arg Gly Asp Leu Ala Ala 
        115                 120                 125             


His Leu Pro Ala Trp Ile Trp Gly Arg Thr Asp Ile Asp Glu Asn Gly 
    130                 135                 140                 


Ala Pro Val Val Ala Ser Ala Ala Ser Ala Ala Ser Ser Ser Pro Ala 
145                 150                 155                 160 


Thr Gln Thr Gln Thr Gly Thr Trp Gly Val Ile Asp Thr Asn Ser Ala 
                165                 170                 175     


Ser Asp Ala Asp Leu Trp Ile Ala Tyr Thr Leu Leu Glu Ala Gly Arg 
            180                 185                 190         


Leu Trp Asn Val Arg Arg Tyr Thr Ala Ile Gly Thr Val Met Ala Arg 
        195                 200                 205             


Asn Val Leu Arg Arg Glu Thr Ala Ala Leu Pro Gly Leu Gly Arg Thr 
    210                 215                 220                 


Val Leu Pro Gly Pro Val Gly Phe Thr Leu Asp Lys Asn Thr Trp Arg 
225                 230                 235                 240 


Leu Asn Pro Ser Tyr Val Pro Leu Gln Val Met Arg Arg Phe Thr Leu 
                245                 250                 255     


Ala Met Pro Glu Lys Glu Arg Pro Glu Trp Lys Ser Leu Leu Gly Ser 
            260                 265                 270         


Ser Ser Lys Leu Val Asn Gly Thr Ala Pro Lys Gly Phe Ser Pro Asp 
        275                 280                 285             


Trp Val Val Tyr Arg Ala Arg Gly Asn Lys Gly Asp Phe Gly Pro Asp 
    290                 295                 300                 


Glu Pro Thr His Ala Glu Ser Ala Tyr Asn Ala Ile Arg Val Tyr Leu 
305                 310                 315                 320 


Trp Ala Gly Met Leu Ala Tyr Asp Asp Pro Ala Arg Ser Ala Thr Leu 
                325                 330                 335     


Ala Thr Phe Ala Pro Leu Ser Ala Phe Val Ala Ala His Gly Phe Pro 
            340                 345                 350         


Pro Glu Arg Val Asn Thr Gln Thr Gly Glu Pro Gly Pro Asn Glu Gly 
        355                 360                 365             


Asn Gly Gly Phe Ser Ala Ala Val Ala Pro Tyr Leu Ser Ala Leu Gly 
    370                 375                 380                 


Arg Thr Asp Leu Ser Asp Ala Gln Val Gln Arg Ser Arg Thr Leu Ala 
385                 390                 395                 400 


Gln Lys Ser Pro Pro Gly Tyr Tyr Ser Ser Val Leu Met Leu Phe Gly 
                405                 410                 415     


Leu Gly Tyr Leu Gln Gly Leu Tyr Arg Phe Asp Ala Gln Gly Arg Val 
            420                 425                 430         


Ile Pro Ala Trp Thr Ala Gln Cys Pro Ala Ala Arg 
        435                 440                 


<210> 135
<211> 1782
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 135
gtgtcaatga taacgccaaa aacaaagtct tatggcttgg cagccatgct cagccttggt     60

ttggcagttg ccaaccaaaa cgctaatgca ggctgcgttt ataaaattac taatgattgg    120

ggggcgggcc taaccggcga aatcgtcatc accaacagcg gcagctctgc cgtgaatggc    180

tggaatattg gttggcagta cgccaccaat cgcatcacca gttcatggaa tgtaaacctt    240

agcggaagta atccatattc tgcctccaac atcagttgga atggaaattt acaaccaggc    300

caaagtgcca gctttggttt tcaagtggat aagcgcggtg gttccgccga agtacccact    360

attaccggtt cggtctgctc gggcacagtg accagttcgg ccgccccttc aagcacgccg    420

gttcgctcca gcagttcaac tgcggtggcg agtagttcca acaataataa cggcggccag    480

caatgtaatt ggtatggaac aatcattccg ctgtgcgtaa ataccgcgag tggctggggt    540

tgggaaaata accaaacctg cgtttcccgt tcggtttgca ccacgattgt taacggatca    600

tcttcagcaa caccgtcttc aacgccaatt gtttcatcca gttcaagatc atcctcgtca    660

gtaccgattg ttccttccag ttcatcgcct tccagctcct catcgtcaaa taacaataac    720

cagggcattg caccattggt tgtgcaaggc aataaagtga ccgccaatgg ccagcccgcg    780

aatttggctg gcatgagttt gttctggagc aacaccggct ggggtggcga gaagtattac    840

aactcgcaag ccgttgcctg gttgaagtct gactggaaag ccaatctggt tcgcgccgcc    900

atgggggttg atgaagccgg tggttatctc accgactcaa ccaacaaaac acgtgttacc    960

gcagtggttg atgcagcaat tgccaacaat atgtatgtga ttatcgattg gcacacgcat   1020

cacgctgaag ataacaaagc ggccgccatc gccttcttta aagaaatggc gaccaaatac   1080

ggcagctata acaatgtgat ttacgaggtg tacaacgagc cactgcaagt ttcctggagc   1140

agtgtgatca agccgtatgc gactgatgtg atccgtgaga ttcgcgcaat cgatccggac   1200

aacctgatca ttgtcggtac cccaagctgg tcgcaggatg tggatgtcgc ggcgaatgac   1260

ccgatcactg cgtacaccaa catcgcttac accttgcact tctactcagg tacccataaa   1320

caattcctgc gcgacaaagc acaaacggca ttgagcaaag gtattgcgtt attcgtgacc   1380

gagtggggtt cagtgaatgc cgacggtaat ggcgcagtgg acaccgctga aaccaacgcc   1440

tggttaagct tcctgaaaac caacggtatc agccacgcga attgggcatt gaacgataaa   1500

gccgaaggtt catctgcatt aactcccgga gcaagtgcta acggaggttg gagcagtggt   1560

caattgaccg cttcaggatc attggtacgc aacgcgatta tcaccaataa caataacggc   1620

aacaccagct ctgtcgcgac tacttcaaca tcttccagtg ttagctcgat taataccaac   1680

ccgagcacca ttgcaccgga caatgcgaag attcgctaca acggtcgcgt cagcctcaac   1740

tccactgctg cgctttacga ttgggcaaat acccaaatcg aa                      1782

<210> 136
<211> 594
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (30)...(126)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (250)...(505)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (80)...(83)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (91)...(94)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (94)...(107)
<223> Cellulose-binding domain, bacterial type. Prosite id = PS00561

<220> 
<221> SITE
<222> (186)...(189)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (201)...(204)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (319)...(322)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (371)...(380)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (549)...(552)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (588)...(591)
<223> N-glycosylation site. Prosite id = PS00001

<400> 136
Met Ser Met Ile Thr Pro Lys Thr Lys Ser Tyr Gly Leu Ala Ala Met 
1               5                   10                  15      


Leu Ser Leu Gly Leu Ala Val Ala Asn Gln Asn Ala Asn Ala Gly Cys 
            20                  25                  30          


Val Tyr Lys Ile Thr Asn Asp Trp Gly Ala Gly Leu Thr Gly Glu Ile 
        35                  40                  45              


Val Ile Thr Asn Ser Gly Ser Ser Ala Val Asn Gly Trp Asn Ile Gly 
    50                  55                  60                  


Trp Gln Tyr Ala Thr Asn Arg Ile Thr Ser Ser Trp Asn Val Asn Leu 
65                  70                  75                  80  


Ser Gly Ser Asn Pro Tyr Ser Ala Ser Asn Ile Ser Trp Asn Gly Asn 
                85                  90                  95      


Leu Gln Pro Gly Gln Ser Ala Ser Phe Gly Phe Gln Val Asp Lys Arg 
            100                 105                 110         


Gly Gly Ser Ala Glu Val Pro Thr Ile Thr Gly Ser Val Cys Ser Gly 
        115                 120                 125             


Thr Val Thr Ser Ser Ala Ala Pro Ser Ser Thr Pro Val Arg Ser Ser 
    130                 135                 140                 


Ser Ser Thr Ala Val Ala Ser Ser Ser Asn Asn Asn Asn Gly Gly Gln 
145                 150                 155                 160 


Gln Cys Asn Trp Tyr Gly Thr Ile Ile Pro Leu Cys Val Asn Thr Ala 
                165                 170                 175     


Ser Gly Trp Gly Trp Glu Asn Asn Gln Thr Cys Val Ser Arg Ser Val 
            180                 185                 190         


Cys Thr Thr Ile Val Asn Gly Ser Ser Ser Ala Thr Pro Ser Ser Thr 
        195                 200                 205             


Pro Ile Val Ser Ser Ser Ser Arg Ser Ser Ser Ser Val Pro Ile Val 
    210                 215                 220                 


Pro Ser Ser Ser Ser Pro Ser Ser Ser Ser Ser Ser Asn Asn Asn Asn 
225                 230                 235                 240 


Gln Gly Ile Ala Pro Leu Val Val Gln Gly Asn Lys Val Thr Ala Asn 
                245                 250                 255     


Gly Gln Pro Ala Asn Leu Ala Gly Met Ser Leu Phe Trp Ser Asn Thr 
            260                 265                 270         


Gly Trp Gly Gly Glu Lys Tyr Tyr Asn Ser Gln Ala Val Ala Trp Leu 
        275                 280                 285             


Lys Ser Asp Trp Lys Ala Asn Leu Val Arg Ala Ala Met Gly Val Asp 
    290                 295                 300                 


Glu Ala Gly Gly Tyr Leu Thr Asp Ser Thr Asn Lys Thr Arg Val Thr 
305                 310                 315                 320 


Ala Val Val Asp Ala Ala Ile Ala Asn Asn Met Tyr Val Ile Ile Asp 
                325                 330                 335     


Trp His Thr His His Ala Glu Asp Asn Lys Ala Ala Ala Ile Ala Phe 
            340                 345                 350         


Phe Lys Glu Met Ala Thr Lys Tyr Gly Ser Tyr Asn Asn Val Ile Tyr 
        355                 360                 365             


Glu Val Tyr Asn Glu Pro Leu Gln Val Ser Trp Ser Ser Val Ile Lys 
    370                 375                 380                 


Pro Tyr Ala Thr Asp Val Ile Arg Glu Ile Arg Ala Ile Asp Pro Asp 
385                 390                 395                 400 


Asn Leu Ile Ile Val Gly Thr Pro Ser Trp Ser Gln Asp Val Asp Val 
                405                 410                 415     


Ala Ala Asn Asp Pro Ile Thr Ala Tyr Thr Asn Ile Ala Tyr Thr Leu 
            420                 425                 430         


His Phe Tyr Ser Gly Thr His Lys Gln Phe Leu Arg Asp Lys Ala Gln 
        435                 440                 445             


Thr Ala Leu Ser Lys Gly Ile Ala Leu Phe Val Thr Glu Trp Gly Ser 
    450                 455                 460                 


Val Asn Ala Asp Gly Asn Gly Ala Val Asp Thr Ala Glu Thr Asn Ala 
465                 470                 475                 480 


Trp Leu Ser Phe Leu Lys Thr Asn Gly Ile Ser His Ala Asn Trp Ala 
                485                 490                 495     


Leu Asn Asp Lys Ala Glu Gly Ser Ser Ala Leu Thr Pro Gly Ala Ser 
            500                 505                 510         


Ala Asn Gly Gly Trp Ser Ser Gly Gln Leu Thr Ala Ser Gly Ser Leu 
        515                 520                 525             


Val Arg Asn Ala Ile Ile Thr Asn Asn Asn Asn Gly Asn Thr Ser Ser 
    530                 535                 540                 


Val Ala Thr Thr Ser Thr Ser Ser Ser Val Ser Ser Ile Asn Thr Asn 
545                 550                 555                 560 


Pro Ser Thr Ile Ala Pro Asp Asn Ala Lys Ile Arg Tyr Asn Gly Arg 
                565                 570                 575     


Val Ser Leu Asn Ser Thr Ala Ala Leu Tyr Asp Trp Ala Asn Thr Gln 
            580                 585                 590         


Ile Glu 
        


<210> 137
<211> 1119
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 137
atgaaatttc cgcttcaatt ccttcctttg attttttttc gtgttctgcg tttatgttta     60

attcccctac tggtttttac cagtaatttc agtgtggcag acgatgcctg gcaaaatagt    120

aatggctggt ggaatgaaac cgatattccc gcctttgata aaagcaaaat tgaaaagcaa    180

cttttattag taaaagtgca gggcaataaa tttgtggatt caaccggcaa gacgctggtg    240

tttcgcggac tgaatattgc cgaccccgac aaaattgcgc gcgataaacg tttcaccaaa    300

aaacattttg aagtgattaa atcctggggc gcgaatgtta tccgtgtgcc ggtgcatcca    360

tcagcctggc gcaagcacgg caaaaaagcc tatctcgcca tgctcgacca ggtggtggtt    420

tgggcgaatg aattgggcat gtatgtgatc ctcgattggc actccatcgg caacctcaaa    480

tcgcaaatgt tccagaataa ttcctactac accgataagc ccgagacctt cgatttttgg    540

cgcactgtct ccgagcgcta cgccggtatc cacgcggtgg ccttctacga aatttttaat    600

gagccaacgg tattcagcgg ccgtttgggc atggtgagtt gggcggagtg gaaagcgatt    660

aacgaagaaa ttatcaccgt tatccaggcg cacaatccgg cggcgatttc actcgtggcc    720

ggttttaatt gggcatacga tttaaccccg gtcgccaccg cgcctatcga gcgcaacaat    780

gtggcctatg taagccatcc ctatccgatg aaagtgggtg cgccctacga aaaaaattgg    840

gagcgcgatt ttgggtttat cgccgataag tatccggtct tcgccaccga aatcggctac    900

cagttagcga cagataaagg cgcgcacatc ccggtgattg acgatggcca gtacggcaag    960

cgcatcactg attatttcaa tagcaaaggt attagctggg tgggctgggt gtttgacccg   1020

gattggtcgc cgcaaatgtt taccgactac aaaacctata agcctactat gcagggccag   1080

catttccgcg acgttatgct gcgcgataat aaaaaataa                          1119

<210> 138
<211> 372
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(33)

<220> 
<221> DOMAIN
<222> (69)...(346)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (29)...(32)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (45)...(48)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (168)...(171)
<223> N-glycosylation site. Prosite id = PS00001

<400> 138
Met Lys Phe Pro Leu Gln Phe Leu Pro Leu Ile Phe Phe Arg Val Leu 
1               5                   10                  15      


Arg Leu Cys Leu Ile Pro Leu Leu Val Phe Thr Ser Asn Phe Ser Val 
            20                  25                  30          


Ala Asp Asp Ala Trp Gln Asn Ser Asn Gly Trp Trp Asn Glu Thr Asp 
        35                  40                  45              


Ile Pro Ala Phe Asp Lys Ser Lys Ile Glu Lys Gln Leu Leu Leu Val 
    50                  55                  60                  


Lys Val Gln Gly Asn Lys Phe Val Asp Ser Thr Gly Lys Thr Leu Val 
65                  70                  75                  80  


Phe Arg Gly Leu Asn Ile Ala Asp Pro Asp Lys Ile Ala Arg Asp Lys 
                85                  90                  95      


Arg Phe Thr Lys Lys His Phe Glu Val Ile Lys Ser Trp Gly Ala Asn 
            100                 105                 110         


Val Ile Arg Val Pro Val His Pro Ser Ala Trp Arg Lys His Gly Lys 
        115                 120                 125             


Lys Ala Tyr Leu Ala Met Leu Asp Gln Val Val Val Trp Ala Asn Glu 
    130                 135                 140                 


Leu Gly Met Tyr Val Ile Leu Asp Trp His Ser Ile Gly Asn Leu Lys 
145                 150                 155                 160 


Ser Gln Met Phe Gln Asn Asn Ser Tyr Tyr Thr Asp Lys Pro Glu Thr 
                165                 170                 175     


Phe Asp Phe Trp Arg Thr Val Ser Glu Arg Tyr Ala Gly Ile His Ala 
            180                 185                 190         


Val Ala Phe Tyr Glu Ile Phe Asn Glu Pro Thr Val Phe Ser Gly Arg 
        195                 200                 205             


Leu Gly Met Val Ser Trp Ala Glu Trp Lys Ala Ile Asn Glu Glu Ile 
    210                 215                 220                 


Ile Thr Val Ile Gln Ala His Asn Pro Ala Ala Ile Ser Leu Val Ala 
225                 230                 235                 240 


Gly Phe Asn Trp Ala Tyr Asp Leu Thr Pro Val Ala Thr Ala Pro Ile 
                245                 250                 255     


Glu Arg Asn Asn Val Ala Tyr Val Ser His Pro Tyr Pro Met Lys Val 
            260                 265                 270         


Gly Ala Pro Tyr Glu Lys Asn Trp Glu Arg Asp Phe Gly Phe Ile Ala 
        275                 280                 285             


Asp Lys Tyr Pro Val Phe Ala Thr Glu Ile Gly Tyr Gln Leu Ala Thr 
    290                 295                 300                 


Asp Lys Gly Ala His Ile Pro Val Ile Asp Asp Gly Gln Tyr Gly Lys 
305                 310                 315                 320 


Arg Ile Thr Asp Tyr Phe Asn Ser Lys Gly Ile Ser Trp Val Gly Trp 
                325                 330                 335     


Val Phe Asp Pro Asp Trp Ser Pro Gln Met Phe Thr Asp Tyr Lys Thr 
            340                 345                 350         


Tyr Lys Pro Thr Met Gln Gly Gln His Phe Arg Asp Val Met Leu Arg 
        355                 360                 365             


Asp Asn Lys Lys 
    370         


<210> 139
<211> 1773
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 139
atgcctttat gtaccacaaa acatttactt ccggctttgg tgctggttac tgcatccagt     60

tttatgctgg gctgcggtaa tagcactaca cctgcagaaa aatcaaccgc atcatccgca    120

gtggttggca atattaaatt gaatcagctt ggttttttgc ctgagtcagc caagtgggcg    180

gtgattcctg atgtggctgc aactcaattc aaggtgattg atacagcaac taatgccgag    240

gtttacagcg gcaacttaag tgcagccgct aattgggacg cagcagaaga gaatgtaaaa    300

cttgcggatt tttccagtgt aaaaacaccg ggtaattatc ttgtgcgggt tgagggcgtt    360

aaggattcct atccatttgc gattggtgca gatgtttacc aagcgctaaa tgctgctagc    420

atcaaagcgt tttattttaa tcgcaacagc gcggagttat tacctgagca cgctggtgtg    480

tatgcgcgtc cgctcggtca tcctgatacc aatgtattga ttcatccttc agcagcaagc    540

gacgcgcgtc ctgcgggagc tgtgatctcc agtcctaaag gttggtacga cgccggtgat    600

tacaacaaat atatcgttaa ctccgggatc gcgacttatt cactgcttgc tgcttacgaa    660

cattttccgg aaatatttaa taaccagaat ttaaatattc ctgaaagcgg cgatgcaatt    720

cctgatttgc tcaatgaaac cctgtggaat cttgagtgga tgttgacgat gcaagatccc    780

aacgacggcg gtgtttatca caaacttacc aacaaacgat ttgatggcac ggtgatgccg    840

catgaggcaa ctaccgagcg ctacgtcgta caaaaaacca cggcagcagc attggatttt    900

gcagcggtaa tggcagcagc gagccgggtg cttgcgcaat atgaaaatca attgccgggc    960

atgtctgtaa aaatgttagc tgcggcagaa tctgcctggg cgtgggctgc ggctaatccg   1020

gcagtaattt acaaacagcc ggacgatatc aaaaccggtg aatatggtga tgcaaatctc   1080

gccgacgaat ttgcctgggc ggctgctgag ttgtacatca ctaccaaaaa agacagctac   1140

tacaacgcaa tgaaaccgga agagacaacg gcaacagttc catcatgggg tgatgtacgc   1200

gggttgggtt ggatctcgct cgcgcatcat cgcgatcaat tgacagcaat tgcggatcaa   1260

caattaattg cgaaccgcat tgatggtttg gcggccagtt tgcaatcggc gtgggcggcg   1320

tctgcctatc gcgttaccat gcagaaaaat gatttcaagt ggggcagtaa ttcggttgga   1380

ttagggcagg cgatgattct tgtgcaagcc tatcaattga atggcaagcg cgagtatctg   1440

gatgcggcgc aggcaatgct ggattatgtt cttggccgca acgcgaccga catgtcattc   1500

gtcaccggtt atggcgccaa ggcaaccatg catccacacc atcgcccgtc gggtgctgat   1560

aaagtggctt atcctgttcc gggttttctt gccggtggtc cgcaggcggg gcaacaggat   1620

aaagacgatt gtgaagtcgc ctatccatcg cccatcactg ccaagtctta tctggatcac   1680

tactgcagtt acgccagcaa cgaagtggcg attaactgga atgcaccatt agtgtacgtt   1740

tctgccgcca tccaggcgct gacaccaaag taa                                1773

<210> 140
<211> 590
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (41)...(124)
<223> N-terminal ig-like domain of cellulase

<220> 
<221> DOMAIN
<222> (132)...(584)
<223> Glycosyl hydrolase family 9

<220> 
<221> SITE
<222> (27)...(30)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (86)...(89)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (248)...(251)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (501)...(504)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (506)...(522)
<223> Glycosyl hydrolases family 9 active sites signature 1. Prosite id = PS00592

<220> 
<221> SITE
<222> (565)...(583)
<223> Glycosyl hydrolases family 9 active sites signature 2. Prosite id = PS00698

<400> 140
Met Pro Leu Cys Thr Thr Lys His Leu Leu Pro Ala Leu Val Leu Val 
1               5                   10                  15      


Thr Ala Ser Ser Phe Met Leu Gly Cys Gly Asn Ser Thr Thr Pro Ala 
            20                  25                  30          


Glu Lys Ser Thr Ala Ser Ser Ala Val Val Gly Asn Ile Lys Leu Asn 
        35                  40                  45              


Gln Leu Gly Phe Leu Pro Glu Ser Ala Lys Trp Ala Val Ile Pro Asp 
    50                  55                  60                  


Val Ala Ala Thr Gln Phe Lys Val Ile Asp Thr Ala Thr Asn Ala Glu 
65                  70                  75                  80  


Val Tyr Ser Gly Asn Leu Ser Ala Ala Ala Asn Trp Asp Ala Ala Glu 
                85                  90                  95      


Glu Asn Val Lys Leu Ala Asp Phe Ser Ser Val Lys Thr Pro Gly Asn 
            100                 105                 110         


Tyr Leu Val Arg Val Glu Gly Val Lys Asp Ser Tyr Pro Phe Ala Ile 
        115                 120                 125             


Gly Ala Asp Val Tyr Gln Ala Leu Asn Ala Ala Ser Ile Lys Ala Phe 
    130                 135                 140                 


Tyr Phe Asn Arg Asn Ser Ala Glu Leu Leu Pro Glu His Ala Gly Val 
145                 150                 155                 160 


Tyr Ala Arg Pro Leu Gly His Pro Asp Thr Asn Val Leu Ile His Pro 
                165                 170                 175     


Ser Ala Ala Ser Asp Ala Arg Pro Ala Gly Ala Val Ile Ser Ser Pro 
            180                 185                 190         


Lys Gly Trp Tyr Asp Ala Gly Asp Tyr Asn Lys Tyr Ile Val Asn Ser 
        195                 200                 205             


Gly Ile Ala Thr Tyr Ser Leu Leu Ala Ala Tyr Glu His Phe Pro Glu 
    210                 215                 220                 


Ile Phe Asn Asn Gln Asn Leu Asn Ile Pro Glu Ser Gly Asp Ala Ile 
225                 230                 235                 240 


Pro Asp Leu Leu Asn Glu Thr Leu Trp Asn Leu Glu Trp Met Leu Thr 
                245                 250                 255     


Met Gln Asp Pro Asn Asp Gly Gly Val Tyr His Lys Leu Thr Asn Lys 
            260                 265                 270         


Arg Phe Asp Gly Thr Val Met Pro His Glu Ala Thr Thr Glu Arg Tyr 
        275                 280                 285             


Val Val Gln Lys Thr Thr Ala Ala Ala Leu Asp Phe Ala Ala Val Met 
    290                 295                 300                 


Ala Ala Ala Ser Arg Val Leu Ala Gln Tyr Glu Asn Gln Leu Pro Gly 
305                 310                 315                 320 


Met Ser Val Lys Met Leu Ala Ala Ala Glu Ser Ala Trp Ala Trp Ala 
                325                 330                 335     


Ala Ala Asn Pro Ala Val Ile Tyr Lys Gln Pro Asp Asp Ile Lys Thr 
            340                 345                 350         


Gly Glu Tyr Gly Asp Ala Asn Leu Ala Asp Glu Phe Ala Trp Ala Ala 
        355                 360                 365             


Ala Glu Leu Tyr Ile Thr Thr Lys Lys Asp Ser Tyr Tyr Asn Ala Met 
    370                 375                 380                 


Lys Pro Glu Glu Thr Thr Ala Thr Val Pro Ser Trp Gly Asp Val Arg 
385                 390                 395                 400 


Gly Leu Gly Trp Ile Ser Leu Ala His His Arg Asp Gln Leu Thr Ala 
                405                 410                 415     


Ile Ala Asp Gln Gln Leu Ile Ala Asn Arg Ile Asp Gly Leu Ala Ala 
            420                 425                 430         


Ser Leu Gln Ser Ala Trp Ala Ala Ser Ala Tyr Arg Val Thr Met Gln 
        435                 440                 445             


Lys Asn Asp Phe Lys Trp Gly Ser Asn Ser Val Gly Leu Gly Gln Ala 
    450                 455                 460                 


Met Ile Leu Val Gln Ala Tyr Gln Leu Asn Gly Lys Arg Glu Tyr Leu 
465                 470                 475                 480 


Asp Ala Ala Gln Ala Met Leu Asp Tyr Val Leu Gly Arg Asn Ala Thr 
                485                 490                 495     


Asp Met Ser Phe Val Thr Gly Tyr Gly Ala Lys Ala Thr Met His Pro 
            500                 505                 510         


His His Arg Pro Ser Gly Ala Asp Lys Val Ala Tyr Pro Val Pro Gly 
        515                 520                 525             


Phe Leu Ala Gly Gly Pro Gln Ala Gly Gln Gln Asp Lys Asp Asp Cys 
    530                 535                 540                 


Glu Val Ala Tyr Pro Ser Pro Ile Thr Ala Lys Ser Tyr Leu Asp His 
545                 550                 555                 560 


Tyr Cys Ser Tyr Ala Ser Asn Glu Val Ala Ile Asn Trp Asn Ala Pro 
                565                 570                 575     


Leu Val Tyr Val Ser Ala Ala Ile Gln Ala Leu Thr Pro Lys 
            580                 585                 590 


<210> 141
<211> 1875
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 141
atgaacaata aacttcccgg tcgctctgct aatggcaagc gccattcaag tccacgcaag     60

tccatcctca actcggtcat cggtattttg gcgggcagtt tattgtctgg cctcgctctg    120

gccgacgttc ccgcactgac ggtgcaaggc aacaaagtgt tagtgggcgg taaatccgtc    180

agccttgaag gcgtttcact gttctggagt agcactggct ggggcgcaga aaagtattac    240

aacgctgcta ccgtcaagcg cgccaaaacc gagttcaacg ctaacctgat ccgcgcggca    300

ataggccatg gtgaagacgg cgcagtcgat agagactgga acggtaatat ggctcgcctt    360

gatgctgtag ttcaagcagc catcgacaac gatatgtacg tcatcattga ctaccatagc    420

cacaaagctc atcagaattg gggagcagca gatgcattct ttaaacaagt cgcgcagaag    480

tggggcaagt acaacaacgt tatttacgag atttataacg agcctgtcgg tgccaactgg    540

cataccgacc tgaaacctta tgccgagcat gtaggcgcta cgattcgcgc aatcgatccg    600

gacaacctga tcattatggg cactccacaa tggtcgcaag atgttgatat tgcatccacc    660

aataaagcca atgtatccaa cctggcgtac accattcact tctacgccca cgagcacacc    720

ggctggttgc gcgcgaaggc acaaactgcg ctcaataacg gtatcgccct atttgctacc    780

gagtggggta tgaccggtgc caacggccgt ggaccggtaa ataaaggcga aacctgggcc    840

tggatcgact tcctgcgcgc caatggcatt agccatgcgg gctgggcatt ccacgataaa    900

gatcgcgatg tcgccaccgg cgaagttgaa acctcatctt atttctggag tgacggcagc    960

cttaaagagt ccggtcattt tattaaagag attcttgccg gtcgtaaaga tattggcgga   1020

ggcggcggag gtggtggcga cggcggttca accggttcct gccaaaaagc cggtctgggc   1080

gatacccttg aagcagaaaa ctattgccag gcgagcggca ttgaaaccga aaacaccagc   1140

gatgctggcg gcggcaaaaa tgtgggctac atcgacaacg gcgactggtt gacctactcg   1200

atcaatgtgc ctgcaaccgg cacttataaa gtgagctatc gcgttgctgc ctctgcaggc   1260

ggcggtcaat ttcagttaga aaaagccggc ggcagccctg tttatggcaa cgtaaacgta   1320

cctgcgactg gcggttggca aaactggcaa accgtttcgc acaacgtggt tttgccagcc   1380

ggtgagcagc tgattgctat agcggcggtg accggtggct tcaacgttaa ttggctgaaa   1440

gtggaaagca ccggcacacc gccggatacc aatcccggca ccgtgattac caccatccag   1500

gcggaagcct ttagccagca acaaggcact gaactggaaa ataccaccga taccggcggc   1560

ggtaaaaatg tgggttatat cgatgcgggt gattggctct cttacgctgg tacgccggtg   1620

aatatcccca gcactggcag ctatgtgatt gagtatcgcg ttgccagcca aagcggtggc   1680

ggcagcctga cttttgaaga agcaggcggc actccggcct acggcaatct tgcgattccc   1740

tctaccggtg gctggcagac ctggaccacg gtaaaacaca ccgtgaacct gactgcgggc   1800

agccacaagt ttggcatcaa agtgaatgcg ggaggatgga acctgaattg gattcgcatc   1860

agcaaagcca attaa                                                    1875

<210> 142
<211> 624
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(41)

<220> 
<221> DOMAIN
<222> (50)...(305)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (364)...(483)
<223> Carbohydrate binding module (family 6)

<220> 
<221> DOMAIN
<222> (500)...(622)
<223> Carbohydrate binding module (family 6)

<220> 
<221> SITE
<222> (169)...(178)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (227)...(230)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (383)...(386)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (521)...(524)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (605)...(608)
<223> N-glycosylation site. Prosite id = PS00001

<400> 142
Met Asn Asn Lys Leu Pro Gly Arg Ser Ala Asn Gly Lys Arg His Ser 
1               5                   10                  15      


Ser Pro Arg Lys Ser Ile Leu Asn Ser Val Ile Gly Ile Leu Ala Gly 
            20                  25                  30          


Ser Leu Leu Ser Gly Leu Ala Leu Ala Asp Val Pro Ala Leu Thr Val 
        35                  40                  45              


Gln Gly Asn Lys Val Leu Val Gly Gly Lys Ser Val Ser Leu Glu Gly 
    50                  55                  60                  


Val Ser Leu Phe Trp Ser Ser Thr Gly Trp Gly Ala Glu Lys Tyr Tyr 
65                  70                  75                  80  


Asn Ala Ala Thr Val Lys Arg Ala Lys Thr Glu Phe Asn Ala Asn Leu 
                85                  90                  95      


Ile Arg Ala Ala Ile Gly His Gly Glu Asp Gly Ala Val Asp Arg Asp 
            100                 105                 110         


Trp Asn Gly Asn Met Ala Arg Leu Asp Ala Val Val Gln Ala Ala Ile 
        115                 120                 125             


Asp Asn Asp Met Tyr Val Ile Ile Asp Tyr His Ser His Lys Ala His 
    130                 135                 140                 


Gln Asn Trp Gly Ala Ala Asp Ala Phe Phe Lys Gln Val Ala Gln Lys 
145                 150                 155                 160 


Trp Gly Lys Tyr Asn Asn Val Ile Tyr Glu Ile Tyr Asn Glu Pro Val 
                165                 170                 175     


Gly Ala Asn Trp His Thr Asp Leu Lys Pro Tyr Ala Glu His Val Gly 
            180                 185                 190         


Ala Thr Ile Arg Ala Ile Asp Pro Asp Asn Leu Ile Ile Met Gly Thr 
        195                 200                 205             


Pro Gln Trp Ser Gln Asp Val Asp Ile Ala Ser Thr Asn Lys Ala Asn 
    210                 215                 220                 


Val Ser Asn Leu Ala Tyr Thr Ile His Phe Tyr Ala His Glu His Thr 
225                 230                 235                 240 


Gly Trp Leu Arg Ala Lys Ala Gln Thr Ala Leu Asn Asn Gly Ile Ala 
                245                 250                 255     


Leu Phe Ala Thr Glu Trp Gly Met Thr Gly Ala Asn Gly Arg Gly Pro 
            260                 265                 270         


Val Asn Lys Gly Glu Thr Trp Ala Trp Ile Asp Phe Leu Arg Ala Asn 
        275                 280                 285             


Gly Ile Ser His Ala Gly Trp Ala Phe His Asp Lys Asp Arg Asp Val 
    290                 295                 300                 


Ala Thr Gly Glu Val Glu Thr Ser Ser Tyr Phe Trp Ser Asp Gly Ser 
305                 310                 315                 320 


Leu Lys Glu Ser Gly His Phe Ile Lys Glu Ile Leu Ala Gly Arg Lys 
                325                 330                 335     


Asp Ile Gly Gly Gly Gly Gly Gly Gly Gly Asp Gly Gly Ser Thr Gly 
            340                 345                 350         


Ser Cys Gln Lys Ala Gly Leu Gly Asp Thr Leu Glu Ala Glu Asn Tyr 
        355                 360                 365             


Cys Gln Ala Ser Gly Ile Glu Thr Glu Asn Thr Ser Asp Ala Gly Gly 
    370                 375                 380                 


Gly Lys Asn Val Gly Tyr Ile Asp Asn Gly Asp Trp Leu Thr Tyr Ser 
385                 390                 395                 400 


Ile Asn Val Pro Ala Thr Gly Thr Tyr Lys Val Ser Tyr Arg Val Ala 
                405                 410                 415     


Ala Ser Ala Gly Gly Gly Gln Phe Gln Leu Glu Lys Ala Gly Gly Ser 
            420                 425                 430         


Pro Val Tyr Gly Asn Val Asn Val Pro Ala Thr Gly Gly Trp Gln Asn 
        435                 440                 445             


Trp Gln Thr Val Ser His Asn Val Val Leu Pro Ala Gly Glu Gln Leu 
    450                 455                 460                 


Ile Ala Ile Ala Ala Val Thr Gly Gly Phe Asn Val Asn Trp Leu Lys 
465                 470                 475                 480 


Val Glu Ser Thr Gly Thr Pro Pro Asp Thr Asn Pro Gly Thr Val Ile 
                485                 490                 495     


Thr Thr Ile Gln Ala Glu Ala Phe Ser Gln Gln Gln Gly Thr Glu Leu 
            500                 505                 510         


Glu Asn Thr Thr Asp Thr Gly Gly Gly Lys Asn Val Gly Tyr Ile Asp 
        515                 520                 525             


Ala Gly Asp Trp Leu Ser Tyr Ala Gly Thr Pro Val Asn Ile Pro Ser 
    530                 535                 540                 


Thr Gly Ser Tyr Val Ile Glu Tyr Arg Val Ala Ser Gln Ser Gly Gly 
545                 550                 555                 560 


Gly Ser Leu Thr Phe Glu Glu Ala Gly Gly Thr Pro Ala Tyr Gly Asn 
                565                 570                 575     


Leu Ala Ile Pro Ser Thr Gly Gly Trp Gln Thr Trp Thr Thr Val Lys 
            580                 585                 590         


His Thr Val Asn Leu Thr Ala Gly Ser His Lys Phe Gly Ile Lys Val 
        595                 600                 605             


Asn Ala Gly Gly Trp Asn Leu Asn Trp Ile Arg Ile Ser Lys Ala Asn 
    610                 615                 620                 


<210> 143
<211> 594
<212> DNA
<213> Clostridium thermocellum

<400> 143
gtggtatttc tgtggattgg aggaaatgac ctgcttttga gcggaaacgt gaatgcaaca     60

ggccttagta atcttataga ccagattttc acagtgaaac ccaatgtaac actgtttgtg    120

gccgattatt atccgtggcc tgaagcggtc aagcaataca atgcggtgat tccgggaata    180

gttcaacaga aggccaatgc cggcaagaaa gtttattttg taaagcttag tgagattcag    240

tttgacagga acaccgatat ttcatgggat ggtttgcact tgagcgaaat aggatacaca    300

aagattgcaa atatttggta caagtatacg attgacatac taaaagcttt ggcaggacaa    360

acgcagccaa ctccaagtcc gtctccgact cccacagatt ctcctctggt taaaaaaggt    420

gatgttaatt tggacggtca ggtcaattcg acagatttca gccttttgaa aagatatata    480

ctgaaagttg tggatataaa ttcaataaat gtgacaaatg ctgatatgaa caatgatggc    540

aatatcaact ctacagacat ttcaatacta aagagaatac ttcttagaaa ttag          594

<210> 144
<211> 197
<212> PRT
<213> Clostridium thermocellum

<220> 
<221> DOMAIN
<222> (141)...(161)
<223> Dockerin type I repeat

<220> 
<221> DOMAIN
<222> (175)...(195)
<223> Dockerin type I repeat

<220> 
<221> SITE
<222> (18)...(21)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (35)...(38)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (143)...(162)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (151)...(154)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (172)...(175)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (177)...(196)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (177)...(189)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (185)...(188)
<223> N-glycosylation site. Prosite id = PS00001

<400> 144
Met Val Phe Leu Trp Ile Gly Gly Asn Asp Leu Leu Leu Ser Gly Asn 
1               5                   10                  15      


Val Asn Ala Thr Gly Leu Ser Asn Leu Ile Asp Gln Ile Phe Thr Val 
            20                  25                  30          


Lys Pro Asn Val Thr Leu Phe Val Ala Asp Tyr Tyr Pro Trp Pro Glu 
        35                  40                  45              


Ala Val Lys Gln Tyr Asn Ala Val Ile Pro Gly Ile Val Gln Gln Lys 
    50                  55                  60                  


Ala Asn Ala Gly Lys Lys Val Tyr Phe Val Lys Leu Ser Glu Ile Gln 
65                  70                  75                  80  


Phe Asp Arg Asn Thr Asp Ile Ser Trp Asp Gly Leu His Leu Ser Glu 
                85                  90                  95      


Ile Gly Tyr Thr Lys Ile Ala Asn Ile Trp Tyr Lys Tyr Thr Ile Asp 
            100                 105                 110         


Ile Leu Lys Ala Leu Ala Gly Gln Thr Gln Pro Thr Pro Ser Pro Ser 
        115                 120                 125             


Pro Thr Pro Thr Asp Ser Pro Leu Val Lys Lys Gly Asp Val Asn Leu 
    130                 135                 140                 


Asp Gly Gln Val Asn Ser Thr Asp Phe Ser Leu Leu Lys Arg Tyr Ile 
145                 150                 155                 160 


Leu Lys Val Val Asp Ile Asn Ser Ile Asn Val Thr Asn Ala Asp Met 
                165                 170                 175     


Asn Asn Asp Gly Asn Ile Asn Ser Thr Asp Ile Ser Ile Leu Lys Arg 
            180                 185                 190         


Ile Leu Leu Arg Asn 
        195         


<210> 145
<211> 1689
<212> DNA
<213> Clostridium thermocellum

<400> 145
gtgtataccg gaacggcaac ttcaatgttt gacaatgata caaaagaaac tgtttatatt     60

gctgattttt catctgttaa tgaagaagga acgtactatc ttgccgtgcc gggagtagga    120

aaaagcgtaa actttaaaat tgcaatgaat gtatatgagg atgcttttaa aacagcaatg    180

ctgggaatgt atttgctgcg ctgcggcacc agtgtgtcgg ccacatacaa cggaatacac    240

tattcccatg gaccgtgcca tactaatgat gcatatcttg attatataaa cggacagcat    300

actaaaaaag acagtacaaa aggctggcat gatgcgggcg actacaacaa atatgtggta    360

aacgccggca taaccgttgg ttcaatgttc ctggcgtggg agcattttaa agaccagttg    420

gagcctgtgg cattggagat tcccgaaaag aacaattcaa taccggattt tcttgatgaa    480

ttaaaatatg agatagactg gattcttacc atgcaatacc ctgacgggag cggaagggtg    540

gctcataaag tttcgacaag gaactttggc ggctttatca tgcctgagaa cgaacacgac    600

gaaagatttt tcgtgccctg gagcagtgcc gcaacggcag actttgttgc catgacggcc    660

atggctgcaa gaatattcag gccttatgat cctcaatatg ctgaaaaatg tataaatgcg    720

gcaaaagtaa gctatgagtt tttgaagaac aatcctgcga atgtttttgc aaaccagagt    780

ggattctcaa caggagaata tgccactgtc agtgatgcag atgacagatt gtgggcggcg    840

gctgaaatgt gggagaccct gggagatgaa gaatacctta gagattttga aaacagggcg    900

gcgcaattct cgaaaaaaat agaagccgat tttgactggg ataatgttgc aaacttaggt    960

atgtttacat atcttttgtc agaaagaccg ggcaagaatc ctgctttggt gcagtcaata   1020

aaggatagtc tcctttccac tgcggattca attgtgagga ccagccaaaa ccatggctat   1080

ggcagaaccc ttggtacaac atattactgg ggatgcaacg gcacggttgt aagacagact   1140

atgatacttc aggttgcgaa caagatttca cccaacaatg attatgtaaa tgctgctctc   1200

gatgcgattt cacatgtatt tggaagaaac tattacaaca ggtcttatgt aacaggcctt   1260

ggtataaatc ctcctatgaa tcctcatgac agacgttcag gggctgacgg aatatgggag   1320

ccgtggcccg gttaccttgt aggaggagga tggcccggac cgaaggattg ggtggatatt   1380

caggacagtt atcagaccaa tgaaattgct ataaactgga atgcggcatt gatttatgcc   1440

cttgccggat ttgtcaacta taattctgct caaaatgaag tactgtacgg agatgtgaat   1500

gatgacggaa aagtaaactc cactgacttg actttgttaa aaagatatgt tcttaaagcc   1560

gtctcaactc tgccttcttc caaagctgaa aagaacgcag atgtaaatcg tgacggaaga   1620

gttaattcca gtgatgtcac aatactttca agatatttga taagggtaat cgagaaatta   1680

ccaatataa                                                           1689

<210> 146
<211> 562
<212> PRT
<213> Clostridium thermocellum

<220> 
<221> DOMAIN
<222> (51)...(484)
<223> Glycosyl hydrolase family 9

<220> 
<221> DOMAIN
<222> (498)...(518)
<223> Dockerin type I repeat

<220> 
<221> DOMAIN
<222> (534)...(554)
<223> Dockerin type I repeat

<220> 
<221> SITE
<222> (12)...(15)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (153)...(156)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (261)...(264)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (378)...(381)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (419)...(422)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (421)...(437)
<223> Glycosyl hydrolases family 9 active sites signature 1. Prosite id = PS00592

<220> 
<221> SITE
<222> (464)...(482)
<223> Glycosyl hydrolases family 9 active sites signature 2. Prosite id = PS00698

<220> 
<221> SITE
<222> (505)...(517)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (505)...(524)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (513)...(516)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (542)...(561)
<223> Clostridium cellulosome enzymes repeated domain signature. Prosite id = PS00448

<220> 
<221> SITE
<222> (542)...(554)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (550)...(553)
<223> N-glycosylation site. Prosite id = PS00001

<400> 146
Met Tyr Thr Gly Thr Ala Thr Ser Met Phe Asp Asn Asp Thr Lys Glu 
1               5                   10                  15      


Thr Val Tyr Ile Ala Asp Phe Ser Ser Val Asn Glu Glu Gly Thr Tyr 
            20                  25                  30          


Tyr Leu Ala Val Pro Gly Val Gly Lys Ser Val Asn Phe Lys Ile Ala 
        35                  40                  45              


Met Asn Val Tyr Glu Asp Ala Phe Lys Thr Ala Met Leu Gly Met Tyr 
    50                  55                  60                  


Leu Leu Arg Cys Gly Thr Ser Val Ser Ala Thr Tyr Asn Gly Ile His 
65                  70                  75                  80  


Tyr Ser His Gly Pro Cys His Thr Asn Asp Ala Tyr Leu Asp Tyr Ile 
                85                  90                  95      


Asn Gly Gln His Thr Lys Lys Asp Ser Thr Lys Gly Trp His Asp Ala 
            100                 105                 110         


Gly Asp Tyr Asn Lys Tyr Val Val Asn Ala Gly Ile Thr Val Gly Ser 
        115                 120                 125             


Met Phe Leu Ala Trp Glu His Phe Lys Asp Gln Leu Glu Pro Val Ala 
    130                 135                 140                 


Leu Glu Ile Pro Glu Lys Asn Asn Ser Ile Pro Asp Phe Leu Asp Glu 
145                 150                 155                 160 


Leu Lys Tyr Glu Ile Asp Trp Ile Leu Thr Met Gln Tyr Pro Asp Gly 
                165                 170                 175     


Ser Gly Arg Val Ala His Lys Val Ser Thr Arg Asn Phe Gly Gly Phe 
            180                 185                 190         


Ile Met Pro Glu Asn Glu His Asp Glu Arg Phe Phe Val Pro Trp Ser 
        195                 200                 205             


Ser Ala Ala Thr Ala Asp Phe Val Ala Met Thr Ala Met Ala Ala Arg 
    210                 215                 220                 


Ile Phe Arg Pro Tyr Asp Pro Gln Tyr Ala Glu Lys Cys Ile Asn Ala 
225                 230                 235                 240 


Ala Lys Val Ser Tyr Glu Phe Leu Lys Asn Asn Pro Ala Asn Val Phe 
                245                 250                 255     


Ala Asn Gln Ser Gly Phe Ser Thr Gly Glu Tyr Ala Thr Val Ser Asp 
            260                 265                 270         


Ala Asp Asp Arg Leu Trp Ala Ala Ala Glu Met Trp Glu Thr Leu Gly 
        275                 280                 285             


Asp Glu Glu Tyr Leu Arg Asp Phe Glu Asn Arg Ala Ala Gln Phe Ser 
    290                 295                 300                 


Lys Lys Ile Glu Ala Asp Phe Asp Trp Asp Asn Val Ala Asn Leu Gly 
305                 310                 315                 320 


Met Phe Thr Tyr Leu Leu Ser Glu Arg Pro Gly Lys Asn Pro Ala Leu 
                325                 330                 335     


Val Gln Ser Ile Lys Asp Ser Leu Leu Ser Thr Ala Asp Ser Ile Val 
            340                 345                 350         


Arg Thr Ser Gln Asn His Gly Tyr Gly Arg Thr Leu Gly Thr Thr Tyr 
        355                 360                 365             


Tyr Trp Gly Cys Asn Gly Thr Val Val Arg Gln Thr Met Ile Leu Gln 
    370                 375                 380                 


Val Ala Asn Lys Ile Ser Pro Asn Asn Asp Tyr Val Asn Ala Ala Leu 
385                 390                 395                 400 


Asp Ala Ile Ser His Val Phe Gly Arg Asn Tyr Tyr Asn Arg Ser Tyr 
                405                 410                 415     


Val Thr Gly Leu Gly Ile Asn Pro Pro Met Asn Pro His Asp Arg Arg 
            420                 425                 430         


Ser Gly Ala Asp Gly Ile Trp Glu Pro Trp Pro Gly Tyr Leu Val Gly 
        435                 440                 445             


Gly Gly Trp Pro Gly Pro Lys Asp Trp Val Asp Ile Gln Asp Ser Tyr 
    450                 455                 460                 


Gln Thr Asn Glu Ile Ala Ile Asn Trp Asn Ala Ala Leu Ile Tyr Ala 
465                 470                 475                 480 


Leu Ala Gly Phe Val Asn Tyr Asn Ser Ala Gln Asn Glu Val Leu Tyr 
                485                 490                 495     


Gly Asp Val Asn Asp Asp Gly Lys Val Asn Ser Thr Asp Leu Thr Leu 
            500                 505                 510         


Leu Lys Arg Tyr Val Leu Lys Ala Val Ser Thr Leu Pro Ser Ser Lys 
        515                 520                 525             


Ala Glu Lys Asn Ala Asp Val Asn Arg Asp Gly Arg Val Asn Ser Ser 
    530                 535                 540                 


Asp Val Thr Ile Leu Ser Arg Tyr Leu Ile Arg Val Ile Glu Lys Leu 
545                 550                 555                 560 


Pro Ile 
        


<210> 147
<211> 972
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 147
atggcaaaaa aaccaagata tttagtacat gataattaca cggccgaccc atcggctcat     60

gtgttcaatg gccgtatcta tatttaccca tcgcacgatg tggatgcagg catccctgaa    120

aatgacctcg gcgaccattt cgatatgcgc gactaccatg tctattcaat ggacgatgtt    180

aatggcgaag ttacagatca tggggtagtt cttcatgtta aagacatacc ctgggctggt    240

cgccagctct gggctcctga tgcagcctat aaaaatggga aatactatct ctatttccct    300

ttaaaggata aaactgatat tttcaggata ggtgttgctg taagcgataa accggaaggc    360

ccgtttattc ctgagccaga tcctatcaga ggaagttata gtattgatcc tgctgttctt    420

gatgatggtg atggcaattt ttacatgtac tttggaggtt tgtggggcgg ccaattacag    480

cgctaccgca acaacaaagc tattgaatgt ggacacgagc cggctgacaa tgaacccgcc    540

ctttcggccc gagtggtgcg ccttagcgac gatatgctgc aattcgctga agaaccccgc    600

gatgtgctgc ttctcgacga aaacggggag cccattaagg ctggcgacca cgaccgtcgt    660

tatttcgaag gcccatggat gcataagtac aacggaaaat attatttttc ctattcaacc    720

ggcaatacac attttctctg ctatgccata ggcgacaatc cttacggccc atttacctac    780

aaaggaaaaa tacttactcc agttgtaggt tggaccacac accattctat ttgcgagttt    840

aaagggaaat ggtatctttt ctatcacgac agtgtacctt ccggcggcaa aacatggctt    900

cgaagtatta aggttattga gttggaaatc aaccccgatg gtactattgt aaccatcgac    960

ggaatgttat ag                                                        972

<210> 148
<211> 323
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (5)...(316)
<223> Glycosyl hydrolases family 43

<220> 
<221> SITE
<222> (12)...(15)
<223> N-glycosylation site. Prosite id = PS00001

<400> 148
Met Ala Lys Lys Pro Arg Tyr Leu Val His Asp Asn Tyr Thr Ala Asp 
1               5                   10                  15      


Pro Ser Ala His Val Phe Asn Gly Arg Ile Tyr Ile Tyr Pro Ser His 
            20                  25                  30          


Asp Val Asp Ala Gly Ile Pro Glu Asn Asp Leu Gly Asp His Phe Asp 
        35                  40                  45              


Met Arg Asp Tyr His Val Tyr Ser Met Asp Asp Val Asn Gly Glu Val 
    50                  55                  60                  


Thr Asp His Gly Val Val Leu His Val Lys Asp Ile Pro Trp Ala Gly 
65                  70                  75                  80  


Arg Gln Leu Trp Ala Pro Asp Ala Ala Tyr Lys Asn Gly Lys Tyr Tyr 
                85                  90                  95      


Leu Tyr Phe Pro Leu Lys Asp Lys Thr Asp Ile Phe Arg Ile Gly Val 
            100                 105                 110         


Ala Val Ser Asp Lys Pro Glu Gly Pro Phe Ile Pro Glu Pro Asp Pro 
        115                 120                 125             


Ile Arg Gly Ser Tyr Ser Ile Asp Pro Ala Val Leu Asp Asp Gly Asp 
    130                 135                 140                 


Gly Asn Phe Tyr Met Tyr Phe Gly Gly Leu Trp Gly Gly Gln Leu Gln 
145                 150                 155                 160 


Arg Tyr Arg Asn Asn Lys Ala Ile Glu Cys Gly His Glu Pro Ala Asp 
                165                 170                 175     


Asn Glu Pro Ala Leu Ser Ala Arg Val Val Arg Leu Ser Asp Asp Met 
            180                 185                 190         


Leu Gln Phe Ala Glu Glu Pro Arg Asp Val Leu Leu Leu Asp Glu Asn 
        195                 200                 205             


Gly Glu Pro Ile Lys Ala Gly Asp His Asp Arg Arg Tyr Phe Glu Gly 
    210                 215                 220                 


Pro Trp Met His Lys Tyr Asn Gly Lys Tyr Tyr Phe Ser Tyr Ser Thr 
225                 230                 235                 240 


Gly Asn Thr His Phe Leu Cys Tyr Ala Ile Gly Asp Asn Pro Tyr Gly 
                245                 250                 255     


Pro Phe Thr Tyr Lys Gly Lys Ile Leu Thr Pro Val Val Gly Trp Thr 
            260                 265                 270         


Thr His His Ser Ile Cys Glu Phe Lys Gly Lys Trp Tyr Leu Phe Tyr 
        275                 280                 285             


His Asp Ser Val Pro Ser Gly Gly Lys Thr Trp Leu Arg Ser Ile Lys 
    290                 295                 300                 


Val Ile Glu Leu Glu Ile Asn Pro Asp Gly Thr Ile Val Thr Ile Asp 
305                 310                 315                 320 


Gly Met Leu 
            


<210> 149
<211> 1431
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 149
atgaaaacta attatagatt tggagggatt aaaatgtttt ctaaagactt tttatttgga     60

gcatctttgt ctggatttca gtttgaaatg ggaaatccaa ataatgaaga ggaattagac    120

aaaaacacag attggtttgt ttgggtaaga gacttaggaa atataattaa tggaaaagtt    180

agtggagatc ttcctgaata tggtgcagga tattatacta attacaaggc agttcacaat    240

cttgcaaaag aatttggaat gaatgcttta agaattggaa tagaatggtc aagaattttt    300

aaagaaagca caaaagatat tagcccagat gatcctaaca tgttagaaaa acttgatcaa    360

ctagctgata aaaaggcgat tgaacattat agagatgtat tagaagacat aaaaagtaaa    420

ggacttgtag ctattgttaa tttgtcgcac tttactttac cactttggct tcatgatcca    480

ataaatgtac acaaaggaaa ggaaacaaaa aagcttggtt gggtaagtga tgatgcacca    540

atagaatttg ctaaatacgc agaatacatc gcatggaaat ttaaagatat tgttgatatg    600

tggtcttcaa tgaatgaacc tcacgtggta agtcagcttg gttattttca aacaagtgca    660

ggttttccac caagctattt taatccttca tggtatctaa aaagtcttga aaatcaagct    720

ttagcacata accttgctta tgatgctata aaaaaacata cagacaagcc agttggagtt    780

atttattcat ttacgtggta tgatacagtt aataatgatg aagaaatatt cgaaagtgca    840

atgtttttaa ataactggaa ttatatggat agagtaaagg ataaaattga ctttgtaggt    900

gtaaattatt atacaagggc tgttatagac cgacttttgg ttcctataaa aattgataat    960

tatgaattaa attggtatac tcttagtggt tatgggtatt catgtgttga agatggtttt   1020

gcaaattcaa aaagaccttc aagcgaaatt ggttgggaga tatatccaga agggctttat   1080

aatattctca aagaaatata caatagatat ggaaagcaaa tctatataac ggaaaatggt   1140

atagcagatt caagcgataa atacagaagc ttttatatta tttcccacct ttatgcagta   1200

gaaaaagcaa taaacgaagg agtaccagta aaaggatacc ttcactggtc aataatagat   1260

aattatgaat gggcaaaagg ttatggtaaa agatttggac ttgcctatac agattttgaa   1320

agaaaaactt atattccaag accttctatg tatattttaa gagaaataat aaaagaaaga   1380

actattgata agtttaaagg atatgatccg tacggattaa tgaatttttg a            1431

<210> 150
<211> 476
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (9)...(462)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (17)...(31)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (149)...(152)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (379)...(387)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 150
Met Lys Thr Asn Tyr Arg Phe Gly Gly Ile Lys Met Phe Ser Lys Asp 
1               5                   10                  15      


Phe Leu Phe Gly Ala Ser Leu Ser Gly Phe Gln Phe Glu Met Gly Asn 
            20                  25                  30          


Pro Asn Asn Glu Glu Glu Leu Asp Lys Asn Thr Asp Trp Phe Val Trp 
        35                  40                  45              


Val Arg Asp Leu Gly Asn Ile Ile Asn Gly Lys Val Ser Gly Asp Leu 
    50                  55                  60                  


Pro Glu Tyr Gly Ala Gly Tyr Tyr Thr Asn Tyr Lys Ala Val His Asn 
65                  70                  75                  80  


Leu Ala Lys Glu Phe Gly Met Asn Ala Leu Arg Ile Gly Ile Glu Trp 
                85                  90                  95      


Ser Arg Ile Phe Lys Glu Ser Thr Lys Asp Ile Ser Pro Asp Asp Pro 
            100                 105                 110         


Asn Met Leu Glu Lys Leu Asp Gln Leu Ala Asp Lys Lys Ala Ile Glu 
        115                 120                 125             


His Tyr Arg Asp Val Leu Glu Asp Ile Lys Ser Lys Gly Leu Val Ala 
    130                 135                 140                 


Ile Val Asn Leu Ser His Phe Thr Leu Pro Leu Trp Leu His Asp Pro 
145                 150                 155                 160 


Ile Asn Val His Lys Gly Lys Glu Thr Lys Lys Leu Gly Trp Val Ser 
                165                 170                 175     


Asp Asp Ala Pro Ile Glu Phe Ala Lys Tyr Ala Glu Tyr Ile Ala Trp 
            180                 185                 190         


Lys Phe Lys Asp Ile Val Asp Met Trp Ser Ser Met Asn Glu Pro His 
        195                 200                 205             


Val Val Ser Gln Leu Gly Tyr Phe Gln Thr Ser Ala Gly Phe Pro Pro 
    210                 215                 220                 


Ser Tyr Phe Asn Pro Ser Trp Tyr Leu Lys Ser Leu Glu Asn Gln Ala 
225                 230                 235                 240 


Leu Ala His Asn Leu Ala Tyr Asp Ala Ile Lys Lys His Thr Asp Lys 
                245                 250                 255     


Pro Val Gly Val Ile Tyr Ser Phe Thr Trp Tyr Asp Thr Val Asn Asn 
            260                 265                 270         


Asp Glu Glu Ile Phe Glu Ser Ala Met Phe Leu Asn Asn Trp Asn Tyr 
        275                 280                 285             


Met Asp Arg Val Lys Asp Lys Ile Asp Phe Val Gly Val Asn Tyr Tyr 
    290                 295                 300                 


Thr Arg Ala Val Ile Asp Arg Leu Leu Val Pro Ile Lys Ile Asp Asn 
305                 310                 315                 320 


Tyr Glu Leu Asn Trp Tyr Thr Leu Ser Gly Tyr Gly Tyr Ser Cys Val 
                325                 330                 335     


Glu Asp Gly Phe Ala Asn Ser Lys Arg Pro Ser Ser Glu Ile Gly Trp 
            340                 345                 350         


Glu Ile Tyr Pro Glu Gly Leu Tyr Asn Ile Leu Lys Glu Ile Tyr Asn 
        355                 360                 365             


Arg Tyr Gly Lys Gln Ile Tyr Ile Thr Glu Asn Gly Ile Ala Asp Ser 
    370                 375                 380                 


Ser Asp Lys Tyr Arg Ser Phe Tyr Ile Ile Ser His Leu Tyr Ala Val 
385                 390                 395                 400 


Glu Lys Ala Ile Asn Glu Gly Val Pro Val Lys Gly Tyr Leu His Trp 
                405                 410                 415     


Ser Ile Ile Asp Asn Tyr Glu Trp Ala Lys Gly Tyr Gly Lys Arg Phe 
            420                 425                 430         


Gly Leu Ala Tyr Thr Asp Phe Glu Arg Lys Thr Tyr Ile Pro Arg Pro 
        435                 440                 445             


Ser Met Tyr Ile Leu Arg Glu Ile Ile Lys Glu Arg Thr Ile Asp Lys 
    450                 455                 460                 


Phe Lys Gly Tyr Asp Pro Tyr Gly Leu Met Asn Phe 
465                 470                 475     


<210> 151
<211> 1008
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 151
atgaggaact tttttaaagt ctttacatta gttttggtgg tgatttctgt gatgctgttt     60

ggtgaaaata aaaaacttac ggcatttgat tacaataaaa tgataggtat tggaattaac    120

atgggaaatg cgcttgaagc accatttgaa ggtgcatggg gagtggttat aaaagatgaa    180

tattttaaaa taataaaaga aaaaggattt gattcagtta gaatacctat tagatggtca    240

gcgcatattt tagataaacc accttataca atagaaaagg attttttaga aagagtaaaa    300

catgtagttg ataaggcttt ggaaaatgat ttggttgtaa taattaattg ccatcatttt    360

gaggagttat atgaaaaccc tgaaaagtac ggagaagttc ttttagaaat ttggaaacaa    420

gtatcagatt tctttaaaaa ttattctgac aagctttatt ttgaaattta taacgaacct    480

gcaaataatt taaccccaga aaaatggaat gatttgtatc caaaagtttt aaaagaaatc    540

aggaaaacaa acccaacaag aattgtaata gtagatgtgc ctcattgggg aaattacaat    600

tacatcaatc aattaaaact tgtaaatgat ccatatttaa tcgtatcttt tcactattat    660

gaaccattca actttactca ccaaggtgct gaatggataa acccgcgcct tccagtgggg    720

gttaaatgga gtgcaaaaag ttatgaaata gaacagataa aatcacattt tgaatatgta    780

aattcttttt caaaaaagta caatgttcca atatttttag gggaatttgg ggcttactca    840

aaggcagata tggattctcg aatcaaatgg acaaaagcgg ttagtcaaat tgctagagaa    900

tttggatttt caatttgtta ttgggaattt tgttctggtt ttgggcttta caataaaata    960

acaaatactt ggaatgaagg attgttaaac gctgtttttg gaaaataa                1008

<210> 152
<211> 335
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> SIGNAL
<222> (1)...(21)

<220> 
<221> DOMAIN
<222> (42)...(318)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (149)...(152)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (227)...(230)
<223> N-glycosylation site. Prosite id = PS00001

<400> 152
Met Arg Asn Phe Phe Lys Val Phe Thr Leu Val Leu Val Val Ile Ser 
1               5                   10                  15      


Val Met Leu Phe Gly Glu Asn Lys Lys Leu Thr Ala Phe Asp Tyr Asn 
            20                  25                  30          


Lys Met Ile Gly Ile Gly Ile Asn Met Gly Asn Ala Leu Glu Ala Pro 
        35                  40                  45              


Phe Glu Gly Ala Trp Gly Val Val Ile Lys Asp Glu Tyr Phe Lys Ile 
    50                  55                  60                  


Ile Lys Glu Lys Gly Phe Asp Ser Val Arg Ile Pro Ile Arg Trp Ser 
65                  70                  75                  80  


Ala His Ile Leu Asp Lys Pro Pro Tyr Thr Ile Glu Lys Asp Phe Leu 
                85                  90                  95      


Glu Arg Val Lys His Val Val Asp Lys Ala Leu Glu Asn Asp Leu Val 
            100                 105                 110         


Val Ile Ile Asn Cys His His Phe Glu Glu Leu Tyr Glu Asn Pro Glu 
        115                 120                 125             


Lys Tyr Gly Glu Val Leu Leu Glu Ile Trp Lys Gln Val Ser Asp Phe 
    130                 135                 140                 


Phe Lys Asn Tyr Ser Asp Lys Leu Tyr Phe Glu Ile Tyr Asn Glu Pro 
145                 150                 155                 160 


Ala Asn Asn Leu Thr Pro Glu Lys Trp Asn Asp Leu Tyr Pro Lys Val 
                165                 170                 175     


Leu Lys Glu Ile Arg Lys Thr Asn Pro Thr Arg Ile Val Ile Val Asp 
            180                 185                 190         


Val Pro His Trp Gly Asn Tyr Asn Tyr Ile Asn Gln Leu Lys Leu Val 
        195                 200                 205             


Asn Asp Pro Tyr Leu Ile Val Ser Phe His Tyr Tyr Glu Pro Phe Asn 
    210                 215                 220                 


Phe Thr His Gln Gly Ala Glu Trp Ile Asn Pro Arg Leu Pro Val Gly 
225                 230                 235                 240 


Val Lys Trp Ser Ala Lys Ser Tyr Glu Ile Glu Gln Ile Lys Ser His 
                245                 250                 255     


Phe Glu Tyr Val Asn Ser Phe Ser Lys Lys Tyr Asn Val Pro Ile Phe 
            260                 265                 270         


Leu Gly Glu Phe Gly Ala Tyr Ser Lys Ala Asp Met Asp Ser Arg Ile 
        275                 280                 285             


Lys Trp Thr Lys Ala Val Ser Gln Ile Ala Arg Glu Phe Gly Phe Ser 
    290                 295                 300                 


Ile Cys Tyr Trp Glu Phe Cys Ser Gly Phe Gly Leu Tyr Asn Lys Ile 
305                 310                 315                 320 


Thr Asn Thr Trp Asn Glu Gly Leu Leu Asn Ala Val Phe Gly Lys 
                325                 330                 335 


<210> 153
<211> 1068
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 153
atgaaaaggc gtaactggaa ttatcttctg attattttat tggtcatttc cgcattcact     60

ttgatatcgg cccagtgcga taaagataaa actaaagaag gagagaagaa tatggttcaa    120

tcaaccaaag gcaagtctgt ctttgaattg aataaaatga ttgggaaagg agtaaatata    180

ggaaatgctt tggaagcacc ggttgaaggt gcatggggag ttagaataga ggacgaatac    240

ttcgaagtca taaagaaaag gggatttgac tctgtaagga ttcccataag atggtcggcg    300

cacatttccg ataaaccgcc gtacaaaatc gaagaatatt ttctcgaaag ggttaaacat    360

gtagtagata aggcacttga gaataacctt actgtgatta tcaacaccca tcatttcgaa    420

gaactctacc aagacccaga caaatatggt ggagtgctgg ttgaaatctg gcgccaagtt    480

gccagtttct tcaaagacta tcctgaaacg ttattcttcg aaatctacaa tgaacctgcg    540

cagaacttaa caggcgataa gtggaacaag ctctatccaa aggtgcttga ggttattaga    600

gagagcaatc cggacagagt agttatcatc gacgttccga actgggctca ttacagtgcg    660

ataagcagtt tgaaattagt gaacgataaa cgtatcatcg tttcatttca ctactacgaa    720

cctttcaatt tcacacatca gggtgctgaa tgggtcaatc ctgttccacc agttggtgtg    780

aagtggaacg gggaggattg ggaagtaaat cagatcaaaa atcatttcag atacgttagt    840

gactgggcaa aaaagaataa cgtgcctatc tttcttggcg aatttggtgc ttattcgaag    900

gcggacatgg attcaagagt caaatggacg gaaactgtga gaaaaaccgc tgaagagttt    960

ggtttttcct atgcgtattg ggaattctgt gcagggtttg gcatatacga taggtgggct   1020

gaaaaatgga tcgaaccgct ggcaaccgct gtggttggga ataattaa                1068

<210> 154
<211> 355
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (61)...(337)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (184)...(187)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (246)...(249)
<223> N-glycosylation site. Prosite id = PS00001

<400> 154
Met Lys Arg Arg Asn Trp Asn Tyr Leu Leu Ile Ile Leu Leu Val Ile 
1               5                   10                  15      


Ser Ala Phe Thr Leu Ile Ser Ala Gln Cys Asp Lys Asp Lys Thr Lys 
            20                  25                  30          


Glu Gly Glu Lys Asn Met Val Gln Ser Thr Lys Gly Lys Ser Val Phe 
        35                  40                  45              


Glu Leu Asn Lys Met Ile Gly Lys Gly Val Asn Ile Gly Asn Ala Leu 
    50                  55                  60                  


Glu Ala Pro Val Glu Gly Ala Trp Gly Val Arg Ile Glu Asp Glu Tyr 
65                  70                  75                  80  


Phe Glu Val Ile Lys Lys Arg Gly Phe Asp Ser Val Arg Ile Pro Ile 
                85                  90                  95      


Arg Trp Ser Ala His Ile Ser Asp Lys Pro Pro Tyr Lys Ile Glu Glu 
            100                 105                 110         


Tyr Phe Leu Glu Arg Val Lys His Val Val Asp Lys Ala Leu Glu Asn 
        115                 120                 125             


Asn Leu Thr Val Ile Ile Asn Thr His His Phe Glu Glu Leu Tyr Gln 
    130                 135                 140                 


Asp Pro Asp Lys Tyr Gly Gly Val Leu Val Glu Ile Trp Arg Gln Val 
145                 150                 155                 160 


Ala Ser Phe Phe Lys Asp Tyr Pro Glu Thr Leu Phe Phe Glu Ile Tyr 
                165                 170                 175     


Asn Glu Pro Ala Gln Asn Leu Thr Gly Asp Lys Trp Asn Lys Leu Tyr 
            180                 185                 190         


Pro Lys Val Leu Glu Val Ile Arg Glu Ser Asn Pro Asp Arg Val Val 
        195                 200                 205             


Ile Ile Asp Val Pro Asn Trp Ala His Tyr Ser Ala Ile Ser Ser Leu 
    210                 215                 220                 


Lys Leu Val Asn Asp Lys Arg Ile Ile Val Ser Phe His Tyr Tyr Glu 
225                 230                 235                 240 


Pro Phe Asn Phe Thr His Gln Gly Ala Glu Trp Val Asn Pro Val Pro 
                245                 250                 255     


Pro Val Gly Val Lys Trp Asn Gly Glu Asp Trp Glu Val Asn Gln Ile 
            260                 265                 270         


Lys Asn His Phe Arg Tyr Val Ser Asp Trp Ala Lys Lys Asn Asn Val 
        275                 280                 285             


Pro Ile Phe Leu Gly Glu Phe Gly Ala Tyr Ser Lys Ala Asp Met Asp 
    290                 295                 300                 


Ser Arg Val Lys Trp Thr Glu Thr Val Arg Lys Thr Ala Glu Glu Phe 
305                 310                 315                 320 


Gly Phe Ser Tyr Ala Tyr Trp Glu Phe Cys Ala Gly Phe Gly Ile Tyr 
                325                 330                 335     


Asp Arg Trp Ala Glu Lys Trp Ile Glu Pro Leu Ala Thr Ala Val Val 
            340                 345                 350         


Gly Asn Asn 
        355 


<210> 155
<211> 1794
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 155
atgaagagag ttgcaattat attaagtgtc ctgacactac ttgcaacaat tatgagtttt     60

ccagcctctg ctcaagagcc cataatacga gagttcccag gcaaccaatg gcctacagcg    120

tttgtcgaca tgaatgggga tggcatttca gatttcgtaa tggaaatcaa tccatggaac    180

atccaagacg cagatggaaa gcaaatcatg gaatacgatc ctgttagtaa tgaaatacgg    240

ttctttagca acttaacaaa catcattact aaaaatccca atacttgggt tcatgggtac    300

ccggagatat acattgggaa caagccttgg aacagaaata aagcggatgg acttatagat    360

ctaccaaaga aggtgtcaga tcttactggc tttactgtaa aatttagtta caaccttgag    420

catgatccta atctgccaat taattttgca atggaaactt ggttaacaac ggatcaacta    480

aggacaactg gagttagggc tggagaagta gagatcatgg tctggcttta taataacaag    540

ataaatccgg ctggtagaat aatagacaca gtcaagatac ccatcattat taatggaaaa    600

ctcataaacg ggacatttga agtttggaag aaagatagca ttggtagtgg atggacgtac    660

tttgccttta gattgacaac accaatgaag agtgcagaaa tagaaattga tccaacatta    720

tttgtcaaaa aagttcaaga atacacacaa gtggatatag gaaatctcta catgcaggat    780

tgggaaattg gtacagagtt tggaaatcct acaactacat cagcactctt taactggagt    840

ataaagaatt tcgaagttag caagcaacct ttactccaac aagatgtgga atcaccctca    900

acaacgtctt ccataccaca aacccagacg acatcttcaa acattcagaa tccaataaga    960

cctggaacac ttgacgttag ggtaaatagc tggggaagtg ctacccaata ctcatgcacc   1020

ctctatcttg acggtcaata tgactggact attgaagtca agctaaagga tggttcaaag   1080

atcacaagtt actggagcgc tgatcttacc tacaaggaag atgggaccgc agtattcaca   1140

ccaaagagct ggaacaaggg tccaacagct agcttcggat tcatagcatc cggagatatg   1200

cccgttgagt caatagtgct tgttataaac ggtgaggttt gggatgtgtg gcctgaggtg   1260

tctcaggtgc cgtctgagac gggtactact acgactacta ccacagcaac tccagctcca   1320

actacaacca ctacaactac cactacaact acgagcactc caacccagac tactaccact   1380

actacagtaa ctccggctcc aactactacc acgacaacta ctgtctcaac aactactacc   1440

actataacta cgagtactgt gactactaca actacacagg ttgttccagt taggcctggt   1500

tcgatgagtg ttaaggttaa tgattggggt actggtgggc agtttgacat taccctaaac   1560

cttggtgggc agtacgattg ggttgttaaa gtccagcttg attcatcaac ccaaatgggc   1620

aactactggg gtgttcagaa gagccaagag ggtgattggg tagtcttcac gccactcagt   1680

tggaacaagg gtccaacagc cgtctttgga ttcattgtaa acggcccagt cagcggagtc   1740

aaacaaataa tcctcgaaat aaacggagaa gtttgggaca tatggtcaca atga         1794

<210> 156
<211> 597
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(24)

<220> 
<221> DOMAIN
<222> (117)...(289)
<223> Glycosyl hydrolase family 12

<220> 
<221> DOMAIN
<222> (500)...(582)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (85)...(88)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (206)...(209)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (282)...(285)
<223> N-glycosylation site. Prosite id = PS00001

<400> 156
Met Lys Arg Val Ala Ile Ile Leu Ser Val Leu Thr Leu Leu Ala Thr 
1               5                   10                  15      


Ile Met Ser Phe Pro Ala Ser Ala Gln Glu Pro Ile Ile Arg Glu Phe 
            20                  25                  30          


Pro Gly Asn Gln Trp Pro Thr Ala Phe Val Asp Met Asn Gly Asp Gly 
        35                  40                  45              


Ile Ser Asp Phe Val Met Glu Ile Asn Pro Trp Asn Ile Gln Asp Ala 
    50                  55                  60                  


Asp Gly Lys Gln Ile Met Glu Tyr Asp Pro Val Ser Asn Glu Ile Arg 
65                  70                  75                  80  


Phe Phe Ser Asn Leu Thr Asn Ile Ile Thr Lys Asn Pro Asn Thr Trp 
                85                  90                  95      


Val His Gly Tyr Pro Glu Ile Tyr Ile Gly Asn Lys Pro Trp Asn Arg 
            100                 105                 110         


Asn Lys Ala Asp Gly Leu Ile Asp Leu Pro Lys Lys Val Ser Asp Leu 
        115                 120                 125             


Thr Gly Phe Thr Val Lys Phe Ser Tyr Asn Leu Glu His Asp Pro Asn 
    130                 135                 140                 


Leu Pro Ile Asn Phe Ala Met Glu Thr Trp Leu Thr Thr Asp Gln Leu 
145                 150                 155                 160 


Arg Thr Thr Gly Val Arg Ala Gly Glu Val Glu Ile Met Val Trp Leu 
                165                 170                 175     


Tyr Asn Asn Lys Ile Asn Pro Ala Gly Arg Ile Ile Asp Thr Val Lys 
            180                 185                 190         


Ile Pro Ile Ile Ile Asn Gly Lys Leu Ile Asn Gly Thr Phe Glu Val 
        195                 200                 205             


Trp Lys Lys Asp Ser Ile Gly Ser Gly Trp Thr Tyr Phe Ala Phe Arg 
    210                 215                 220                 


Leu Thr Thr Pro Met Lys Ser Ala Glu Ile Glu Ile Asp Pro Thr Leu 
225                 230                 235                 240 


Phe Val Lys Lys Val Gln Glu Tyr Thr Gln Val Asp Ile Gly Asn Leu 
                245                 250                 255     


Tyr Met Gln Asp Trp Glu Ile Gly Thr Glu Phe Gly Asn Pro Thr Thr 
            260                 265                 270         


Thr Ser Ala Leu Phe Asn Trp Ser Ile Lys Asn Phe Glu Val Ser Lys 
        275                 280                 285             


Gln Pro Leu Leu Gln Gln Asp Val Glu Ser Pro Ser Thr Thr Ser Ser 
    290                 295                 300                 


Ile Pro Gln Thr Gln Thr Thr Ser Ser Asn Ile Gln Asn Pro Ile Arg 
305                 310                 315                 320 


Pro Gly Thr Leu Asp Val Arg Val Asn Ser Trp Gly Ser Ala Thr Gln 
                325                 330                 335     


Tyr Ser Cys Thr Leu Tyr Leu Asp Gly Gln Tyr Asp Trp Thr Ile Glu 
            340                 345                 350         


Val Lys Leu Lys Asp Gly Ser Lys Ile Thr Ser Tyr Trp Ser Ala Asp 
        355                 360                 365             


Leu Thr Tyr Lys Glu Asp Gly Thr Ala Val Phe Thr Pro Lys Ser Trp 
    370                 375                 380                 


Asn Lys Gly Pro Thr Ala Ser Phe Gly Phe Ile Ala Ser Gly Asp Met 
385                 390                 395                 400 


Pro Val Glu Ser Ile Val Leu Val Ile Asn Gly Glu Val Trp Asp Val 
                405                 410                 415     


Trp Pro Glu Val Ser Gln Val Pro Ser Glu Thr Gly Thr Thr Thr Thr 
            420                 425                 430         


Thr Thr Thr Ala Thr Pro Ala Pro Thr Thr Thr Thr Thr Thr Thr Thr 
        435                 440                 445             


Thr Thr Thr Ser Thr Pro Thr Gln Thr Thr Thr Thr Thr Thr Val Thr 
    450                 455                 460                 


Pro Ala Pro Thr Thr Thr Thr Thr Thr Thr Val Ser Thr Thr Thr Thr 
465                 470                 475                 480 


Thr Ile Thr Thr Ser Thr Val Thr Thr Thr Thr Thr Gln Val Val Pro 
                485                 490                 495     


Val Arg Pro Gly Ser Met Ser Val Lys Val Asn Asp Trp Gly Thr Gly 
            500                 505                 510         


Gly Gln Phe Asp Ile Thr Leu Asn Leu Gly Gly Gln Tyr Asp Trp Val 
        515                 520                 525             


Val Lys Val Gln Leu Asp Ser Ser Thr Gln Met Gly Asn Tyr Trp Gly 
    530                 535                 540                 


Val Gln Lys Ser Gln Glu Gly Asp Trp Val Val Phe Thr Pro Leu Ser 
545                 550                 555                 560 


Trp Asn Lys Gly Pro Thr Ala Val Phe Gly Phe Ile Val Asn Gly Pro 
                565                 570                 575     


Val Ser Gly Val Lys Gln Ile Ile Leu Glu Ile Asn Gly Glu Val Trp 
            580                 585                 590         


Asp Ile Trp Ser Gln 
        595         


<210> 157
<211> 1107
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 157
atgaatgtgt tgcgtagtgg aatcgtgacg atgctgctgt tggctgcctt tagtgttcag     60

gcagcctgta cctggcctgc ctgggagcag tttaaaaagg attacatcag tcaggaaggg    120

cgcgtcatcg accccagcga cgcgcgcaaa atcaccacct ccgaagggca aagttacggt    180

atgttctttg ccctggcggc taacgaccgt gtagctttcg ataatattct cgactggacg    240

cagaacaatc tcgctcaggg ctctttaaaa gaacgtttgc ccgcctggct gtggggcaag    300

aaagagaaca gtaagtggga agtgctggac agcaattcgg cctccgatgg tgatgtctgg    360

atggcctggt cgttgctgga ggcggggcgt ttgtggaaag agcagcgtta taccgacatc    420

ggcagcgcgt tgctaaaacg tatcgcgcgg gaggaagtgg tgacggtgcc tgggctgggt    480

tccatgttgt taccgggcaa agtgggtttt gctgaggata acagctggcg ttttaacccc    540

agctacctgc cgccgacgct ggcgcagtat ttcacccgct ttggcgcgcc gtggaccacg    600

ctgcgcgaaa ccaatcaacg tttattgctg gaaaccgccc cgaaaggttt ttcgccagac    660

tgggtgcgct atgagaaaga caaaggctgg cagctaaaag ccgaaaaaac attgatcagc    720

agctacgacg ctatccgcgt ttacatgtgg gtaggcatga tgcctgacag cgatccgcaa    780

aaagcgcgga tgctcaaccg gtttaaaccg atggcgacat tcactgagaa aaacggttat    840

ccgccggaaa aagtggatgt ggctacgggg aaagcgcagg gtaaaggacc ggtcggtttt    900

tctgccgcca tgctgccctt tttacaaaac cgcgatgcgc aggccgttca gcgccagcgc    960

gtggccgata actttcccgg cagcgatgcc tattacaact atgtgctgac cctgtttgga   1020

caaggctggg atcaacaccg tttccgcttc tcgacaaaag gtgagttatt acctgactgg   1080

ggccaggaat gcgcaaattc acactaa                                       1107

<210> 158
<211> 368
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(21)

<220> 
<221> DOMAIN
<222> (1)...(346)
<223> Glycosyl hydrolases family 8

<400> 158
Met Asn Val Leu Arg Ser Gly Ile Val Thr Met Leu Leu Leu Ala Ala 
1               5                   10                  15      


Phe Ser Val Gln Ala Ala Cys Thr Trp Pro Ala Trp Glu Gln Phe Lys 
            20                  25                  30          


Lys Asp Tyr Ile Ser Gln Glu Gly Arg Val Ile Asp Pro Ser Asp Ala 
        35                  40                  45              


Arg Lys Ile Thr Thr Ser Glu Gly Gln Ser Tyr Gly Met Phe Phe Ala 
    50                  55                  60                  


Leu Ala Ala Asn Asp Arg Val Ala Phe Asp Asn Ile Leu Asp Trp Thr 
65                  70                  75                  80  


Gln Asn Asn Leu Ala Gln Gly Ser Leu Lys Glu Arg Leu Pro Ala Trp 
                85                  90                  95      


Leu Trp Gly Lys Lys Glu Asn Ser Lys Trp Glu Val Leu Asp Ser Asn 
            100                 105                 110         


Ser Ala Ser Asp Gly Asp Val Trp Met Ala Trp Ser Leu Leu Glu Ala 
        115                 120                 125             


Gly Arg Leu Trp Lys Glu Gln Arg Tyr Thr Asp Ile Gly Ser Ala Leu 
    130                 135                 140                 


Leu Lys Arg Ile Ala Arg Glu Glu Val Val Thr Val Pro Gly Leu Gly 
145                 150                 155                 160 


Ser Met Leu Leu Pro Gly Lys Val Gly Phe Ala Glu Asp Asn Ser Trp 
                165                 170                 175     


Arg Phe Asn Pro Ser Tyr Leu Pro Pro Thr Leu Ala Gln Tyr Phe Thr 
            180                 185                 190         


Arg Phe Gly Ala Pro Trp Thr Thr Leu Arg Glu Thr Asn Gln Arg Leu 
        195                 200                 205             


Leu Leu Glu Thr Ala Pro Lys Gly Phe Ser Pro Asp Trp Val Arg Tyr 
    210                 215                 220                 


Glu Lys Asp Lys Gly Trp Gln Leu Lys Ala Glu Lys Thr Leu Ile Ser 
225                 230                 235                 240 


Ser Tyr Asp Ala Ile Arg Val Tyr Met Trp Val Gly Met Met Pro Asp 
                245                 250                 255     


Ser Asp Pro Gln Lys Ala Arg Met Leu Asn Arg Phe Lys Pro Met Ala 
            260                 265                 270         


Thr Phe Thr Glu Lys Asn Gly Tyr Pro Pro Glu Lys Val Asp Val Ala 
        275                 280                 285             


Thr Gly Lys Ala Gln Gly Lys Gly Pro Val Gly Phe Ser Ala Ala Met 
    290                 295                 300                 


Leu Pro Phe Leu Gln Asn Arg Asp Ala Gln Ala Val Gln Arg Gln Arg 
305                 310                 315                 320 


Val Ala Asp Asn Phe Pro Gly Ser Asp Ala Tyr Tyr Asn Tyr Val Leu 
                325                 330                 335     


Thr Leu Phe Gly Gln Gly Trp Asp Gln His Arg Phe Arg Phe Ser Thr 
            340                 345                 350         


Lys Gly Glu Leu Leu Pro Asp Trp Gly Gln Glu Cys Ala Asn Ser His 
        355                 360                 365             


<210> 159
<211> 1500
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 159
atgaaacggt caatctctat ttttattacg tgtttattga ttacgttatt gacaatgggc     60

ggcatgatag cttcgccggc atcagcagca gggacaaaaa cgccagtagc caagaatggc    120

cagcttagca taaaaggtac acagctcgtt aaccgagacg gtaaagcggt acagctgaag    180

gggatcagtt cacacggatt gcaatggtat ggagaatatg tcaataaaga cagcttaaaa    240

tggctgagag atgattgggg tatcaccgtt ttccgtgcag cgatgtatac ggcagatggc    300

ggttatattg acaacccgtc cgtgaaaaat aaagtaaaag aagcggttga agcggcaaaa    360

gagcttggga tatatgtcat cattgactgg catatcttaa atgacggtaa tccaaaccaa    420

aataaagaga aggcaaaaga attcttcaag gaaatgtcaa gcctttacgg aaacacgcca    480

aacgtcattt atgaaattgc aaacgaacca aacggtgatg tgaactggaa gcgtgatatt    540

aaaccatatg cggaagaagt gatttcagtt atccgcaaaa atgatccaga caacatcatc    600

attgtcggaa ccggtacatg gagccaggat gtgaatgatg ctgccgatga ccagctaaaa    660

gatgcaaacg ttatgtacgc acttcatttt tatgccggca cacacggcca atttttacgg    720

gataaagcaa actatgcact cagcaaagga gcacctattt ttgtgacaga gtggggaaca    780

agcgacgcgt ctggcaatgg cggtgtattc cttgatcaat cgagggaatg gctgaaatat    840

ctcgacagca agaccattag ctgggtgaac tggaatcttt ctgataagca ggaatcatcc    900

tcagctttaa agccgggggc atctaaaaca ggcggctggc ggttgtcaga tttatctgct    960

tcaggaacat tcgttagaga aaacattctc ggcaccaaag attcgacgaa ggacattcct   1020

gaaacgccat caaaagataa acccacacag gaaaatggta tttctgtaca gtacagagca   1080

ggggatggga gtatgaacag caaccaaatc cgtccgcagc ttcaaataaa aaataacggc   1140

aataccacgg ttgatttaaa agatgtcact gcccgttact ggtataaagc gaaaaacaaa   1200

ggccaaaact ttgactgtga ctacgcgcag attggatgcg gcaatgtgac acacaagttt   1260

gtgacgttgc ataaaccaaa gcaaggtgca gatacctatc tggaacttgg atttaaaaac   1320

ggaacgttgg caccgggagc aagcacaggg aatattcagc tccgtcttca caatgatgac   1380

tggagcaatt atgcacaaag cggcgattat tcctttttca aatcaaatac gtttaaaaca   1440

acgaaaaaaa tcacattata tgatcaagga aaactgattt ggggaacaga accaaattag   1500

<210> 160
<211> 499
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (47)...(301)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (356)...(437)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (164)...(173)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (296)...(299)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (386)...(389)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (421)...(424)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (446)...(449)
<223> N-glycosylation site. Prosite id = PS00001

<400> 160
Met Lys Arg Ser Ile Ser Ile Phe Ile Thr Cys Leu Leu Ile Thr Leu 
1               5                   10                  15      


Leu Thr Met Gly Gly Met Ile Ala Ser Pro Ala Ser Ala Ala Gly Thr 
            20                  25                  30          


Lys Thr Pro Val Ala Lys Asn Gly Gln Leu Ser Ile Lys Gly Thr Gln 
        35                  40                  45              


Leu Val Asn Arg Asp Gly Lys Ala Val Gln Leu Lys Gly Ile Ser Ser 
    50                  55                  60                  


His Gly Leu Gln Trp Tyr Gly Glu Tyr Val Asn Lys Asp Ser Leu Lys 
65                  70                  75                  80  


Trp Leu Arg Asp Asp Trp Gly Ile Thr Val Phe Arg Ala Ala Met Tyr 
                85                  90                  95      


Thr Ala Asp Gly Gly Tyr Ile Asp Asn Pro Ser Val Lys Asn Lys Val 
            100                 105                 110         


Lys Glu Ala Val Glu Ala Ala Lys Glu Leu Gly Ile Tyr Val Ile Ile 
        115                 120                 125             


Asp Trp His Ile Leu Asn Asp Gly Asn Pro Asn Gln Asn Lys Glu Lys 
    130                 135                 140                 


Ala Lys Glu Phe Phe Lys Glu Met Ser Ser Leu Tyr Gly Asn Thr Pro 
145                 150                 155                 160 


Asn Val Ile Tyr Glu Ile Ala Asn Glu Pro Asn Gly Asp Val Asn Trp 
                165                 170                 175     


Lys Arg Asp Ile Lys Pro Tyr Ala Glu Glu Val Ile Ser Val Ile Arg 
            180                 185                 190         


Lys Asn Asp Pro Asp Asn Ile Ile Ile Val Gly Thr Gly Thr Trp Ser 
        195                 200                 205             


Gln Asp Val Asn Asp Ala Ala Asp Asp Gln Leu Lys Asp Ala Asn Val 
    210                 215                 220                 


Met Tyr Ala Leu His Phe Tyr Ala Gly Thr His Gly Gln Phe Leu Arg 
225                 230                 235                 240 


Asp Lys Ala Asn Tyr Ala Leu Ser Lys Gly Ala Pro Ile Phe Val Thr 
                245                 250                 255     


Glu Trp Gly Thr Ser Asp Ala Ser Gly Asn Gly Gly Val Phe Leu Asp 
            260                 265                 270         


Gln Ser Arg Glu Trp Leu Lys Tyr Leu Asp Ser Lys Thr Ile Ser Trp 
        275                 280                 285             


Val Asn Trp Asn Leu Ser Asp Lys Gln Glu Ser Ser Ser Ala Leu Lys 
    290                 295                 300                 


Pro Gly Ala Ser Lys Thr Gly Gly Trp Arg Leu Ser Asp Leu Ser Ala 
305                 310                 315                 320 


Ser Gly Thr Phe Val Arg Glu Asn Ile Leu Gly Thr Lys Asp Ser Thr 
                325                 330                 335     


Lys Asp Ile Pro Glu Thr Pro Ser Lys Asp Lys Pro Thr Gln Glu Asn 
            340                 345                 350         


Gly Ile Ser Val Gln Tyr Arg Ala Gly Asp Gly Ser Met Asn Ser Asn 
        355                 360                 365             


Gln Ile Arg Pro Gln Leu Gln Ile Lys Asn Asn Gly Asn Thr Thr Val 
    370                 375                 380                 


Asp Leu Lys Asp Val Thr Ala Arg Tyr Trp Tyr Lys Ala Lys Asn Lys 
385                 390                 395                 400 


Gly Gln Asn Phe Asp Cys Asp Tyr Ala Gln Ile Gly Cys Gly Asn Val 
                405                 410                 415     


Thr His Lys Phe Val Thr Leu His Lys Pro Lys Gln Gly Ala Asp Thr 
            420                 425                 430         


Tyr Leu Glu Leu Gly Phe Lys Asn Gly Thr Leu Ala Pro Gly Ala Ser 
        435                 440                 445             


Thr Gly Asn Ile Gln Leu Arg Leu His Asn Asp Asp Trp Ser Asn Tyr 
    450                 455                 460                 


Ala Gln Ser Gly Asp Tyr Ser Phe Phe Lys Ser Asn Thr Phe Lys Thr 
465                 470                 475                 480 


Thr Lys Lys Ile Thr Leu Tyr Asp Gln Gly Lys Leu Ile Trp Gly Thr 
                485                 490                 495     


Glu Pro Asn 
            


<210> 161
<211> 1185
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 161
gtgcggggac aaaagttatg ggttgcgctg gcggcgcttc tgtcggtggg gtcggtcgcg     60

ttggcctctg tcgggactaa ggggccggat cgttctccgc tgctcgggca tgccgccgtc    120

cggaagccgt cgcaggccgg cgccctgcag ctcgtccttc gggacggccg acggacgctc    180

gcggacgagc gcggccggat gattcagctg cgcggcatga gcacgcacgg gttgcaatgg    240

tttccgcaga ttctgaacga caacgcgttc gcggcgttgg cggaggactg gggcgccaac    300

gtcgtccggc tggccatgta tgtgggcgaa ggggggtatg ctgcggaccc ggacaagttc    360

cggcaaaggg cgatgcgcgg catcgatctg gcgatcgcgc acgatctgta cgtcatcgtc    420

gactggcacg tgctgacgcc gggagatccg cgggccgatg tgtacagcgg cgcaatggaa    480

ttttttcggt cggtgtcggc tcggtatccg aacgatccgc acctgctgta cgagctggcc    540

aacgagccga acggcgggag cgcggacggg cagccgggca tcccgaacga tgcggacggc    600

tggaaggcgg tcaaggcgta cgcccagccg atcgtcgata tgctgcgcga gaccggcaac    660

ggcaatatcg tcatcgtcgg ctcgccgaac tggagccaac ggccggacct ggccgcggac    720

gatccgatcg acgacccgct gaaccggacg atgtacgcgt tccacttcta tgccgggtcg    780

catcggttct ccgcggacag cggggaccgg cagaacgtga tgagcaacgt ccgctacgcg    840

ctggagcgcg gagccgccgt gttcgcgacc gagtggggga cgagcgaggc cagcggcaac    900

ggcggcccgt acatggaggc agcggatgcg tggctgtcgt tcctgaacga aaacaacatc    960

agctggacga actggtcgct ggcgaacaag gacgagactt cggccgcttt gcggcctttc   1020

acgctgggcg ggctgccggc ggcttcgctc gaccccggcg aaggccgcgc ctggactccg   1080

gaggagctga gcgtcagcgg ggagtacgtg cgggcgcgga tcaagggcat cccgtacaag   1140

cccatcgacc gcagcgcgcc gggcgcgcgg cgtcctttgg gctaa                   1185

<210> 162
<211> 394
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (58)...(335)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (177)...(186)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (233)...(236)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (251)...(254)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<400> 162
Met Arg Gly Gln Lys Leu Trp Val Ala Leu Ala Ala Leu Leu Ser Val 
1               5                   10                  15      


Gly Ser Val Ala Leu Ala Ser Val Gly Thr Lys Gly Pro Asp Arg Ser 
            20                  25                  30          


Pro Leu Leu Gly His Ala Ala Val Arg Lys Pro Ser Gln Ala Gly Ala 
        35                  40                  45              


Leu Gln Leu Val Leu Arg Asp Gly Arg Arg Thr Leu Ala Asp Glu Arg 
    50                  55                  60                  


Gly Arg Met Ile Gln Leu Arg Gly Met Ser Thr His Gly Leu Gln Trp 
65                  70                  75                  80  


Phe Pro Gln Ile Leu Asn Asp Asn Ala Phe Ala Ala Leu Ala Glu Asp 
                85                  90                  95      


Trp Gly Ala Asn Val Val Arg Leu Ala Met Tyr Val Gly Glu Gly Gly 
            100                 105                 110         


Tyr Ala Ala Asp Pro Asp Lys Phe Arg Gln Arg Ala Met Arg Gly Ile 
        115                 120                 125             


Asp Leu Ala Ile Ala His Asp Leu Tyr Val Ile Val Asp Trp His Val 
    130                 135                 140                 


Leu Thr Pro Gly Asp Pro Arg Ala Asp Val Tyr Ser Gly Ala Met Glu 
145                 150                 155                 160 


Phe Phe Arg Ser Val Ser Ala Arg Tyr Pro Asn Asp Pro His Leu Leu 
                165                 170                 175     


Tyr Glu Leu Ala Asn Glu Pro Asn Gly Gly Ser Ala Asp Gly Gln Pro 
            180                 185                 190         


Gly Ile Pro Asn Asp Ala Asp Gly Trp Lys Ala Val Lys Ala Tyr Ala 
        195                 200                 205             


Gln Pro Ile Val Asp Met Leu Arg Glu Thr Gly Asn Gly Asn Ile Val 
    210                 215                 220                 


Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Leu Ala Ala Asp 
225                 230                 235                 240 


Asp Pro Ile Asp Asp Pro Leu Asn Arg Thr Met Tyr Ala Phe His Phe 
                245                 250                 255     


Tyr Ala Gly Ser His Arg Phe Ser Ala Asp Ser Gly Asp Arg Gln Asn 
            260                 265                 270         


Val Met Ser Asn Val Arg Tyr Ala Leu Glu Arg Gly Ala Ala Val Phe 
        275                 280                 285             


Ala Thr Glu Trp Gly Thr Ser Glu Ala Ser Gly Asn Gly Gly Pro Tyr 
    290                 295                 300                 


Met Glu Ala Ala Asp Ala Trp Leu Ser Phe Leu Asn Glu Asn Asn Ile 
305                 310                 315                 320 


Ser Trp Thr Asn Trp Ser Leu Ala Asn Lys Asp Glu Thr Ser Ala Ala 
                325                 330                 335     


Leu Arg Pro Phe Thr Leu Gly Gly Leu Pro Ala Ala Ser Leu Asp Pro 
            340                 345                 350         


Gly Glu Gly Arg Ala Trp Thr Pro Glu Glu Leu Ser Val Ser Gly Glu 
        355                 360                 365             


Tyr Val Arg Ala Arg Ile Lys Gly Ile Pro Tyr Lys Pro Ile Asp Arg 
    370                 375                 380                 


Ser Ala Pro Gly Ala Arg Arg Pro Leu Gly 
385                 390                 


<210> 163
<211> 2529
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 163
atgtcgaaaa taaaacaccc taaattatta ctttgcttat tgcctgtttt tggcatgttc     60

aactgtcaaa atgtattgag tgcagacaat cagcaagtta actggcctta tgtgaatacg    120

caacttaagc gtgatccagc cgttgaaaaa cagattgata aactgctcgc aacaatgact    180

ttagagcaaa aagttgcgca aatgatccag ccagaaattg gttatttaag tgtagagcaa    240

atgaaaaagt atggatttgg ctcttatcta aatggcggaa atactgcccc ttacggaaat    300

aaacgagcgg atcaagctac ttggttaaaa tacgcagatg aaatgtacct agcctctatg    360

gattcgacca tagatggtat tgcgatcccg acggtatggg gaaccgatgc catgcatggg    420

catagtaatg tgtatggcgc tacgttattt ccgcacaata ttggcttagg agctgcgcat    480

gatgctgatt taattaaacg cataggccaa gcaacagcaa aggaagtatc tgctacaggg    540

attgaatgga gttttgcgcc tactgttgct gtagttagag atgatcgctg gggcagaact    600

tatgaaagtt actcagaaga ccctgaatta gttaggttgt atgcaggtag tatggtgacc    660

ggtattcaag gtgacatagg cgctgacttt ttaaaaggca gtaaccgtat tgccaccgct    720

aagcactttg tgggtgatgg tggcactgag cgaggcgttg atcgcggaaa tacgcttatc    780

gatgaaaaag gattaagaga tattcacagc gcaggttatt tttctgctat taatgaaggc    840

gtgcaatcag tgatggcatc ctttaatagt tggaatggca aacgtgtaca tggcgataag    900

catttattaa cggatgtact taaaaaccaa cttggttttg atgggtttgt agtaagcgac    960

tggaatgcgc ataagtttgt agaaggctgc gatttagagc aatgcgccca agccataaac   1020

gcaggtgttg atgtaatcat ggtgccagag cattttgaag cgttttatca taatactgtt   1080

aagcaagtga aagagggggt aatagccgaa tcaagaataa acgatgcggt tagacgcttt   1140

ttaagggcta aaatccgctg gggggtgttt acaaagggta aaccatcagc acgacctgaa   1200

tcacaacacc cacagtggtt aggcgctaat gagcatcgca ctttggcaag agaagcggtt   1260

cgcaagtcac tcgttctatt aaaaaacaac gaaaacgttt tgccaattaa agcgcatagt   1320

cgcatcttag ttgccggtaa aggcgccaat gcaatcaata tgcaagcagg tggctggagc   1380

gtttcttggc agggcacaga caatactaat agcgactttc caaatgcgac ctctatttat   1440

gcaggtttaa agtcgcaagt attaaaagca ggcggtgaaa taagcttaag tgagtctggt   1500

gactataaaa ccaagccaga tgtagccatt gttgttattg gtgaagaacc ttatgctgag   1560

tggtttggtg atatagaaat gctagagttt cagcatgaaa gcaaacacgc tcttgcactg   1620

ctgaaaaaac ttaaggcaga taatattacg gtagtcaccg tatttttaag tggtcgcccg   1680

ctttgggtca ataaagaact caatgcatca gatgcctttg ttgctgcatg gttaccaggc   1740

tctgagggtg agggagttgc agacgtatta ctaacagata aaaacggtaa cagtcagttt   1800

gattttagcg gtaagttgag tttttcgtgg cctaaatacg acgaccaatt tagccttaac   1860

ttaggcgatg taaattacga cccgctcttt gcctatggct acgggctaac ttatcaagac   1920

acagtaaaca tacctgttgt gagtgaaact acaagcccta aaaaaatcgt taaaagcgat   1980

gcacacccat tgtttgtaaa aagtttagct aaaaatctca cttgggcgtt agctgataca   2040

acagcaaagg ttttaactag tggttcatcg gccaccagtg gcgataaaaa aagtttgtta   2100

atgcaatcgg ttaatttatc gtatcaagag gatgcaaggc agtttaagtg gcaatctcaa   2160

tcctcagatg caacgcttag cctaagttac ttaaaaccta cgctactcga acgcaaattt   2220

aaatcaggtt accttgaact aaaaatgcga ttagataaag ctccagagca aggtgcccag   2280

ttacaagtaa tgtgtaattt agataactgt ttaagaagta ttgatttttc atcgtttgaa   2340

aaaacaatgg cagataagcg ctggcatact ttatcaatcg cactcaattg cgccgatagc   2400

gaactgattg aacaatccat gtcagatgta attcgtattt cagctcagcg tttaagttta   2460

gcagttgcag atatcgcgtt gactgcaaaa cctaataatg atgcattatc aatcacgtgc   2520

tcaaaataa                                                           2529

<210> 164
<211> 842
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> SIGNAL
<222> (1)...(27)

<220> 
<221> DOMAIN
<222> (126)...(348)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (424)...(640)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (310)...(327)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (482)...(485)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (556)...(559)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (576)...(579)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (682)...(685)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (715)...(718)
<223> N-glycosylation site. Prosite id = PS00001

<400> 164
Met Ser Lys Ile Lys His Pro Lys Leu Leu Leu Cys Leu Leu Pro Val 
1               5                   10                  15      


Phe Gly Met Phe Asn Cys Gln Asn Val Leu Ser Ala Asp Asn Gln Gln 
            20                  25                  30          


Val Asn Trp Pro Tyr Val Asn Thr Gln Leu Lys Arg Asp Pro Ala Val 
        35                  40                  45              


Glu Lys Gln Ile Asp Lys Leu Leu Ala Thr Met Thr Leu Glu Gln Lys 
    50                  55                  60                  


Val Ala Gln Met Ile Gln Pro Glu Ile Gly Tyr Leu Ser Val Glu Gln 
65                  70                  75                  80  


Met Lys Lys Tyr Gly Phe Gly Ser Tyr Leu Asn Gly Gly Asn Thr Ala 
                85                  90                  95      


Pro Tyr Gly Asn Lys Arg Ala Asp Gln Ala Thr Trp Leu Lys Tyr Ala 
            100                 105                 110         


Asp Glu Met Tyr Leu Ala Ser Met Asp Ser Thr Ile Asp Gly Ile Ala 
        115                 120                 125             


Ile Pro Thr Val Trp Gly Thr Asp Ala Met His Gly His Ser Asn Val 
    130                 135                 140                 


Tyr Gly Ala Thr Leu Phe Pro His Asn Ile Gly Leu Gly Ala Ala His 
145                 150                 155                 160 


Asp Ala Asp Leu Ile Lys Arg Ile Gly Gln Ala Thr Ala Lys Glu Val 
                165                 170                 175     


Ser Ala Thr Gly Ile Glu Trp Ser Phe Ala Pro Thr Val Ala Val Val 
            180                 185                 190         


Arg Asp Asp Arg Trp Gly Arg Thr Tyr Glu Ser Tyr Ser Glu Asp Pro 
        195                 200                 205             


Glu Leu Val Arg Leu Tyr Ala Gly Ser Met Val Thr Gly Ile Gln Gly 
    210                 215                 220                 


Asp Ile Gly Ala Asp Phe Leu Lys Gly Ser Asn Arg Ile Ala Thr Ala 
225                 230                 235                 240 


Lys His Phe Val Gly Asp Gly Gly Thr Glu Arg Gly Val Asp Arg Gly 
                245                 250                 255     


Asn Thr Leu Ile Asp Glu Lys Gly Leu Arg Asp Ile His Ser Ala Gly 
            260                 265                 270         


Tyr Phe Ser Ala Ile Asn Glu Gly Val Gln Ser Val Met Ala Ser Phe 
        275                 280                 285             


Asn Ser Trp Asn Gly Lys Arg Val His Gly Asp Lys His Leu Leu Thr 
    290                 295                 300                 


Asp Val Leu Lys Asn Gln Leu Gly Phe Asp Gly Phe Val Val Ser Asp 
305                 310                 315                 320 


Trp Asn Ala His Lys Phe Val Glu Gly Cys Asp Leu Glu Gln Cys Ala 
                325                 330                 335     


Gln Ala Ile Asn Ala Gly Val Asp Val Ile Met Val Pro Glu His Phe 
            340                 345                 350         


Glu Ala Phe Tyr His Asn Thr Val Lys Gln Val Lys Glu Gly Val Ile 
        355                 360                 365             


Ala Glu Ser Arg Ile Asn Asp Ala Val Arg Arg Phe Leu Arg Ala Lys 
    370                 375                 380                 


Ile Arg Trp Gly Val Phe Thr Lys Gly Lys Pro Ser Ala Arg Pro Glu 
385                 390                 395                 400 


Ser Gln His Pro Gln Trp Leu Gly Ala Asn Glu His Arg Thr Leu Ala 
                405                 410                 415     


Arg Glu Ala Val Arg Lys Ser Leu Val Leu Leu Lys Asn Asn Glu Asn 
            420                 425                 430         


Val Leu Pro Ile Lys Ala His Ser Arg Ile Leu Val Ala Gly Lys Gly 
        435                 440                 445             


Ala Asn Ala Ile Asn Met Gln Ala Gly Gly Trp Ser Val Ser Trp Gln 
    450                 455                 460                 


Gly Thr Asp Asn Thr Asn Ser Asp Phe Pro Asn Ala Thr Ser Ile Tyr 
465                 470                 475                 480 


Ala Gly Leu Lys Ser Gln Val Leu Lys Ala Gly Gly Glu Ile Ser Leu 
                485                 490                 495     


Ser Glu Ser Gly Asp Tyr Lys Thr Lys Pro Asp Val Ala Ile Val Val 
            500                 505                 510         


Ile Gly Glu Glu Pro Tyr Ala Glu Trp Phe Gly Asp Ile Glu Met Leu 
        515                 520                 525             


Glu Phe Gln His Glu Ser Lys His Ala Leu Ala Leu Leu Lys Lys Leu 
    530                 535                 540                 


Lys Ala Asp Asn Ile Thr Val Val Thr Val Phe Leu Ser Gly Arg Pro 
545                 550                 555                 560 


Leu Trp Val Asn Lys Glu Leu Asn Ala Ser Asp Ala Phe Val Ala Ala 
                565                 570                 575     


Trp Leu Pro Gly Ser Glu Gly Glu Gly Val Ala Asp Val Leu Leu Thr 
            580                 585                 590         


Asp Lys Asn Gly Asn Ser Gln Phe Asp Phe Ser Gly Lys Leu Ser Phe 
        595                 600                 605             


Ser Trp Pro Lys Tyr Asp Asp Gln Phe Ser Leu Asn Leu Gly Asp Val 
    610                 615                 620                 


Asn Tyr Asp Pro Leu Phe Ala Tyr Gly Tyr Gly Leu Thr Tyr Gln Asp 
625                 630                 635                 640 


Thr Val Asn Ile Pro Val Val Ser Glu Thr Thr Ser Pro Lys Lys Ile 
                645                 650                 655     


Val Lys Ser Asp Ala His Pro Leu Phe Val Lys Ser Leu Ala Lys Asn 
            660                 665                 670         


Leu Thr Trp Ala Leu Ala Asp Thr Thr Ala Lys Val Leu Thr Ser Gly 
        675                 680                 685             


Ser Ser Ala Thr Ser Gly Asp Lys Lys Ser Leu Leu Met Gln Ser Val 
    690                 695                 700                 


Asn Leu Ser Tyr Gln Glu Asp Ala Arg Gln Phe Lys Trp Gln Ser Gln 
705                 710                 715                 720 


Ser Ser Asp Ala Thr Leu Ser Leu Ser Tyr Leu Lys Pro Thr Leu Leu 
                725                 730                 735     


Glu Arg Lys Phe Lys Ser Gly Tyr Leu Glu Leu Lys Met Arg Leu Asp 
            740                 745                 750         


Lys Ala Pro Glu Gln Gly Ala Gln Leu Gln Val Met Cys Asn Leu Asp 
        755                 760                 765             


Asn Cys Leu Arg Ser Ile Asp Phe Ser Ser Phe Glu Lys Thr Met Ala 
    770                 775                 780                 


Asp Lys Arg Trp His Thr Leu Ser Ile Ala Leu Asn Cys Ala Asp Ser 
785                 790                 795                 800 


Glu Leu Ile Glu Gln Ser Met Ser Asp Val Ile Arg Ile Ser Ala Gln 
                805                 810                 815     


Arg Leu Ser Leu Ala Val Ala Asp Ile Ala Leu Thr Ala Lys Pro Asn 
            820                 825                 830         


Asn Asp Ala Leu Ser Ile Thr Cys Ser Lys 
        835                 840         


<210> 165
<211> 1005
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 165
atgtcaaagg cgaaacttta tgtcatactg ttatttttct ctgttgttgc aagcggtttt     60

gatttatcgc gtcaaaaagc ctttgagtac aacaaacatc ttggagttgg tgttaatctt    120

gggaacgctc ttgaggctcc aagagaaggt gcttggggaa tgagaattca agatgattat    180

tttcctgcga tcaaagaaag gggtttcaat catgtcagaa ttcctatccg atggtccgca    240

cactgtatta aagagccacc ttacacaata gatgaaaagt ttttccaaag agttgaacat    300

gttattcaaa aggcacttga aaatggcttg ttggttgtta taaacatgca tcactttgaa    360

gagctttatc aaaatccgct acaatacaag gagatattct tggctctttg gcaacaaatt    420

tcagagagat tcaaggatta tccagctgaa ctgagttttg agattttcaa tgaaccagct    480

caagctttta ctgtgtcact ctggaatcaa tttgcacgcg aagctctcaa agttatacgt    540

cagtctaatc cagaacgaat agtgataatc gatgctccaa attgggctca ttggagtgcc    600

gttgagactc ttgcactgcc agaagatgaa aacatcatag tttcttttca ctactatgaa    660

cctttcaatt ttacccatca aggtgccgaa tgggtttcac cagttccacg cgtgggtgtg    720

aagtgggaag ccacaaaagc ccaaattggt gaaatcgaga agcatttcaa agctgttagc    780

gactgggcga agaaacacaa tgttccaatt taccttgggg agtttggagc ttactccaaa    840

gcagacatgg aatctcgtgt gaaatggact aggacagtca gtcaaatagc acagaagtat    900

ggtttttcaa ttgcgtactg ggaattcggc gcagggtttg gaatttacga tcgatccaaa    960

ggcgaatgga tagagcctct gacgaatgca gtgtttggtc aataa                   1005

<210> 166
<211> 334
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (41)...(317)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (226)...(229)
<223> N-glycosylation site. Prosite id = PS00001

<400> 166
Met Ser Lys Ala Lys Leu Tyr Val Ile Leu Leu Phe Phe Ser Val Val 
1               5                   10                  15      


Ala Ser Gly Phe Asp Leu Ser Arg Gln Lys Ala Phe Glu Tyr Asn Lys 
            20                  25                  30          


His Leu Gly Val Gly Val Asn Leu Gly Asn Ala Leu Glu Ala Pro Arg 
        35                  40                  45              


Glu Gly Ala Trp Gly Met Arg Ile Gln Asp Asp Tyr Phe Pro Ala Ile 
    50                  55                  60                  


Lys Glu Arg Gly Phe Asn His Val Arg Ile Pro Ile Arg Trp Ser Ala 
65                  70                  75                  80  


His Cys Ile Lys Glu Pro Pro Tyr Thr Ile Asp Glu Lys Phe Phe Gln 
                85                  90                  95      


Arg Val Glu His Val Ile Gln Lys Ala Leu Glu Asn Gly Leu Leu Val 
            100                 105                 110         


Val Ile Asn Met His His Phe Glu Glu Leu Tyr Gln Asn Pro Leu Gln 
        115                 120                 125             


Tyr Lys Glu Ile Phe Leu Ala Leu Trp Gln Gln Ile Ser Glu Arg Phe 
    130                 135                 140                 


Lys Asp Tyr Pro Ala Glu Leu Ser Phe Glu Ile Phe Asn Glu Pro Ala 
145                 150                 155                 160 


Gln Ala Phe Thr Val Ser Leu Trp Asn Gln Phe Ala Arg Glu Ala Leu 
                165                 170                 175     


Lys Val Ile Arg Gln Ser Asn Pro Glu Arg Ile Val Ile Ile Asp Ala 
            180                 185                 190         


Pro Asn Trp Ala His Trp Ser Ala Val Glu Thr Leu Ala Leu Pro Glu 
        195                 200                 205             


Asp Glu Asn Ile Ile Val Ser Phe His Tyr Tyr Glu Pro Phe Asn Phe 
    210                 215                 220                 


Thr His Gln Gly Ala Glu Trp Val Ser Pro Val Pro Arg Val Gly Val 
225                 230                 235                 240 


Lys Trp Glu Ala Thr Lys Ala Gln Ile Gly Glu Ile Glu Lys His Phe 
                245                 250                 255     


Lys Ala Val Ser Asp Trp Ala Lys Lys His Asn Val Pro Ile Tyr Leu 
            260                 265                 270         


Gly Glu Phe Gly Ala Tyr Ser Lys Ala Asp Met Glu Ser Arg Val Lys 
        275                 280                 285             


Trp Thr Arg Thr Val Ser Gln Ile Ala Gln Lys Tyr Gly Phe Ser Ile 
    290                 295                 300                 


Ala Tyr Trp Glu Phe Gly Ala Gly Phe Gly Ile Tyr Asp Arg Ser Lys 
305                 310                 315                 320 


Gly Glu Trp Ile Glu Pro Leu Thr Asn Ala Val Phe Gly Gln 
                325                 330                 


<210> 167
<211> 1107
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 167
atgtttatcg gttttcgctt atctattgcg tcgctgctgg cgtgtttgct tttccccatg     60

caggcgttgg cacaggcgga agatcctgtt accagcgatc gccaggtgta tgaaaaaatt    120

ggtcgttttg attcgagctt gctaatcaat cagcaggatt ttattcgcgt taaaggtaat    180

cagtttatcg acgaaaaagg agaggttttc atttttcgtg gtgtaagcgt ggctgacccg    240

gataaattgg cgaaagaaaa gcaatggaaa aaaagtttat ttgcagagtt aaaaaattgg    300

ggtgtgaata ccgtacgttt accgattcat cctcgagcct ggcgtgaacg tgggcaggaa    360

gaatacttca aattgattga tcaggcggta atgtgggcca atgagtttga tcaatatctg    420

attcttgact ggcattccat tggttatctc gccagcggaa attatcaaca tcccatgtat    480

accaccgaca aacaagagac ctttcgcttt tggctggatg tggcctatcg ttatcaaggg    540

gttccgacta cagcggtgta tgagttattt aacgagccga ctaccttgga taaaccttgg    600

ggtaaaatcg agtgggctga atggaaagca cttaatgagc agatgatcga cattatctac    660

gccattgata aaaatgttat tccattggtg gcaggtttta actgggccta cgatttaaca    720

ccgcttaaag gtgcgccagt tgatcgtccg ggtattgctt atgcgtcaca tccctatcca    780

caaaaagtac aacaagatcc cgccgcgaaa gaagtctttt tcaaaggatg ggaagagaag    840

tggggctttg ccagcaaaaa atatccgctg atttgtaccg agctgggctg ggtgcagccg    900

gatggttacg gtgctcatgt accggtaaaa aatgatggca gctacggccc gcaaattatt    960

gaatatatgg aagagcgcgg aatttcctgg acggcctggg tgtttgaccc acaatggtca   1020

ccgaccatga ttaatgattg gaattttact ccctcagaac aaggtgcgtt tttcaagaag   1080

gtaatgttgg agcacgccaa aaagtaa                                       1107

<210> 168
<211> 368
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(24)

<220> 
<221> DOMAIN
<222> (60)...(343)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 168
Met Phe Ile Gly Phe Arg Leu Ser Ile Ala Ser Leu Leu Ala Cys Leu 
1               5                   10                  15      


Leu Phe Pro Met Gln Ala Leu Ala Gln Ala Glu Asp Pro Val Thr Ser 
            20                  25                  30          


Asp Arg Gln Val Tyr Glu Lys Ile Gly Arg Phe Asp Ser Ser Leu Leu 
        35                  40                  45              


Ile Asn Gln Gln Asp Phe Ile Arg Val Lys Gly Asn Gln Phe Ile Asp 
    50                  55                  60                  


Glu Lys Gly Glu Val Phe Ile Phe Arg Gly Val Ser Val Ala Asp Pro 
65                  70                  75                  80  


Asp Lys Leu Ala Lys Glu Lys Gln Trp Lys Lys Ser Leu Phe Ala Glu 
                85                  90                  95      


Leu Lys Asn Trp Gly Val Asn Thr Val Arg Leu Pro Ile His Pro Arg 
            100                 105                 110         


Ala Trp Arg Glu Arg Gly Gln Glu Glu Tyr Phe Lys Leu Ile Asp Gln 
        115                 120                 125             


Ala Val Met Trp Ala Asn Glu Phe Asp Gln Tyr Leu Ile Leu Asp Trp 
    130                 135                 140                 


His Ser Ile Gly Tyr Leu Ala Ser Gly Asn Tyr Gln His Pro Met Tyr 
145                 150                 155                 160 


Thr Thr Asp Lys Gln Glu Thr Phe Arg Phe Trp Leu Asp Val Ala Tyr 
                165                 170                 175     


Arg Tyr Gln Gly Val Pro Thr Thr Ala Val Tyr Glu Leu Phe Asn Glu 
            180                 185                 190         


Pro Thr Thr Leu Asp Lys Pro Trp Gly Lys Ile Glu Trp Ala Glu Trp 
        195                 200                 205             


Lys Ala Leu Asn Glu Gln Met Ile Asp Ile Ile Tyr Ala Ile Asp Lys 
    210                 215                 220                 


Asn Val Ile Pro Leu Val Ala Gly Phe Asn Trp Ala Tyr Asp Leu Thr 
225                 230                 235                 240 


Pro Leu Lys Gly Ala Pro Val Asp Arg Pro Gly Ile Ala Tyr Ala Ser 
                245                 250                 255     


His Pro Tyr Pro Gln Lys Val Gln Gln Asp Pro Ala Ala Lys Glu Val 
            260                 265                 270         


Phe Phe Lys Gly Trp Glu Glu Lys Trp Gly Phe Ala Ser Lys Lys Tyr 
        275                 280                 285             


Pro Leu Ile Cys Thr Glu Leu Gly Trp Val Gln Pro Asp Gly Tyr Gly 
    290                 295                 300                 


Ala His Val Pro Val Lys Asn Asp Gly Ser Tyr Gly Pro Gln Ile Ile 
305                 310                 315                 320 


Glu Tyr Met Glu Glu Arg Gly Ile Ser Trp Thr Ala Trp Val Phe Asp 
                325                 330                 335     


Pro Gln Trp Ser Pro Thr Met Ile Asn Asp Trp Asn Phe Thr Pro Ser 
            340                 345                 350         


Glu Gln Gly Ala Phe Phe Lys Lys Val Met Leu Glu His Ala Lys Lys 
        355                 360                 365             


<210> 169
<211> 1113
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 169
atgcgctgcc cttcactttt atctattaaa aacctactcg ccctgatcgg catccttttg     60

actgcgccgg tgtttgcaca agcggaagat ccggtggcca gtgatcgcaa gacttatgaa    120

aaagtcggtc gctttgacgc gaaccagttg aaaaacaaac aggacactat ccgcgtaaaa    180

gatcatcagt ttgtcgatga gcaaggcaag gcttttatct ttcgtggtgt aagcgtggcc    240

gatccggata agttggtgaa agacaagcaa tggaaagcca gcctgttcaa agagctgaaa    300

gcctggggcg caaataccgt acgcttgccg attcatccgc gcacctggcg cgagcgcgga    360

caggatgaat atctcaaact gatcgaccag gcggtaatct gggcgaacca gcaccagctg    420

tatctgatcc tcgattggca ttccatcggt ttcctcgcgt ccggcaacta tcaacacccg    480

atgtattaca ccgacaaaca ggaaaccttc cgtttctggc acgacatcgc ctaccggtat    540

caaggcgtgc caaccacagc ggtgtatgaa ttatttaacg aaccgacaac cttgcaagat    600

ccttggggta aaaccgagtg ggcggaatgg aaaacgctga atgagcagat gatcgatgtc    660

atttatgcca ttgataaaga tgtcattccg ttggtcgcgg gctttaactg ggcctatgat    720

ctgacgccca tcgccgatgc gcctgttgat cgtccaggcg tggcttacgc gtcccatcct    780

tatccgcaaa aggagcagcc caccccaccg acgaaagaaa atttctttaa agcctgggat    840

gcgaagtggg gctttgccag caagaaatat ccgttgattt gcactgaact gggttgggtg    900

cagcccgacg gctacggtgc ccatgtgccg gtgaaaaatg atggcagtta cgggccgcaa    960

attattgagt ttatggaagc gcgcggcatc tcctggacgg cctgggtatt tgatccgcag   1020

tggtcgccga ccatgatcaa tgactggtcg tttacgccgt cggagcaggg ggcgtttttt   1080

aagaaggtga tgcaggaaaa ggcggggaaa tga                                1113

<210> 170
<211> 370
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(26)

<220> 
<221> DOMAIN
<222> (62)...(345)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 170
Met Arg Cys Pro Ser Leu Leu Ser Ile Lys Asn Leu Leu Ala Leu Ile 
1               5                   10                  15      


Gly Ile Leu Leu Thr Ala Pro Val Phe Ala Gln Ala Glu Asp Pro Val 
            20                  25                  30          


Ala Ser Asp Arg Lys Thr Tyr Glu Lys Val Gly Arg Phe Asp Ala Asn 
        35                  40                  45              


Gln Leu Lys Asn Lys Gln Asp Thr Ile Arg Val Lys Asp His Gln Phe 
    50                  55                  60                  


Val Asp Glu Gln Gly Lys Ala Phe Ile Phe Arg Gly Val Ser Val Ala 
65                  70                  75                  80  


Asp Pro Asp Lys Leu Val Lys Asp Lys Gln Trp Lys Ala Ser Leu Phe 
                85                  90                  95      


Lys Glu Leu Lys Ala Trp Gly Ala Asn Thr Val Arg Leu Pro Ile His 
            100                 105                 110         


Pro Arg Thr Trp Arg Glu Arg Gly Gln Asp Glu Tyr Leu Lys Leu Ile 
        115                 120                 125             


Asp Gln Ala Val Ile Trp Ala Asn Gln His Gln Leu Tyr Leu Ile Leu 
    130                 135                 140                 


Asp Trp His Ser Ile Gly Phe Leu Ala Ser Gly Asn Tyr Gln His Pro 
145                 150                 155                 160 


Met Tyr Tyr Thr Asp Lys Gln Glu Thr Phe Arg Phe Trp His Asp Ile 
                165                 170                 175     


Ala Tyr Arg Tyr Gln Gly Val Pro Thr Thr Ala Val Tyr Glu Leu Phe 
            180                 185                 190         


Asn Glu Pro Thr Thr Leu Gln Asp Pro Trp Gly Lys Thr Glu Trp Ala 
        195                 200                 205             


Glu Trp Lys Thr Leu Asn Glu Gln Met Ile Asp Val Ile Tyr Ala Ile 
    210                 215                 220                 


Asp Lys Asp Val Ile Pro Leu Val Ala Gly Phe Asn Trp Ala Tyr Asp 
225                 230                 235                 240 


Leu Thr Pro Ile Ala Asp Ala Pro Val Asp Arg Pro Gly Val Ala Tyr 
                245                 250                 255     


Ala Ser His Pro Tyr Pro Gln Lys Glu Gln Pro Thr Pro Pro Thr Lys 
            260                 265                 270         


Glu Asn Phe Phe Lys Ala Trp Asp Ala Lys Trp Gly Phe Ala Ser Lys 
        275                 280                 285             


Lys Tyr Pro Leu Ile Cys Thr Glu Leu Gly Trp Val Gln Pro Asp Gly 
    290                 295                 300                 


Tyr Gly Ala His Val Pro Val Lys Asn Asp Gly Ser Tyr Gly Pro Gln 
305                 310                 315                 320 


Ile Ile Glu Phe Met Glu Ala Arg Gly Ile Ser Trp Thr Ala Trp Val 
                325                 330                 335     


Phe Asp Pro Gln Trp Ser Pro Thr Met Ile Asn Asp Trp Ser Phe Thr 
            340                 345                 350         


Pro Ser Glu Gln Gly Ala Phe Phe Lys Lys Val Met Gln Glu Lys Ala 
        355                 360                 365             


Gly Lys 
    370 


<210> 171
<211> 1341
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> misc_feature   
<222> (1)...(1341)
<223> n = A,T,C or G

<400> 171
atgagcattc aacgcgacga ctttcccgcc gacttcgtgt ggggcaccgc caccgccgcc     60

taccagatcg agggggcggt agacgaggat ggccgcgccc ccagcatctg ggacaccttc    120

agccacaccc ctggcaagac ccgcgacggt gataccggcg acaccgcttg cgaccactac    180

caccgctacc gcgaggacgt ggccctgatg cgggagctgg gcgtcaacgg ctaccgcttc    240

tccatcgcct ggccgcgcgt gctgccccag gggcgcggtg tggtcaacgg ggccgggctc    300

gacttctacg accgcctggt ggatgcgctg ctgggagccg ggatcacccc ctgggctacg    360

ctctaccact gggatttgcc ccaagccctc gaggacgcgg gcggctggcc ccggcgcgac    420

accgcctacg ccctggccga gtacgccgcc gtggtggggc ggcgcctggg cgaccggctc    480

aagcgctgga tcaccctcaa cgagccgtgg tgctcggcct acttgggcta cggcaacggg    540

gtgcacgcgc ctggtcgcca ggacttcgcc ctctccttgc aggctgccca ccacctgctg    600

ctggggcacg gcctggcgac agaggcgctg cgggcggcgg tgcccggggc tcaggtgggg    660

gtcaccctca acctcacccc cacccacccg gccacgcccg atccccgcga cctcgaggcc    720

gcccgccgct acgacggctt cttcaaccgc tggtacctcg acccgctgtt cggtttcggc    780

tacccgctgg acatgtggga gctntacggg cggatggtgc cccacgtaga gccggaggac    840

ctgaggcgca tcgcgttctc tcgattcctg ggcatcaact actactcgcg ttcggtggtg    900

cggcacgcgg agcagggtcc cctgctcgtc gagcacgtgc gcccggaggg cgagtacacc    960

tacatgaact gggaggtcta tcccgatggc atccgcgaga tcgtggcgcg ggtggcccgc   1020

gagtaccggc cacgttccat ttacatcacc gagaacgggg cctgctatcc cgacggggtg   1080

gaggacgacg gggaaatcca cgactccaag cgcctggagt actaccgttc ccacctcagc   1140

aagtgcgccc aggccatccg cgagggggcg ccgctgaagg gctacttcgc ctggagcctg   1200

ctggacaact tcgagtgggc cgagggctac gacaagcgct ttgggctgtt ctatgtcaac   1260

ttcgccaccc aggagcgccg cctcaagcag agcgggcgct ggctgaaggg cttcctcgag   1320

ggcgctgcgg aggccggttg a                                             1341

<210> 172
<211> 446
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (4)...(444)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (12)...(26)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (352)...(360)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 172
Met Ser Ile Gln Arg Asp Asp Phe Pro Ala Asp Phe Val Trp Gly Thr 
1               5                   10                  15      


Ala Thr Ala Ala Tyr Gln Ile Glu Gly Ala Val Asp Glu Asp Gly Arg 
            20                  25                  30          


Ala Pro Ser Ile Trp Asp Thr Phe Ser His Thr Pro Gly Lys Thr Arg 
        35                  40                  45              


Asp Gly Asp Thr Gly Asp Thr Ala Cys Asp His Tyr His Arg Tyr Arg 
    50                  55                  60                  


Glu Asp Val Ala Leu Met Arg Glu Leu Gly Val Asn Gly Tyr Arg Phe 
65                  70                  75                  80  


Ser Ile Ala Trp Pro Arg Val Leu Pro Gln Gly Arg Gly Val Val Asn 
                85                  90                  95      


Gly Ala Gly Leu Asp Phe Tyr Asp Arg Leu Val Asp Ala Leu Leu Gly 
            100                 105                 110         


Ala Gly Ile Thr Pro Trp Ala Thr Leu Tyr His Trp Asp Leu Pro Gln 
        115                 120                 125             


Ala Leu Glu Asp Ala Gly Gly Trp Pro Arg Arg Asp Thr Ala Tyr Ala 
    130                 135                 140                 


Leu Ala Glu Tyr Ala Ala Val Val Gly Arg Arg Leu Gly Asp Arg Leu 
145                 150                 155                 160 


Lys Arg Trp Ile Thr Leu Asn Glu Pro Trp Cys Ser Ala Tyr Leu Gly 
                165                 170                 175     


Tyr Gly Asn Gly Val His Ala Pro Gly Arg Gln Asp Phe Ala Leu Ser 
            180                 185                 190         


Leu Gln Ala Ala His His Leu Leu Leu Gly His Gly Leu Ala Thr Glu 
        195                 200                 205             


Ala Leu Arg Ala Ala Val Pro Gly Ala Gln Val Gly Val Thr Leu Asn 
    210                 215                 220                 


Leu Thr Pro Thr His Pro Ala Thr Pro Asp Pro Arg Asp Leu Glu Ala 
225                 230                 235                 240 


Ala Arg Arg Tyr Asp Gly Phe Phe Asn Arg Trp Tyr Leu Asp Pro Leu 
                245                 250                 255     


Phe Gly Phe Gly Tyr Pro Leu Asp Met Trp Glu Leu Tyr Gly Arg Met 
            260                 265                 270         


Val Pro His Val Glu Pro Glu Asp Leu Arg Arg Ile Ala Phe Ser Arg 
        275                 280                 285             


Phe Leu Gly Ile Asn Tyr Tyr Ser Arg Ser Val Val Arg His Ala Glu 
    290                 295                 300                 


Gln Gly Pro Leu Leu Val Glu His Val Arg Pro Glu Gly Glu Tyr Thr 
305                 310                 315                 320 


Tyr Met Asn Trp Glu Val Tyr Pro Asp Gly Ile Arg Glu Ile Val Ala 
                325                 330                 335     


Arg Val Ala Arg Glu Tyr Arg Pro Arg Ser Ile Tyr Ile Thr Glu Asn 
            340                 345                 350         


Gly Ala Cys Tyr Pro Asp Gly Val Glu Asp Asp Gly Glu Ile His Asp 
        355                 360                 365             


Ser Lys Arg Leu Glu Tyr Tyr Arg Ser His Leu Ser Lys Cys Ala Gln 
    370                 375                 380                 


Ala Ile Arg Glu Gly Ala Pro Leu Lys Gly Tyr Phe Ala Trp Ser Leu 
385                 390                 395                 400 


Leu Asp Asn Phe Glu Trp Ala Glu Gly Tyr Asp Lys Arg Phe Gly Leu 
                405                 410                 415     


Phe Tyr Val Asn Phe Ala Thr Gln Glu Arg Arg Leu Lys Gln Ser Gly 
            420                 425                 430         


Arg Trp Leu Lys Gly Phe Leu Glu Gly Ala Ala Glu Ala Gly 
        435                 440                 445     


<210> 173
<211> 2205
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 173
atgaagagtg ttttagcatt ggccttaata gtttccatta atttggtttt attagcaaat     60

tctgtattaa ttccgccaaa tatcgacgat tctttcttat acggtaacga acttgtgaag    120

aaaccttcgg aagccggggc tttgcgagtt attgagtata acgggataaa aaccttggga    180

gatgaagaag gtaatcctat tcaattaagg ggtatgagca cgcacgggct tcaatggttc    240

cccgaaattc taaacgaaaa tgctttcgcc gctcttgcaa atgattggga agcgaatgta    300

attcgccttg ccatgtatgt gggagaagac ggatacgcaa cagatcctga aaccatcaaa    360

aaaagagtca ttcaaggaat cgatcttgca aagaagtacg atatgtacgt gatcgtagat    420

tggcacgtac atgcaccagg agatccaact gatccaatat atgagggagc acaggaattc    480

tttaaagaaa tttcagaatt gtatcctaat gaccctcata tcatctatga attatgtaac    540

gaaccaaatg gtatcccaaa tgatgaaacc ggttggcaaa ttgtaaaaga ttatgcagaa    600

cctattattg aaatgcttcg ggaaaacggg aatgaaaaca tcgttattgt tggaaatcct    660

aattggagcc aaagacctga tttagctgct gatgatccaa tcgatgatca aaataccgtt    720

tacacccttc atttttatgc aggtacccat aaaccgtcac cagatagtta cgtaatgaaa    780

aatgcaattt atgcattgaa gcacggtgcc cctatttttg tatctgaatg gggaacaagc    840

gaagctactg gcgatggtgg accatatata gaagaatcag atgaatggct taagtttttg    900

aacgccaata acgttagttg ggttaattgg tctcttacta ataaaaatga aaaatctgcg    960

gcatttacac catacgtata tggtgaatct gaagcaacag atcttgatcc aggacctgat   1020

caaatatggt caccagagga actaagtatt tcaggagaat atgttcgtac acgtataaaa   1080

ggtgtagaat acgaaccaat tgaccgttca aattactaca aaacattatg ggattttgat   1140

gatgggacaa ctcaaggctt tgtagtaaat tccgacagtc ctgtcactga tgtatcgttg   1200

accaatgagg ataatagatt aaaaatttct ggtttagatg caagcgatga cgttagcgaa   1260

ggaggatttt ggaacaatct tcgtatttct gcggataatt ggggcaacgc agttgatatt   1320

accggggctg gaaaaataat gatagatgtt attttaaaat ctcctgcaac tgttgctata   1380

gcagtgattc ctcaaggacc cgggaatggc tggtggacta atcctgctcg cgctgtaagg   1440

ctcaccccaa atgattttgt caaacaagtt gatggaactt acaaagcagt tctaacaatt   1500

acggctgaag atagcccggg cttggaaaca attggcacga gtgacacaga caatacgatt   1560

caaaatatcg tccttttcgt aggtacagag ggagcagatg taatttattt ggataatttt   1620

aaggtttctg gcaaaaaaat tgaaattccc attatccatg atccgttagg tgaagcaaaa   1680

ttaccgtcgg attttgaaga tggtactcgc cagggttgga aatggagtag tgactcagca   1740

gtgcaaacag ctcttaccat tgaagaagcc aacggttcaa aagctttatc ttgggaagta   1800

gcttatcctg aagtaaagcc aagtgatacg tgggcatccg caccacgttt agatttctgg   1860

atggatggtc taaaacgcgg ggataaaaat ttcttgtttt ttgattttta tttagctccc   1920

gatagagctt ccgaaggaat tattgaaata aatttggcgt ttcaaccatc atcagcaggt   1980

tattgggcac aggcaaccga tacttatgaa attgatttat cagatcttga ttcaacaact   2040

gtaacaaatg atggcctgta ccattatgaa gtaaaaatag atttaaattc agttcccaat   2100

attgaagaca acactgattt aagaaatatg cttttaattt ttgttgatgt aaacagcgac   2160

ttcgctggaa gaatgtatat agataatgtc aattttattg aataa                   2205

<210> 174
<211> 734
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (57)...(320)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (342)...(543)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> DOMAIN
<222> (544)...(733)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> SITE
<222> (176)...(185)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (224)...(227)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (308)...(311)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (313)...(316)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (395)...(398)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (600)...(603)
<223> N-glycosylation site. Prosite id = PS00001

<400> 174
Met Lys Ser Val Leu Ala Leu Ala Leu Ile Val Ser Ile Asn Leu Val 
1               5                   10                  15      


Leu Leu Ala Asn Ser Val Leu Ile Pro Pro Asn Ile Asp Asp Ser Phe 
            20                  25                  30          


Leu Tyr Gly Asn Glu Leu Val Lys Lys Pro Ser Glu Ala Gly Ala Leu 
        35                  40                  45              


Arg Val Ile Glu Tyr Asn Gly Ile Lys Thr Leu Gly Asp Glu Glu Gly 
    50                  55                  60                  


Asn Pro Ile Gln Leu Arg Gly Met Ser Thr His Gly Leu Gln Trp Phe 
65                  70                  75                  80  


Pro Glu Ile Leu Asn Glu Asn Ala Phe Ala Ala Leu Ala Asn Asp Trp 
                85                  90                  95      


Glu Ala Asn Val Ile Arg Leu Ala Met Tyr Val Gly Glu Asp Gly Tyr 
            100                 105                 110         


Ala Thr Asp Pro Glu Thr Ile Lys Lys Arg Val Ile Gln Gly Ile Asp 
        115                 120                 125             


Leu Ala Lys Lys Tyr Asp Met Tyr Val Ile Val Asp Trp His Val His 
    130                 135                 140                 


Ala Pro Gly Asp Pro Thr Asp Pro Ile Tyr Glu Gly Ala Gln Glu Phe 
145                 150                 155                 160 


Phe Lys Glu Ile Ser Glu Leu Tyr Pro Asn Asp Pro His Ile Ile Tyr 
                165                 170                 175     


Glu Leu Cys Asn Glu Pro Asn Gly Ile Pro Asn Asp Glu Thr Gly Trp 
            180                 185                 190         


Gln Ile Val Lys Asp Tyr Ala Glu Pro Ile Ile Glu Met Leu Arg Glu 
        195                 200                 205             


Asn Gly Asn Glu Asn Ile Val Ile Val Gly Asn Pro Asn Trp Ser Gln 
    210                 215                 220                 


Arg Pro Asp Leu Ala Ala Asp Asp Pro Ile Asp Asp Gln Asn Thr Val 
225                 230                 235                 240 


Tyr Thr Leu His Phe Tyr Ala Gly Thr His Lys Pro Ser Pro Asp Ser 
                245                 250                 255     


Tyr Val Met Lys Asn Ala Ile Tyr Ala Leu Lys His Gly Ala Pro Ile 
            260                 265                 270         


Phe Val Ser Glu Trp Gly Thr Ser Glu Ala Thr Gly Asp Gly Gly Pro 
        275                 280                 285             


Tyr Ile Glu Glu Ser Asp Glu Trp Leu Lys Phe Leu Asn Ala Asn Asn 
    290                 295                 300                 


Val Ser Trp Val Asn Trp Ser Leu Thr Asn Lys Asn Glu Lys Ser Ala 
305                 310                 315                 320 


Ala Phe Thr Pro Tyr Val Tyr Gly Glu Ser Glu Ala Thr Asp Leu Asp 
                325                 330                 335     


Pro Gly Pro Asp Gln Ile Trp Ser Pro Glu Glu Leu Ser Ile Ser Gly 
            340                 345                 350         


Glu Tyr Val Arg Thr Arg Ile Lys Gly Val Glu Tyr Glu Pro Ile Asp 
        355                 360                 365             


Arg Ser Asn Tyr Tyr Lys Thr Leu Trp Asp Phe Asp Asp Gly Thr Thr 
    370                 375                 380                 


Gln Gly Phe Val Val Asn Ser Asp Ser Pro Val Thr Asp Val Ser Leu 
385                 390                 395                 400 


Thr Asn Glu Asp Asn Arg Leu Lys Ile Ser Gly Leu Asp Ala Ser Asp 
                405                 410                 415     


Asp Val Ser Glu Gly Gly Phe Trp Asn Asn Leu Arg Ile Ser Ala Asp 
            420                 425                 430         


Asn Trp Gly Asn Ala Val Asp Ile Thr Gly Ala Gly Lys Ile Met Ile 
        435                 440                 445             


Asp Val Ile Leu Lys Ser Pro Ala Thr Val Ala Ile Ala Val Ile Pro 
    450                 455                 460                 


Gln Gly Pro Gly Asn Gly Trp Trp Thr Asn Pro Ala Arg Ala Val Arg 
465                 470                 475                 480 


Leu Thr Pro Asn Asp Phe Val Lys Gln Val Asp Gly Thr Tyr Lys Ala 
                485                 490                 495     


Val Leu Thr Ile Thr Ala Glu Asp Ser Pro Gly Leu Glu Thr Ile Gly 
            500                 505                 510         


Thr Ser Asp Thr Asp Asn Thr Ile Gln Asn Ile Val Leu Phe Val Gly 
        515                 520                 525             


Thr Glu Gly Ala Asp Val Ile Tyr Leu Asp Asn Phe Lys Val Ser Gly 
    530                 535                 540                 


Lys Lys Ile Glu Ile Pro Ile Ile His Asp Pro Leu Gly Glu Ala Lys 
545                 550                 555                 560 


Leu Pro Ser Asp Phe Glu Asp Gly Thr Arg Gln Gly Trp Lys Trp Ser 
                565                 570                 575     


Ser Asp Ser Ala Val Gln Thr Ala Leu Thr Ile Glu Glu Ala Asn Gly 
            580                 585                 590         


Ser Lys Ala Leu Ser Trp Glu Val Ala Tyr Pro Glu Val Lys Pro Ser 
        595                 600                 605             


Asp Thr Trp Ala Ser Ala Pro Arg Leu Asp Phe Trp Met Asp Gly Leu 
    610                 615                 620                 


Lys Arg Gly Asp Lys Asn Phe Leu Phe Phe Asp Phe Tyr Leu Ala Pro 
625                 630                 635                 640 


Asp Arg Ala Ser Glu Gly Ile Ile Glu Ile Asn Leu Ala Phe Gln Pro 
                645                 650                 655     


Ser Ser Ala Gly Tyr Trp Ala Gln Ala Thr Asp Thr Tyr Glu Ile Asp 
            660                 665                 670         


Leu Ser Asp Leu Asp Ser Thr Thr Val Thr Asn Asp Gly Leu Tyr His 
        675                 680                 685             


Tyr Glu Val Lys Ile Asp Leu Asn Ser Val Pro Asn Ile Glu Asp Asn 
    690                 695                 700                 


Thr Asp Leu Arg Asn Met Leu Leu Ile Phe Val Asp Val Asn Ser Asp 
705                 710                 715                 720 


Phe Ala Gly Arg Met Tyr Ile Asp Asn Val Asn Phe Ile Glu 
                725                 730                 


<210> 175
<211> 1809
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 175
atgacatttg aaaaaccaat cttcgaacgc tttcgtttgc caagttgcaa gccggtcatt     60

ttaggcctgc tgtcattggc gcttgcagcc tgtggcggtg ttcccgggaa cgagactgac    120

aatgccgggg tatcgtcaaa tgatggttct agcttatcgt cgtccagctt aaactcttca    180

tccagttcaa gcaccgtatc cagttcaagc tcgtccttaa gctcaagccc tttggttagc    240

tcgatctctg tatccagctc aagttcatca atcagttcca gttcgtcgat tagctcaagt    300

tcattaagca gttcaagctc gtcgattagc tcaagctcat ccatcagctc aagttcggtg    360

tctagctcaa gctcatcaat cagttcgagt tcatcgtcca gctccacatc cagttctgac    420

gacagccgcg tggtagaggc agaatccctc aacccgaatc aatcctctga ctacatgttg    480

gtcgtatccg agagcgatcc gcgcctagaa tatgtcggct attttgatgc cggaagctat    540

atttgttacg acaacattga cctgaccggc gtgcgcagca ttgatatgca atacgccaaa    600

ggcatgagcc aaaatggccg ttttgcggtc attatcaatg gcaacagtct gggctctggc    660

actaacctcg gcgaaaagat cacccgccca acttctagct ccaccgccga ctgggaatca    720

tttaccagtt tacgcgtcgg cttatctcaa caagtcagtg gtactcatcg cctgtgcttt    780

gtcgggctaa acggcggcgg aatattcaac ctggataaat tcaccttgag tgatagcacc    840

ggggaaaacg acggtattac gcctccgccc agcagtggca cagcgccccc tcccgctcct    900

ggcgacaaca cggttagcca aggggttttg ccgatcacta cgagcggcaa ccaagtgctt    960

tttggtggcc aaccgggcag cattgctggc atgagtttgt tctggagcaa caacaactgg   1020

ggcggtgagc ggttttacaa cgccgacgtg gtgcgctggt tgaagcagga ttggaatatc   1080

aaactgatac gtgccgccat gggcgtaggc accgagcccg gtggttacat tcaaagccca   1140

ggccccaacc gccagcgcgt gcagaccatt gtagatgcgg ctattgccaa tgacctctat   1200

gtgatcatcg actggcacgc ccacgcagcg gaaaactata ctgatcaggc cattgccttc   1260

tttactgaca tggcgcggca gtatgggcag tacaacaata ttatctacga aatctacaat   1320

gagcctctgc agaatacctc gtgggacaac accatcaaac cttatgcgga gcgggtcatt   1380

gccgccattc gtgcgatcga tccagataac ctgatcatcg tgggcacccg ctcctggtct   1440

cagcgggttg atgaagctgc ggccaatcct attcgcaact accctaatat tgcatacacc   1500

ttgcacttct actctggcac ccacaaacaa gccattcgca attatgcgac cacagcgctg   1560

aacaacggaa ttccgctctt tgttaccgaa tggggcacaa ccgatgctag cggagacggg   1620

gctgtggatg tcgccgaaac gcgaatttgg atggattttt tgcgcgccaa caacatcagt   1680

cacgccaact ggtcgctgaa cgataaggcc gaaggctcag ctgcattgcg ccccggtgcc   1740

agcactactg gaggctggag taacaacgac ctgaccgaat ccggccgatt ggtgcgagat   1800

tacattcgc                                                           1809

<210> 176
<211> 603
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(36)

<220> 
<221> DOMAIN
<222> (316)...(574)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (37)...(40)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (58)...(61)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (155)...(158)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (418)...(421)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (440)...(449)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (451)...(454)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (566)...(569)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (571)...(574)
<223> N-glycosylation site. Prosite id = PS00001

<400> 176
Met Thr Phe Glu Lys Pro Ile Phe Glu Arg Phe Arg Leu Pro Ser Cys 
1               5                   10                  15      


Lys Pro Val Ile Leu Gly Leu Leu Ser Leu Ala Leu Ala Ala Cys Gly 
            20                  25                  30          


Gly Val Pro Gly Asn Glu Thr Asp Asn Ala Gly Val Ser Ser Asn Asp 
        35                  40                  45              


Gly Ser Ser Leu Ser Ser Ser Ser Leu Asn Ser Ser Ser Ser Ser Ser 
    50                  55                  60                  


Thr Val Ser Ser Ser Ser Ser Ser Leu Ser Ser Ser Pro Leu Val Ser 
65                  70                  75                  80  


Ser Ile Ser Val Ser Ser Ser Ser Ser Ser Ile Ser Ser Ser Ser Ser 
                85                  90                  95      


Ile Ser Ser Ser Ser Leu Ser Ser Ser Ser Ser Ser Ile Ser Ser Ser 
            100                 105                 110         


Ser Ser Ile Ser Ser Ser Ser Val Ser Ser Ser Ser Ser Ser Ile Ser 
        115                 120                 125             


Ser Ser Ser Ser Ser Ser Ser Thr Ser Ser Ser Asp Asp Ser Arg Val 
    130                 135                 140                 


Val Glu Ala Glu Ser Leu Asn Pro Asn Gln Ser Ser Asp Tyr Met Leu 
145                 150                 155                 160 


Val Val Ser Glu Ser Asp Pro Arg Leu Glu Tyr Val Gly Tyr Phe Asp 
                165                 170                 175     


Ala Gly Ser Tyr Ile Cys Tyr Asp Asn Ile Asp Leu Thr Gly Val Arg 
            180                 185                 190         


Ser Ile Asp Met Gln Tyr Ala Lys Gly Met Ser Gln Asn Gly Arg Phe 
        195                 200                 205             


Ala Val Ile Ile Asn Gly Asn Ser Leu Gly Ser Gly Thr Asn Leu Gly 
    210                 215                 220                 


Glu Lys Ile Thr Arg Pro Thr Ser Ser Ser Thr Ala Asp Trp Glu Ser 
225                 230                 235                 240 


Phe Thr Ser Leu Arg Val Gly Leu Ser Gln Gln Val Ser Gly Thr His 
                245                 250                 255     


Arg Leu Cys Phe Val Gly Leu Asn Gly Gly Gly Ile Phe Asn Leu Asp 
            260                 265                 270         


Lys Phe Thr Leu Ser Asp Ser Thr Gly Glu Asn Asp Gly Ile Thr Pro 
        275                 280                 285             


Pro Pro Ser Ser Gly Thr Ala Pro Pro Pro Ala Pro Gly Asp Asn Thr 
    290                 295                 300                 


Val Ser Gln Gly Val Leu Pro Ile Thr Thr Ser Gly Asn Gln Val Leu 
305                 310                 315                 320 


Phe Gly Gly Gln Pro Gly Ser Ile Ala Gly Met Ser Leu Phe Trp Ser 
                325                 330                 335     


Asn Asn Asn Trp Gly Gly Glu Arg Phe Tyr Asn Ala Asp Val Val Arg 
            340                 345                 350         


Trp Leu Lys Gln Asp Trp Asn Ile Lys Leu Ile Arg Ala Ala Met Gly 
        355                 360                 365             


Val Gly Thr Glu Pro Gly Gly Tyr Ile Gln Ser Pro Gly Pro Asn Arg 
    370                 375                 380                 


Gln Arg Val Gln Thr Ile Val Asp Ala Ala Ile Ala Asn Asp Leu Tyr 
385                 390                 395                 400 


Val Ile Ile Asp Trp His Ala His Ala Ala Glu Asn Tyr Thr Asp Gln 
                405                 410                 415     


Ala Ile Ala Phe Phe Thr Asp Met Ala Arg Gln Tyr Gly Gln Tyr Asn 
            420                 425                 430         


Asn Ile Ile Tyr Glu Ile Tyr Asn Glu Pro Leu Gln Asn Thr Ser Trp 
        435                 440                 445             


Asp Asn Thr Ile Lys Pro Tyr Ala Glu Arg Val Ile Ala Ala Ile Arg 
    450                 455                 460                 


Ala Ile Asp Pro Asp Asn Leu Ile Ile Val Gly Thr Arg Ser Trp Ser 
465                 470                 475                 480 


Gln Arg Val Asp Glu Ala Ala Ala Asn Pro Ile Arg Asn Tyr Pro Asn 
                485                 490                 495     


Ile Ala Tyr Thr Leu His Phe Tyr Ser Gly Thr His Lys Gln Ala Ile 
            500                 505                 510         


Arg Asn Tyr Ala Thr Thr Ala Leu Asn Asn Gly Ile Pro Leu Phe Val 
        515                 520                 525             


Thr Glu Trp Gly Thr Thr Asp Ala Ser Gly Asp Gly Ala Val Asp Val 
    530                 535                 540                 


Ala Glu Thr Arg Ile Trp Met Asp Phe Leu Arg Ala Asn Asn Ile Ser 
545                 550                 555                 560 


His Ala Asn Trp Ser Leu Asn Asp Lys Ala Glu Gly Ser Ala Ala Leu 
                565                 570                 575     


Arg Pro Gly Ala Ser Thr Thr Gly Gly Trp Ser Asn Asn Asp Leu Thr 
            580                 585                 590         


Glu Ser Gly Arg Leu Val Arg Asp Tyr Ile Arg 
        595                 600             


<210> 177
<211> 1218
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 177
atgttggttt atagagtttc aattcaaaag cacttggcgt cactaacggt actcgtttcg     60

ctgctgttaa ttctcgccgg ttgctccagc tcgagcgatt ccatagcccc agtgtcatcc    120

tccagtgtgt ctagcgctgc cagctctgtc ggcgagatgc ctgcgccggt tcccattacc    180

cccgcgggcg agccgattac tgcgctggat gccgccgcgg agatgggcgc tggttttaac    240

ctcggtcaaa tgtttgataa cactcagcat gcgcgcacat tttatgcggc gcagcccaaa    300

attgatgcct actatgagtt gggctatcgc aatctgcgca tcccaattac ctggaccgat    360

attgtcggtg gcgaccgttt ggtcaatgat cccgatgtgg gggatgtgga ttttgatcat    420

ccgcgcttaa acgagattgc ccagattatt gattatgcgc tctcgctgtc cgacatgtac    480

gtgattatta atgcgcacca cgagcgcgat ttgaagaacg ataataaatg gcaggtattg    540

gaacggttgt ggttggatat agcgacccat tttggcgatc gcgattatcg gctgatgttc    600

caactgctca acgagcccca tttgaataat gatgacccca tgccggttgc caatcttcga    660

tttatgagtg gcaaagccta cgatatgatt cgggcggtga acgccaaacg tattgttatc    720

attggcggca atcaatggtt tgccgccgat gaaatggcgc gggtctggcc agatttacaa    780

ccggtgggcg gtggtgaaga tccctacgtt atggccagtt ttcaccacta taacccctgg    840

gagttttccg gcgacaacca agacgattac gcctacccct ggactgaaga ccacttaacg    900

tcacctatag acactatgct cgagtggtca caaaatatgg gcaacggcat gccgatctat    960

attagcgagt ggggtgtcgg atggcagagc accttggccg tgatggattg caataatatc   1020

cgggaatggt acgcccagat gcacgttcac cacgccgcac ccaagggcat ccctacctct   1080

gtctgggatg atggcggctg gttccgaatt tttgatcaca gcagtaatgt ttttgacaat   1140

gaactggcca cttgtttgat tgatggccaa tgcgattgga gcggaacaga gcgctttaat   1200

atgggatgtt tccgctag                                                 1218

<210> 178
<211> 405
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(46)

<220> 
<221> DOMAIN
<222> (67)...(370)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (201)...(210)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<400> 178
Met Leu Val Tyr Arg Val Ser Ile Gln Lys His Leu Ala Ser Leu Thr 
1               5                   10                  15      


Val Leu Val Ser Leu Leu Leu Ile Leu Ala Gly Cys Ser Ser Ser Ser 
            20                  25                  30          


Asp Ser Ile Ala Pro Val Ser Ser Ser Ser Val Ser Ser Ala Ala Ser 
        35                  40                  45              


Ser Val Gly Glu Met Pro Ala Pro Val Pro Ile Thr Pro Ala Gly Glu 
    50                  55                  60                  


Pro Ile Thr Ala Leu Asp Ala Ala Ala Glu Met Gly Ala Gly Phe Asn 
65                  70                  75                  80  


Leu Gly Gln Met Phe Asp Asn Thr Gln His Ala Arg Thr Phe Tyr Ala 
                85                  90                  95      


Ala Gln Pro Lys Ile Asp Ala Tyr Tyr Glu Leu Gly Tyr Arg Asn Leu 
            100                 105                 110         


Arg Ile Pro Ile Thr Trp Thr Asp Ile Val Gly Gly Asp Arg Leu Val 
        115                 120                 125             


Asn Asp Pro Asp Val Gly Asp Val Asp Phe Asp His Pro Arg Leu Asn 
    130                 135                 140                 


Glu Ile Ala Gln Ile Ile Asp Tyr Ala Leu Ser Leu Ser Asp Met Tyr 
145                 150                 155                 160 


Val Ile Ile Asn Ala His His Glu Arg Asp Leu Lys Asn Asp Asn Lys 
                165                 170                 175     


Trp Gln Val Leu Glu Arg Leu Trp Leu Asp Ile Ala Thr His Phe Gly 
            180                 185                 190         


Asp Arg Asp Tyr Arg Leu Met Phe Gln Leu Leu Asn Glu Pro His Leu 
        195                 200                 205             


Asn Asn Asp Asp Pro Met Pro Val Ala Asn Leu Arg Phe Met Ser Gly 
    210                 215                 220                 


Lys Ala Tyr Asp Met Ile Arg Ala Val Asn Ala Lys Arg Ile Val Ile 
225                 230                 235                 240 


Ile Gly Gly Asn Gln Trp Phe Ala Ala Asp Glu Met Ala Arg Val Trp 
                245                 250                 255     


Pro Asp Leu Gln Pro Val Gly Gly Gly Glu Asp Pro Tyr Val Met Ala 
            260                 265                 270         


Ser Phe His His Tyr Asn Pro Trp Glu Phe Ser Gly Asp Asn Gln Asp 
        275                 280                 285             


Asp Tyr Ala Tyr Pro Trp Thr Glu Asp His Leu Thr Ser Pro Ile Asp 
    290                 295                 300                 


Thr Met Leu Glu Trp Ser Gln Asn Met Gly Asn Gly Met Pro Ile Tyr 
305                 310                 315                 320 


Ile Ser Glu Trp Gly Val Gly Trp Gln Ser Thr Leu Ala Val Met Asp 
                325                 330                 335     


Cys Asn Asn Ile Arg Glu Trp Tyr Ala Gln Met His Val His His Ala 
            340                 345                 350         


Ala Pro Lys Gly Ile Pro Thr Ser Val Trp Asp Asp Gly Gly Trp Phe 
        355                 360                 365             


Arg Ile Phe Asp His Ser Ser Asn Val Phe Asp Asn Glu Leu Ala Thr 
    370                 375                 380                 


Cys Leu Ile Asp Gly Gln Cys Asp Trp Ser Gly Thr Glu Arg Phe Asn 
385                 390                 395                 400 


Met Gly Cys Phe Arg 
                405 


<210> 179
<211> 2313
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 179
atgaaaaaaa gaattatttc agcgggctta acctttttga taggtgtttc gttacaggct     60

caaagtgaaa atttcacgat tataaaaaac aataaaggag ccgatttagg atattctccc    120

gaatcgggta ttaaaattat aaccattaac ggaaagaaat ttaaagattt aaataaaaac    180

ggaaaacttg ataaatatga agattggcga ctttcggcag atgaaagagc caaagattta    240

gcctcgcaaa tgtccgtaga gcaaatagca ggattgatgc tttacagtcg tcatcaatct    300

ttgccagcag gaacttcagg ttttatggca ggaacatata atggaaaacc atttacggag    360

cgtaatgcaa aagcttacga tttaacagat cagcaaatgg cttttttgaa agatgataac    420

cttcgtcatg tcttaatcac aagcgtcgaa agccctgaaa cagctgcttt atggaacaat    480

aaaatgcagg cttttgtaga agatattggc ttaggaattc cgagtaatac aagtacagat    540

ccgcgtcata atgctgttgt aacggctgag tttaatgcag gagcaggagg gaccatatcg    600

atgtggcctg acggtttagg aatggcagct acttttgacc caaaaatagt agaacagttt    660

gggcaaattg ctgccaaaga atatcgcgct ttaggaatcg caactgcact ttcgccacaa    720

atagatttag gttcagagcc aagatggtat agaatttcga tgacttttgg cgaaagcccg    780

gctttaacca gagatatggg acgtgcttat attgacggtt ttcaaacttc atacggaaaa    840

gacgaaatta aagacggttg gggttacaaa agtgtaaatg caatggttaa acactggcca    900

agtggtggag ccgaagaagg cggtcgtgac ggacattggg cttacggaaa gtttgcggta    960

tatcccggaa ataatttgca acagcatatt aatccttttg taaatggcgc tttcaaatta   1020

aaaggaaaaa ccggtaaagc ttcggctgta atgccatatt acacgatcac ttttaatcag   1080

gataaaaagt acaatgaaaa tgtggcaaac ggttacagta aatatattat tacagattta   1140

ttaagagata aatacggtta cgatggtgtt gtttgtacag attggttaat tactgctgat   1200

gaaggaaaaa caccaaatgt atttgccgga aaaccttggg gcgtagagaa tctttcgatt   1260

gccgaaagac attacaaagc aattattgca ggtgtagacc aatttggtgg aaacaatgat   1320

aaaaaacctg ttctcgaagc ctatgatatg ggagtaaaag aatatggaga atcattcatg   1380

agagctcgtt ttgaaagatc agcagtacga ttattaaaga atatttttag agttggtctt   1440

tttgaaaacc catatttaaa tgttgccgaa acaaaagcaa ttgtcggaag tcctgaattt   1500

atgaaagcgg gatatgatgc gcaattaaaa tcggtagtgt tgttaaaaaa taaaacttca   1560

cttcttccga taaaagaaaa gaaaaccgtt tttattccaa agatttatac cgcttctact   1620

aaggattggt ggggaattcc aagtcagcca aaacttgatt atccggtaaa tcttgaattg   1680

gtaaaaaaat attataatgt taccgaagat ccttcaaaag cagattttgc gattgttttt   1740

gtaacaagtc cgcaaagttt agaaggtggt tatgatttga aagacagaca aaacggaagc   1800

aatggttatg tgccaatttc gcttcaatac ggaacttata cagcaaccga agccagagca   1860

aaaagtattg ctgccggaga tcaggttatt gatccaacaa tcaaagacag aacttataaa   1920

aacaaaaccg ttaccgttgc aaatacaatg gatttaagaa cgattttgga tacaaaagat   1980

atgatgaacg gaaaaccggt tattgtttcg gttacggctt caaagccaat gatctttgat   2040

gaattcgaaa aacaagttga cggaattgta ttgaattttg gagtttcgac acaggctgtt   2100

ttagatatta tctccggaaa aatagaacca tcaggattgc ttccggttca gatgccggca   2160

aatatggaaa cggttgaaaa acaatttgaa gacgttcctt atgatatgat tccgcataaa   2220

gacagcgaag gaaatgttta tgattttgct tatggattaa actggaaagg tgttataaaa   2280

gataccagaa ctgaaactta taagaaacag taa                                2313

<210> 180
<211> 770
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (165)...(437)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (512)...(757)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (24)...(27)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (178)...(181)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (423)...(426)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (524)...(527)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (574)...(577)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (607)...(610)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (650)...(653)
<223> N-glycosylation site. Prosite id = PS00001

<400> 180
Met Lys Lys Arg Ile Ile Ser Ala Gly Leu Thr Phe Leu Ile Gly Val 
1               5                   10                  15      


Ser Leu Gln Ala Gln Ser Glu Asn Phe Thr Ile Ile Lys Asn Asn Lys 
            20                  25                  30          


Gly Ala Asp Leu Gly Tyr Ser Pro Glu Ser Gly Ile Lys Ile Ile Thr 
        35                  40                  45              


Ile Asn Gly Lys Lys Phe Lys Asp Leu Asn Lys Asn Gly Lys Leu Asp 
    50                  55                  60                  


Lys Tyr Glu Asp Trp Arg Leu Ser Ala Asp Glu Arg Ala Lys Asp Leu 
65                  70                  75                  80  


Ala Ser Gln Met Ser Val Glu Gln Ile Ala Gly Leu Met Leu Tyr Ser 
                85                  90                  95      


Arg His Gln Ser Leu Pro Ala Gly Thr Ser Gly Phe Met Ala Gly Thr 
            100                 105                 110         


Tyr Asn Gly Lys Pro Phe Thr Glu Arg Asn Ala Lys Ala Tyr Asp Leu 
        115                 120                 125             


Thr Asp Gln Gln Met Ala Phe Leu Lys Asp Asp Asn Leu Arg His Val 
    130                 135                 140                 


Leu Ile Thr Ser Val Glu Ser Pro Glu Thr Ala Ala Leu Trp Asn Asn 
145                 150                 155                 160 


Lys Met Gln Ala Phe Val Glu Asp Ile Gly Leu Gly Ile Pro Ser Asn 
                165                 170                 175     


Thr Ser Thr Asp Pro Arg His Asn Ala Val Val Thr Ala Glu Phe Asn 
            180                 185                 190         


Ala Gly Ala Gly Gly Thr Ile Ser Met Trp Pro Asp Gly Leu Gly Met 
        195                 200                 205             


Ala Ala Thr Phe Asp Pro Lys Ile Val Glu Gln Phe Gly Gln Ile Ala 
    210                 215                 220                 


Ala Lys Glu Tyr Arg Ala Leu Gly Ile Ala Thr Ala Leu Ser Pro Gln 
225                 230                 235                 240 


Ile Asp Leu Gly Ser Glu Pro Arg Trp Tyr Arg Ile Ser Met Thr Phe 
                245                 250                 255     


Gly Glu Ser Pro Ala Leu Thr Arg Asp Met Gly Arg Ala Tyr Ile Asp 
            260                 265                 270         


Gly Phe Gln Thr Ser Tyr Gly Lys Asp Glu Ile Lys Asp Gly Trp Gly 
        275                 280                 285             


Tyr Lys Ser Val Asn Ala Met Val Lys His Trp Pro Ser Gly Gly Ala 
    290                 295                 300                 


Glu Glu Gly Gly Arg Asp Gly His Trp Ala Tyr Gly Lys Phe Ala Val 
305                 310                 315                 320 


Tyr Pro Gly Asn Asn Leu Gln Gln His Ile Asn Pro Phe Val Asn Gly 
                325                 330                 335     


Ala Phe Lys Leu Lys Gly Lys Thr Gly Lys Ala Ser Ala Val Met Pro 
            340                 345                 350         


Tyr Tyr Thr Ile Thr Phe Asn Gln Asp Lys Lys Tyr Asn Glu Asn Val 
        355                 360                 365             


Ala Asn Gly Tyr Ser Lys Tyr Ile Ile Thr Asp Leu Leu Arg Asp Lys 
    370                 375                 380                 


Tyr Gly Tyr Asp Gly Val Val Cys Thr Asp Trp Leu Ile Thr Ala Asp 
385                 390                 395                 400 


Glu Gly Lys Thr Pro Asn Val Phe Ala Gly Lys Pro Trp Gly Val Glu 
                405                 410                 415     


Asn Leu Ser Ile Ala Glu Arg His Tyr Lys Ala Ile Ile Ala Gly Val 
            420                 425                 430         


Asp Gln Phe Gly Gly Asn Asn Asp Lys Lys Pro Val Leu Glu Ala Tyr 
        435                 440                 445             


Asp Met Gly Val Lys Glu Tyr Gly Glu Ser Phe Met Arg Ala Arg Phe 
    450                 455                 460                 


Glu Arg Ser Ala Val Arg Leu Leu Lys Asn Ile Phe Arg Val Gly Leu 
465                 470                 475                 480 


Phe Glu Asn Pro Tyr Leu Asn Val Ala Glu Thr Lys Ala Ile Val Gly 
                485                 490                 495     


Ser Pro Glu Phe Met Lys Ala Gly Tyr Asp Ala Gln Leu Lys Ser Val 
            500                 505                 510         


Val Leu Leu Lys Asn Lys Thr Ser Leu Leu Pro Ile Lys Glu Lys Lys 
        515                 520                 525             


Thr Val Phe Ile Pro Lys Ile Tyr Thr Ala Ser Thr Lys Asp Trp Trp 
    530                 535                 540                 


Gly Ile Pro Ser Gln Pro Lys Leu Asp Tyr Pro Val Asn Leu Glu Leu 
545                 550                 555                 560 


Val Lys Lys Tyr Tyr Asn Val Thr Glu Asp Pro Ser Lys Ala Asp Phe 
                565                 570                 575     


Ala Ile Val Phe Val Thr Ser Pro Gln Ser Leu Glu Gly Gly Tyr Asp 
            580                 585                 590         


Leu Lys Asp Arg Gln Asn Gly Ser Asn Gly Tyr Val Pro Ile Ser Leu 
        595                 600                 605             


Gln Tyr Gly Thr Tyr Thr Ala Thr Glu Ala Arg Ala Lys Ser Ile Ala 
    610                 615                 620                 


Ala Gly Asp Gln Val Ile Asp Pro Thr Ile Lys Asp Arg Thr Tyr Lys 
625                 630                 635                 640 


Asn Lys Thr Val Thr Val Ala Asn Thr Met Asp Leu Arg Thr Ile Leu 
                645                 650                 655     


Asp Thr Lys Asp Met Met Asn Gly Lys Pro Val Ile Val Ser Val Thr 
            660                 665                 670         


Ala Ser Lys Pro Met Ile Phe Asp Glu Phe Glu Lys Gln Val Asp Gly 
        675                 680                 685             


Ile Val Leu Asn Phe Gly Val Ser Thr Gln Ala Val Leu Asp Ile Ile 
    690                 695                 700                 


Ser Gly Lys Ile Glu Pro Ser Gly Leu Leu Pro Val Gln Met Pro Ala 
705                 710                 715                 720 


Asn Met Glu Thr Val Glu Lys Gln Phe Glu Asp Val Pro Tyr Asp Met 
                725                 730                 735     


Ile Pro His Lys Asp Ser Glu Gly Asn Val Tyr Asp Phe Ala Tyr Gly 
            740                 745                 750         


Leu Asn Trp Lys Gly Val Ile Lys Asp Thr Arg Thr Glu Thr Tyr Lys 
        755                 760                 765             


Lys Gln 
    770 


<210> 181
<211> 1707
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 181
atgaaaagaa ctattttgag atttagtaaa ttcttaaaaa tagtgatttt aattactttc     60

actttgcaaa tatttactgt gtttgctaag aatacaccat atgaaagtag gaaatatcca    120

caccttcttg gcaaccaagc ggtgaaaaaa ccatcagttg ctggcaggct ccagattatt    180

gaaaaggacg gtaaaaagta tttagctgac cagaaaggcg aaataattca gcttcgtggt    240

atgagtacgc atggactcca gtggtatggt gacattgtaa acagaaatgc gtttgctgct    300

ctttcaaaag attggggatg caatgttata aggcttgcga tgtatgtggg tgaaggtggt    360

tatgcttcaa atccaggcat taaagaaaaa gttgtaaagg gaataaaact tgcaattgaa    420

aatgacatgt atgtgattgt tgactggcat gtgttaaatc caggcgaccc gaatgctgaa    480

atttataaag gggcaaaaga ctttttcaag gagatagcta caagttttcc caatgactat    540

cacataatat atgaactttg taatgaaccg aatccaaatg agccgggagt agaaaatagc    600

ttggatggtt ggaaaaaggt aaaggcttat gctgaaccca tcataaaaat gctcagaagc    660

ttggggaatc agaacattat aattgtaggt tcgcccaatt ggagccagag acctgacttt    720

gcaattcaag accctataaa cgacaaaaat gttatgtatt cagttcattt ttattctggt    780

actcacaagg ttgatgggta tgtttttgaa aacatgaaaa aggcatttga aaatggtgtg    840

ccaatttttg tgagtgaatg gggaacaagt ttggcaagcg gtgatggtgg accatatctt    900

gatgaggcgg ataagtggct tgagtattta aatgcaaact atattagctg ggtgaactgg    960

tcactgtcaa acaaaaatga aacatcagct gcttttgtct cgtatgttag tggcatgcat   1020

gatgccacat cacttgatcc tggcgatgat aagatgtggg atataaaaga gctgagtata   1080

tctggagagt atgtgagagc aaggataaaa gggattgcat ataaccctat aaagagagac   1140

aaaggaataa aatgtccttt taaagatctc aatgaagata atatttttta tgaacaagta   1200

gtaaaacttt attcaaaagg aattataaaa ggtactttat cttctaagta tttgcctgat   1260

aaaaacatca caagggctga acttgctgca ctatgtgtaa gattattgaa tctgaaaatt   1320

gaaaaatatg acggcaggtt ttctgatgta aaaagtagtg actggtacgc agatgtggtt   1380

tatacagcat ataaaaatgg tttgtttaaa caagaagaag aaaagagatt tttccctgaa   1440

agaatcacta aaagagaaga agtagctgct ttggcaatcg aagtatacaa aagattgata   1500

ggtaaattag aggttgacgt ggatgatatt caagttgtcg acgaagacct tataaagcct   1560

caatatagag agtgtgtgaa attagcagtt tatcttggta ttattgactt agattcagac   1620

ggaacctttg taccaagtaa gagcgtttcg agaggggagg cagcaacaat tttttataat   1680

gttttgaact tagcaggcaa gctatga                                       1707

<210> 182
<211> 568
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (66)...(330)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (352)...(565)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> DOMAIN
<222> (387)...(429)
<223> S-layer homology domain

<220> 
<221> DOMAIN
<222> (447)...(489)
<223> S-layer homology domain

<220> 
<221> SITE
<222> (184)...(193)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (236)...(239)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (331)...(334)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (428)...(431)
<223> N-glycosylation site. Prosite id = PS00001

<400> 182
Met Lys Arg Thr Ile Leu Arg Phe Ser Lys Phe Leu Lys Ile Val Ile 
1               5                   10                  15      


Leu Ile Thr Phe Thr Leu Gln Ile Phe Thr Val Phe Ala Lys Asn Thr 
            20                  25                  30          


Pro Tyr Glu Ser Arg Lys Tyr Pro His Leu Leu Gly Asn Gln Ala Val 
        35                  40                  45              


Lys Lys Pro Ser Val Ala Gly Arg Leu Gln Ile Ile Glu Lys Asp Gly 
    50                  55                  60                  


Lys Lys Tyr Leu Ala Asp Gln Lys Gly Glu Ile Ile Gln Leu Arg Gly 
65                  70                  75                  80  


Met Ser Thr His Gly Leu Gln Trp Tyr Gly Asp Ile Val Asn Arg Asn 
                85                  90                  95      


Ala Phe Ala Ala Leu Ser Lys Asp Trp Gly Cys Asn Val Ile Arg Leu 
            100                 105                 110         


Ala Met Tyr Val Gly Glu Gly Gly Tyr Ala Ser Asn Pro Gly Ile Lys 
        115                 120                 125             


Glu Lys Val Val Lys Gly Ile Lys Leu Ala Ile Glu Asn Asp Met Tyr 
    130                 135                 140                 


Val Ile Val Asp Trp His Val Leu Asn Pro Gly Asp Pro Asn Ala Glu 
145                 150                 155                 160 


Ile Tyr Lys Gly Ala Lys Asp Phe Phe Lys Glu Ile Ala Thr Ser Phe 
                165                 170                 175     


Pro Asn Asp Tyr His Ile Ile Tyr Glu Leu Cys Asn Glu Pro Asn Pro 
            180                 185                 190         


Asn Glu Pro Gly Val Glu Asn Ser Leu Asp Gly Trp Lys Lys Val Lys 
        195                 200                 205             


Ala Tyr Ala Glu Pro Ile Ile Lys Met Leu Arg Ser Leu Gly Asn Gln 
    210                 215                 220                 


Asn Ile Ile Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Phe 
225                 230                 235                 240 


Ala Ile Gln Asp Pro Ile Asn Asp Lys Asn Val Met Tyr Ser Val His 
                245                 250                 255     


Phe Tyr Ser Gly Thr His Lys Val Asp Gly Tyr Val Phe Glu Asn Met 
            260                 265                 270         


Lys Lys Ala Phe Glu Asn Gly Val Pro Ile Phe Val Ser Glu Trp Gly 
        275                 280                 285             


Thr Ser Leu Ala Ser Gly Asp Gly Gly Pro Tyr Leu Asp Glu Ala Asp 
    290                 295                 300                 


Lys Trp Leu Glu Tyr Leu Asn Ala Asn Tyr Ile Ser Trp Val Asn Trp 
305                 310                 315                 320 


Ser Leu Ser Asn Lys Asn Glu Thr Ser Ala Ala Phe Val Ser Tyr Val 
                325                 330                 335     


Ser Gly Met His Asp Ala Thr Ser Leu Asp Pro Gly Asp Asp Lys Met 
            340                 345                 350         


Trp Asp Ile Lys Glu Leu Ser Ile Ser Gly Glu Tyr Val Arg Ala Arg 
        355                 360                 365             


Ile Lys Gly Ile Ala Tyr Asn Pro Ile Lys Arg Asp Lys Gly Ile Lys 
    370                 375                 380                 


Cys Pro Phe Lys Asp Leu Asn Glu Asp Asn Ile Phe Tyr Glu Gln Val 
385                 390                 395                 400 


Val Lys Leu Tyr Ser Lys Gly Ile Ile Lys Gly Thr Leu Ser Ser Lys 
                405                 410                 415     


Tyr Leu Pro Asp Lys Asn Ile Thr Arg Ala Glu Leu Ala Ala Leu Cys 
            420                 425                 430         


Val Arg Leu Leu Asn Leu Lys Ile Glu Lys Tyr Asp Gly Arg Phe Ser 
        435                 440                 445             


Asp Val Lys Ser Ser Asp Trp Tyr Ala Asp Val Val Tyr Thr Ala Tyr 
    450                 455                 460                 


Lys Asn Gly Leu Phe Lys Gln Glu Glu Glu Lys Arg Phe Phe Pro Glu 
465                 470                 475                 480 


Arg Ile Thr Lys Arg Glu Glu Val Ala Ala Leu Ala Ile Glu Val Tyr 
                485                 490                 495     


Lys Arg Leu Ile Gly Lys Leu Glu Val Asp Val Asp Asp Ile Gln Val 
            500                 505                 510         


Val Asp Glu Asp Leu Ile Lys Pro Gln Tyr Arg Glu Cys Val Lys Leu 
        515                 520                 525             


Ala Val Tyr Leu Gly Ile Ile Asp Leu Asp Ser Asp Gly Thr Phe Val 
    530                 535                 540                 


Pro Ser Lys Ser Val Ser Arg Gly Glu Ala Ala Thr Ile Phe Tyr Asn 
545                 550                 555                 560 


Val Leu Asn Leu Ala Gly Lys Leu 
                565             


<210> 183
<211> 2268
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 183
atgagagaaa ttattttaaa gtctggtgca ctcttgatgg tagttatttt gattgtttct     60

attttgcaaa ttttaactgt gtttgcccag agcacaccat atgaaaatga aaaatatcca    120

caccttcttg gcaaccaagc ggtgaaaaaa ccatcagttg ctggcaggct ccagattatt    180

gaaaaggacg gtaaaaagta tttagctgac cagaaaggcg aaataattca gcttcgtggc    240

atgagcacgc atggactcca gtggtatggt gacattgtaa acagaaatgc gtttgctgct    300

ctttcaaaag attggggatg caatgttata aggcttgcga tgtatgtggg tgaaggtggt    360

tatgcttcaa atccaggcat taaagaaaaa gttgtaaagg gaataaaact tgcaattgaa    420

aatgacatgt atgtgattgt tgactggcat gtgttaaatc ccggcgaccc gaatgctgaa    480

atttataaag gggcaaaaga ctttttcaaa gagatagcta caagttttcc caatgactat    540

cacataatat atgaactttg taatgaaccg aatccaaatg agccgggagt agaaaatagc    600

ttggatggtt ggaaaaaggt aaaggcttat gcgcagtcca tcataaaaat gctcagaagt    660

ttggggaatc agaacattat aattgtaggt tcgcccaatt ggagccagag acctgacttt    720

gcaattcaag accctataaa cgacaaaaat gttatgtatt cagttcattt ttattctggt    780

actcacaagg ttgatgggta tgtttttgaa aacatgaaaa aggcatttga aaatggtgtg    840

ccaatttttg tgagtgaatg gggaacaagt ttggcaagcg gtgatggtgg accatatctt    900

gatgaggcgg ataagtggct tgagtattta aatgcaaact atattagctg ggtgaactgg    960

tcactgtcaa acaaaaatga aacatcagct gcttttgtcc cgtatgttag tggcatgcat   1020

gatgccacat cacttgatcc tggcgatgat aagatgtggg atataaaaga gctgagtata   1080

tctggagagt atgtgagagc aaggataaaa gggattgcat atgaaccaat tgagagggat   1140

agccaaataa aagaaggaca aagtgcacct ttgggtgaaa aagttttacc atccacgttt   1200

gaggatgaca cgcgccaggg ttgggactgg gatggtccgt ctggtgtgaa aggacctatt   1260

accatcgaaa gtgtaaatgg ttcaaaagcg ctatcttttg aggttgagta tccagagaaa   1320

aaaccgcaag atggctgggc aacagctgca agacttatac ttaaggaaat aaatgcgaag   1380

agagaagata ataagtatct tgcatttgac ttttatataa aaccagaaag agcgtcaaaa   1440

ggtgagattg agatattttt agctttttca ccaccttcct taggttactg ggctcaagta   1500

caagacagtt ttaatattga cctctcaaag ctttcaagtg caaaaaagac agaagagggg   1560

ctttacaaat tcaatgtatt ttttgactta gacaaaatcc aagatggcaa agtgctaaaa   1620

ccagacacga tcttgaggga tattataata gtcatagcag atgggaatag cgattttaaa   1680

ggaaaaatgt ttatagacaa tgttagattc accaacatcc tttttgaaga catcagcctt   1740

gagagcagcc tttatgatgc tgtctccaag ctttattcaa aaggaatcat aaaaggagct   1800

tcagctttta agtacttgcc tgacaagaac atcacaaggg ctgaatttgc tgcactatgt   1860

gtcagggcat tgaacctgaa aattgaaaag tatgacggca ggttttctga tgtaaaaagt   1920

gacacctggt attcggatgt ggtttatacg gcgtataaaa acggtttgtt tggacaggag   1980

aaaaatagat tcttccctga aaggattatg aaaagagaag aaacagcagc tttggcaatt   2040

gaagtgtaca aaagattgac aggtaaaata gaagttagca cagacgatat tcaaattgcc   2100

gatgaagggc ttataaatcc tcaatacaaa gaaagcgtga agttagctat tcagctcggt   2160

attattgacc tgtattcaga cggaaccttt gcaccaagta agagcgtttc gagaggggag   2220

gcagcaacaa ttttctataa catcttgaac ttagcaggca agctatga                2268

<210> 184
<211> 755
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (66)...(330)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (352)...(571)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> DOMAIN
<222> (575)...(617)
<223> S-layer homology domain

<220> 
<221> DOMAIN
<222> (635)...(676)
<223> S-layer homology domain

<220> 
<221> DOMAIN
<222> (699)...(742)
<223> S-layer homology domain

<220> 
<221> SITE
<222> (1)...(4)
<223> Tubulin-beta mRNA autoregulation signal. Prosite id = PS00228

<220> 
<221> SITE
<222> (184)...(193)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (236)...(239)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (331)...(334)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (432)...(435)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (619)...(622)
<223> N-glycosylation site. Prosite id = PS00001

<400> 184
Met Arg Glu Ile Ile Leu Lys Ser Gly Ala Leu Leu Met Val Val Ile 
1               5                   10                  15      


Leu Ile Val Ser Ile Leu Gln Ile Leu Thr Val Phe Ala Gln Ser Thr 
            20                  25                  30          


Pro Tyr Glu Asn Glu Lys Tyr Pro His Leu Leu Gly Asn Gln Ala Val 
        35                  40                  45              


Lys Lys Pro Ser Val Ala Gly Arg Leu Gln Ile Ile Glu Lys Asp Gly 
    50                  55                  60                  


Lys Lys Tyr Leu Ala Asp Gln Lys Gly Glu Ile Ile Gln Leu Arg Gly 
65                  70                  75                  80  


Met Ser Thr His Gly Leu Gln Trp Tyr Gly Asp Ile Val Asn Arg Asn 
                85                  90                  95      


Ala Phe Ala Ala Leu Ser Lys Asp Trp Gly Cys Asn Val Ile Arg Leu 
            100                 105                 110         


Ala Met Tyr Val Gly Glu Gly Gly Tyr Ala Ser Asn Pro Gly Ile Lys 
        115                 120                 125             


Glu Lys Val Val Lys Gly Ile Lys Leu Ala Ile Glu Asn Asp Met Tyr 
    130                 135                 140                 


Val Ile Val Asp Trp His Val Leu Asn Pro Gly Asp Pro Asn Ala Glu 
145                 150                 155                 160 


Ile Tyr Lys Gly Ala Lys Asp Phe Phe Lys Glu Ile Ala Thr Ser Phe 
                165                 170                 175     


Pro Asn Asp Tyr His Ile Ile Tyr Glu Leu Cys Asn Glu Pro Asn Pro 
            180                 185                 190         


Asn Glu Pro Gly Val Glu Asn Ser Leu Asp Gly Trp Lys Lys Val Lys 
        195                 200                 205             


Ala Tyr Ala Gln Ser Ile Ile Lys Met Leu Arg Ser Leu Gly Asn Gln 
    210                 215                 220                 


Asn Ile Ile Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Phe 
225                 230                 235                 240 


Ala Ile Gln Asp Pro Ile Asn Asp Lys Asn Val Met Tyr Ser Val His 
                245                 250                 255     


Phe Tyr Ser Gly Thr His Lys Val Asp Gly Tyr Val Phe Glu Asn Met 
            260                 265                 270         


Lys Lys Ala Phe Glu Asn Gly Val Pro Ile Phe Val Ser Glu Trp Gly 
        275                 280                 285             


Thr Ser Leu Ala Ser Gly Asp Gly Gly Pro Tyr Leu Asp Glu Ala Asp 
    290                 295                 300                 


Lys Trp Leu Glu Tyr Leu Asn Ala Asn Tyr Ile Ser Trp Val Asn Trp 
305                 310                 315                 320 


Ser Leu Ser Asn Lys Asn Glu Thr Ser Ala Ala Phe Val Pro Tyr Val 
                325                 330                 335     


Ser Gly Met His Asp Ala Thr Ser Leu Asp Pro Gly Asp Asp Lys Met 
            340                 345                 350         


Trp Asp Ile Lys Glu Leu Ser Ile Ser Gly Glu Tyr Val Arg Ala Arg 
        355                 360                 365             


Ile Lys Gly Ile Ala Tyr Glu Pro Ile Glu Arg Asp Ser Gln Ile Lys 
    370                 375                 380                 


Glu Gly Gln Ser Ala Pro Leu Gly Glu Lys Val Leu Pro Ser Thr Phe 
385                 390                 395                 400 


Glu Asp Asp Thr Arg Gln Gly Trp Asp Trp Asp Gly Pro Ser Gly Val 
                405                 410                 415     


Lys Gly Pro Ile Thr Ile Glu Ser Val Asn Gly Ser Lys Ala Leu Ser 
            420                 425                 430         


Phe Glu Val Glu Tyr Pro Glu Lys Lys Pro Gln Asp Gly Trp Ala Thr 
        435                 440                 445             


Ala Ala Arg Leu Ile Leu Lys Glu Ile Asn Ala Lys Arg Glu Asp Asn 
    450                 455                 460                 


Lys Tyr Leu Ala Phe Asp Phe Tyr Ile Lys Pro Glu Arg Ala Ser Lys 
465                 470                 475                 480 


Gly Glu Ile Glu Ile Phe Leu Ala Phe Ser Pro Pro Ser Leu Gly Tyr 
                485                 490                 495     


Trp Ala Gln Val Gln Asp Ser Phe Asn Ile Asp Leu Ser Lys Leu Ser 
            500                 505                 510         


Ser Ala Lys Lys Thr Glu Glu Gly Leu Tyr Lys Phe Asn Val Phe Phe 
        515                 520                 525             


Asp Leu Asp Lys Ile Gln Asp Gly Lys Val Leu Lys Pro Asp Thr Ile 
    530                 535                 540                 


Leu Arg Asp Ile Ile Ile Val Ile Ala Asp Gly Asn Ser Asp Phe Lys 
545                 550                 555                 560 


Gly Lys Met Phe Ile Asp Asn Val Arg Phe Thr Asn Ile Leu Phe Glu 
                565                 570                 575     


Asp Ile Ser Leu Glu Ser Ser Leu Tyr Asp Ala Val Ser Lys Leu Tyr 
            580                 585                 590         


Ser Lys Gly Ile Ile Lys Gly Ala Ser Ala Phe Lys Tyr Leu Pro Asp 
        595                 600                 605             


Lys Asn Ile Thr Arg Ala Glu Phe Ala Ala Leu Cys Val Arg Ala Leu 
    610                 615                 620                 


Asn Leu Lys Ile Glu Lys Tyr Asp Gly Arg Phe Ser Asp Val Lys Ser 
625                 630                 635                 640 


Asp Thr Trp Tyr Ser Asp Val Val Tyr Thr Ala Tyr Lys Asn Gly Leu 
                645                 650                 655     


Phe Gly Gln Glu Lys Asn Arg Phe Phe Pro Glu Arg Ile Met Lys Arg 
            660                 665                 670         


Glu Glu Thr Ala Ala Leu Ala Ile Glu Val Tyr Lys Arg Leu Thr Gly 
        675                 680                 685             


Lys Ile Glu Val Ser Thr Asp Asp Ile Gln Ile Ala Asp Glu Gly Leu 
    690                 695                 700                 


Ile Asn Pro Gln Tyr Lys Glu Ser Val Lys Leu Ala Ile Gln Leu Gly 
705                 710                 715                 720 


Ile Ile Asp Leu Tyr Ser Asp Gly Thr Phe Ala Pro Ser Lys Ser Val 
                725                 730                 735     


Ser Arg Gly Glu Ala Ala Thr Ile Phe Tyr Asn Ile Leu Asn Leu Ala 
            740                 745                 750         


Gly Lys Leu 
        755 


<210> 185
<211> 1128
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 185
atgagagaaa ttattttaaa gtctggtgca ctcttgatgg tagttatttt gattgtttct     60

attttgcaaa ttttaactgt gtttgcccag agcacaccat atgaaaatga aaaatatcca    120

caccttcttg gcaaccaagc ggtgaaaaaa ccatcagttg ctggcaggct ccagattatt    180

gaaaaggacg gtaaaaagta tttagctgac cagaaaggcg aaataattca gcttcgtggc    240

atgagcacgc atggactcca gtggtatggt gacattgtaa acagaaatgc gtttgctgct    300

ctttcaaaag attggggatg caatgttata aggcttgcga tgtatgtggg tgaaggtggt    360

tatgcttcaa atccaggcat taaagaaaaa gttgtaaagg gaataaaact tgcaattgaa    420

aatgacatgt atgtgattgt tgactggcat gtgttaaatc ccggcgaccc gaatgctgaa    480

atttataaag gggcaaaaga ctttttcaaa gagatagcta caagttttcc caatgactat    540

cacataatat atgaactttg taatgaaccg aatccaaatg agccgggagt agaaaatagc    600

ttggatggtt ggaaaaaggt aaaggcttat gcgcagtcca tcataaaaat gctcagaagt    660

ttggggaatc agaacattat aattgtaggt tcgcccaatt ggagccagag acctgacttt    720

gcaattcaag accctataaa cgacaaaaat gttatgtatt cagttcattt ttattctggt    780

actcacaagg ttgatgggta tgtttttgaa aacatgaaaa aggcatttga aaatggtgtg    840

ccaatttttg tgagtgaatg gggaacaagt ttggcaagcg gtgatggtgg accatatctt    900

gatgaggcgg ataagtggct tgagtattta aatgcaaact atattagctg ggtgaactgg    960

tcactgtcaa acaaaaatga aacatcagct gcttttgtcc cgtatgttag tggcatgcat   1020

gatgccacat cacttgatcc tggcgatgat aagatgtggg atataaaaga gctgagtata   1080

tctggagagt atgtgagagc aaggataaaa gggattgcat atgaacca                1128

<210> 186
<211> 376
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (66)...(330)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (1)...(4)
<223> Tubulin-beta mRNA autoregulation signal. Prosite id = PS00228

<220> 
<221> SITE
<222> (184)...(193)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (236)...(239)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (331)...(334)
<223> N-glycosylation site. Prosite id = PS00001

<400> 186
Met Arg Glu Ile Ile Leu Lys Ser Gly Ala Leu Leu Met Val Val Ile 
1               5                   10                  15      


Leu Ile Val Ser Ile Leu Gln Ile Leu Thr Val Phe Ala Gln Ser Thr 
            20                  25                  30          


Pro Tyr Glu Asn Glu Lys Tyr Pro His Leu Leu Gly Asn Gln Ala Val 
        35                  40                  45              


Lys Lys Pro Ser Val Ala Gly Arg Leu Gln Ile Ile Glu Lys Asp Gly 
    50                  55                  60                  


Lys Lys Tyr Leu Ala Asp Gln Lys Gly Glu Ile Ile Gln Leu Arg Gly 
65                  70                  75                  80  


Met Ser Thr His Gly Leu Gln Trp Tyr Gly Asp Ile Val Asn Arg Asn 
                85                  90                  95      


Ala Phe Ala Ala Leu Ser Lys Asp Trp Gly Cys Asn Val Ile Arg Leu 
            100                 105                 110         


Ala Met Tyr Val Gly Glu Gly Gly Tyr Ala Ser Asn Pro Gly Ile Lys 
        115                 120                 125             


Glu Lys Val Val Lys Gly Ile Lys Leu Ala Ile Glu Asn Asp Met Tyr 
    130                 135                 140                 


Val Ile Val Asp Trp His Val Leu Asn Pro Gly Asp Pro Asn Ala Glu 
145                 150                 155                 160 


Ile Tyr Lys Gly Ala Lys Asp Phe Phe Lys Glu Ile Ala Thr Ser Phe 
                165                 170                 175     


Pro Asn Asp Tyr His Ile Ile Tyr Glu Leu Cys Asn Glu Pro Asn Pro 
            180                 185                 190         


Asn Glu Pro Gly Val Glu Asn Ser Leu Asp Gly Trp Lys Lys Val Lys 
        195                 200                 205             


Ala Tyr Ala Gln Ser Ile Ile Lys Met Leu Arg Ser Leu Gly Asn Gln 
    210                 215                 220                 


Asn Ile Ile Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Phe 
225                 230                 235                 240 


Ala Ile Gln Asp Pro Ile Asn Asp Lys Asn Val Met Tyr Ser Val His 
                245                 250                 255     


Phe Tyr Ser Gly Thr His Lys Val Asp Gly Tyr Val Phe Glu Asn Met 
            260                 265                 270         


Lys Lys Ala Phe Glu Asn Gly Val Pro Ile Phe Val Ser Glu Trp Gly 
        275                 280                 285             


Thr Ser Leu Ala Ser Gly Asp Gly Gly Pro Tyr Leu Asp Glu Ala Asp 
    290                 295                 300                 


Lys Trp Leu Glu Tyr Leu Asn Ala Asn Tyr Ile Ser Trp Val Asn Trp 
305                 310                 315                 320 


Ser Leu Ser Asn Lys Asn Glu Thr Ser Ala Ala Phe Val Pro Tyr Val 
                325                 330                 335     


Ser Gly Met His Asp Ala Thr Ser Leu Asp Pro Gly Asp Asp Lys Met 
            340                 345                 350         


Trp Asp Ile Lys Glu Leu Ser Ile Ser Gly Glu Tyr Val Arg Ala Arg 
        355                 360                 365             


Ile Lys Gly Ile Ala Tyr Glu Pro 
    370                 375     


<210> 187
<211> 1446
<212> DNA
<213> Thermosphaera aggregans M11TL

<400> 187
ttgaaattcc ccaaagactt catgataggc tactcatctt caccgtttca atttgaagct     60

ggtattcccg ggtccgagga tccgaatagt gattggtggg tatgggtgca tgatccggag    120

aacacagcag ctggactagt cagcggcgat tttcccgaga acggcccagg ttactggaat    180

ttaaaccaaa atgaccacga cctggctgag aagctggggg ttaacactat tagagtaggc    240

gttgagtgga gtaggatttt tccaaagcca actttcaatg ttaaagtccc tgtagagaga    300

gatgagaacg gcagcattgt tcacgtagat gtcgatgata aagcggttga aagacttgat    360

gaattagcca acaaggaggc cgtaaaccat tacgtagaaa tgtataaaga ctgggttgaa    420

agaggtagaa aacttatact caatttatac cattggcccc tgcctctctg gcttcacaac    480

ccaatcatgg tgagaagaat gggcccggac agagcgccct caggctggct taacgaggag    540

tccgtggtgg agtttgccaa atacgccgca tacattgctt ggaaaatggg cgagctacct    600

gttatgtgga gcaccatgaa cgaacccaac gtcgtttatg agcaaggata catgttcgtt    660

aaagggggtt tcccacccgg ctacttgagt ttggaagctg ctgataaggc caggagaaat    720

atgatccagg ctcatgcacg ggcctatgac aatattaaac gcttcagtaa gaaacctgtt    780

ggactaatat acgctttcca atggttcgaa ctattagagg gtccagcaga agtatttgat    840

aagtttaaga gctctaagtt atactatttc acagacatag tatcgaaggg tagttcaatc    900

atcaatgttg aatacaggag agatcttgcc aataggctag actggttggg cgttaactac    960

tatagccgtt tagtctacaa aatcgtcgat gacaaaccta taatcctgca cgggtatgga   1020

ttcctttgta cacctggggg gatcagcccg gctgaaaatc cttgtagcga ttttgggtgg   1080

gaggtgtatc ctgaaggact ctacctactt ctaaaagaac tttacaaccg atacggggta   1140

gacttgatcg tgaccgagaa cggtgtttca gacagcaggg atgcgttgag accggcatac   1200

ctggtctcgc atgtttacag cgtatggaaa gccgctaacg agggcattcc cgtcaaaggc   1260

tacctccact ggagcttgac agacaattac gagtgggccc agggcttcag gcagaaattc   1320

ggtttagtca tggttgactt caaaactaag aaaaggtatc tccgcccaag cgccctagtg   1380

ttccgggaga tcgcaacgca taacggaata ccggatgagc tacagcatct tacactgatc   1440

cagtaa                                                              1446

<210> 188
<211> 481
<212> PRT
<213> Thermosphaera aggregans M11TL

<220> 
<221> DOMAIN
<222> (1)...(470)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (104)...(107)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (387)...(395)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 188
Met Lys Phe Pro Lys Asp Phe Met Ile Gly Tyr Ser Ser Ser Pro Phe 
1               5                   10                  15      


Gln Phe Glu Ala Gly Ile Pro Gly Ser Glu Asp Pro Asn Ser Asp Trp 
            20                  25                  30          


Trp Val Trp Val His Asp Pro Glu Asn Thr Ala Ala Gly Leu Val Ser 
        35                  40                  45              


Gly Asp Phe Pro Glu Asn Gly Pro Gly Tyr Trp Asn Leu Asn Gln Asn 
    50                  55                  60                  


Asp His Asp Leu Ala Glu Lys Leu Gly Val Asn Thr Ile Arg Val Gly 
65                  70                  75                  80  


Val Glu Trp Ser Arg Ile Phe Pro Lys Pro Thr Phe Asn Val Lys Val 
                85                  90                  95      


Pro Val Glu Arg Asp Glu Asn Gly Ser Ile Val His Val Asp Val Asp 
            100                 105                 110         


Asp Lys Ala Val Glu Arg Leu Asp Glu Leu Ala Asn Lys Glu Ala Val 
        115                 120                 125             


Asn His Tyr Val Glu Met Tyr Lys Asp Trp Val Glu Arg Gly Arg Lys 
    130                 135                 140                 


Leu Ile Leu Asn Leu Tyr His Trp Pro Leu Pro Leu Trp Leu His Asn 
145                 150                 155                 160 


Pro Ile Met Val Arg Arg Met Gly Pro Asp Arg Ala Pro Ser Gly Trp 
                165                 170                 175     


Leu Asn Glu Glu Ser Val Val Glu Phe Ala Lys Tyr Ala Ala Tyr Ile 
            180                 185                 190         


Ala Trp Lys Met Gly Glu Leu Pro Val Met Trp Ser Thr Met Asn Glu 
        195                 200                 205             


Pro Asn Val Val Tyr Glu Gln Gly Tyr Met Phe Val Lys Gly Gly Phe 
    210                 215                 220                 


Pro Pro Gly Tyr Leu Ser Leu Glu Ala Ala Asp Lys Ala Arg Arg Asn 
225                 230                 235                 240 


Met Ile Gln Ala His Ala Arg Ala Tyr Asp Asn Ile Lys Arg Phe Ser 
                245                 250                 255     


Lys Lys Pro Val Gly Leu Ile Tyr Ala Phe Gln Trp Phe Glu Leu Leu 
            260                 265                 270         


Glu Gly Pro Ala Glu Val Phe Asp Lys Phe Lys Ser Ser Lys Leu Tyr 
        275                 280                 285             


Tyr Phe Thr Asp Ile Val Ser Lys Gly Ser Ser Ile Ile Asn Val Glu 
    290                 295                 300                 


Tyr Arg Arg Asp Leu Ala Asn Arg Leu Asp Trp Leu Gly Val Asn Tyr 
305                 310                 315                 320 


Tyr Ser Arg Leu Val Tyr Lys Ile Val Asp Asp Lys Pro Ile Ile Leu 
                325                 330                 335     


His Gly Tyr Gly Phe Leu Cys Thr Pro Gly Gly Ile Ser Pro Ala Glu 
            340                 345                 350         


Asn Pro Cys Ser Asp Phe Gly Trp Glu Val Tyr Pro Glu Gly Leu Tyr 
        355                 360                 365             


Leu Leu Leu Lys Glu Leu Tyr Asn Arg Tyr Gly Val Asp Leu Ile Val 
    370                 375                 380                 


Thr Glu Asn Gly Val Ser Asp Ser Arg Asp Ala Leu Arg Pro Ala Tyr 
385                 390                 395                 400 


Leu Val Ser His Val Tyr Ser Val Trp Lys Ala Ala Asn Glu Gly Ile 
                405                 410                 415     


Pro Val Lys Gly Tyr Leu His Trp Ser Leu Thr Asp Asn Tyr Glu Trp 
            420                 425                 430         


Ala Gln Gly Phe Arg Gln Lys Phe Gly Leu Val Met Val Asp Phe Lys 
        435                 440                 445             


Thr Lys Lys Arg Tyr Leu Arg Pro Ser Ala Leu Val Phe Arg Glu Ile 
    450                 455                 460                 


Ala Thr His Asn Gly Ile Pro Asp Glu Leu Gln His Leu Thr Leu Ile 
465                 470                 475                 480 


Gln 
    


<210> 189
<211> 1107
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 189
atgaaaggct ttcgctggtg tgtactcgca gtattgatgc tggcggcaac gaatctacgc     60

gccgcctgta gctggcctgc ctgggaacag tttaaacagg actacatcag cgaaagcggg    120

cgcgttatcg atcccagtga cgcgcgaaaa attaccacct ctgaagggca aagctatgcg    180

ctgttctttg ccctggctgc aaacgatcgt aaagcgttcg atctgctgct ggcgtggacg    240

cgcgataatc tcgccggggg cgatttaacg gccaacctgc ctgcctggct gtgggggcaa    300

aaggacaaag agacctggac ggttatcgat cctaactccg cgtccgatgc tgatatctgg    360

attgcctggt cgctgctgga agcggggcgg ctgtggaaaa atcaggacta cacccgcaca    420

ggcaaggggc tgcttaaacg catcgtcagc gaggaagtgg tgaaagtgcc gggtctgggc    480

ttcatgctgc tgccgggtaa aaccggattt gccgaagaga acgcctggcg ctttaacccc    540

agctacctcc cgccacagct agcgaactat ttcacccgat tgggtacgcc gtggaccacg    600

cttcgtgaaa ccaatctgcg tttattgctg gagacggcac cgaaaggttt ctccccaaac    660

tgggtgcaat atcagaaaaa cagaggctgg cagctcagtc aggataaatc gctggtgggc    720

agttacgacg ccattcgcgt ttatctctgg gtgggcatga tgaatgataa agacccgcaa    780

aaggctcggc tgcttgcacg ctttaagcca atggcaacgg taacggcaaa gcagggcgta    840

ccgccggaga aagtcgatgt cgcaaccggt aaacggacgg ataaaggtcc ggtcggtttt    900

tctgcctctc tgctaccttt tttacagaat cgggacgcgc aggcagtgca acgacagcgc    960

gtcgccgatc attttcctga taataatgcc tattacagct atgtactgac cctctttggg   1020

caagggtggg atcagcatcg ttttcgcttc accacacagg gtgaattaat accgaattgg   1080

ggccaggaat gcgcaagttc acaataa                                       1107

<210> 190
<211> 368
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (2)...(346)
<223> Glycosyl hydrolases family 8

<400> 190
Met Lys Gly Phe Arg Trp Cys Val Leu Ala Val Leu Met Leu Ala Ala 
1               5                   10                  15      


Thr Asn Leu Arg Ala Ala Cys Ser Trp Pro Ala Trp Glu Gln Phe Lys 
            20                  25                  30          


Gln Asp Tyr Ile Ser Glu Ser Gly Arg Val Ile Asp Pro Ser Asp Ala 
        35                  40                  45              


Arg Lys Ile Thr Thr Ser Glu Gly Gln Ser Tyr Ala Leu Phe Phe Ala 
    50                  55                  60                  


Leu Ala Ala Asn Asp Arg Lys Ala Phe Asp Leu Leu Leu Ala Trp Thr 
65                  70                  75                  80  


Arg Asp Asn Leu Ala Gly Gly Asp Leu Thr Ala Asn Leu Pro Ala Trp 
                85                  90                  95      


Leu Trp Gly Gln Lys Asp Lys Glu Thr Trp Thr Val Ile Asp Pro Asn 
            100                 105                 110         


Ser Ala Ser Asp Ala Asp Ile Trp Ile Ala Trp Ser Leu Leu Glu Ala 
        115                 120                 125             


Gly Arg Leu Trp Lys Asn Gln Asp Tyr Thr Arg Thr Gly Lys Gly Leu 
    130                 135                 140                 


Leu Lys Arg Ile Val Ser Glu Glu Val Val Lys Val Pro Gly Leu Gly 
145                 150                 155                 160 


Phe Met Leu Leu Pro Gly Lys Thr Gly Phe Ala Glu Glu Asn Ala Trp 
                165                 170                 175     


Arg Phe Asn Pro Ser Tyr Leu Pro Pro Gln Leu Ala Asn Tyr Phe Thr 
            180                 185                 190         


Arg Leu Gly Thr Pro Trp Thr Thr Leu Arg Glu Thr Asn Leu Arg Leu 
        195                 200                 205             


Leu Leu Glu Thr Ala Pro Lys Gly Phe Ser Pro Asn Trp Val Gln Tyr 
    210                 215                 220                 


Gln Lys Asn Arg Gly Trp Gln Leu Ser Gln Asp Lys Ser Leu Val Gly 
225                 230                 235                 240 


Ser Tyr Asp Ala Ile Arg Val Tyr Leu Trp Val Gly Met Met Asn Asp 
                245                 250                 255     


Lys Asp Pro Gln Lys Ala Arg Leu Leu Ala Arg Phe Lys Pro Met Ala 
            260                 265                 270         


Thr Val Thr Ala Lys Gln Gly Val Pro Pro Glu Lys Val Asp Val Ala 
        275                 280                 285             


Thr Gly Lys Arg Thr Asp Lys Gly Pro Val Gly Phe Ser Ala Ser Leu 
    290                 295                 300                 


Leu Pro Phe Leu Gln Asn Arg Asp Ala Gln Ala Val Gln Arg Gln Arg 
305                 310                 315                 320 


Val Ala Asp His Phe Pro Asp Asn Asn Ala Tyr Tyr Ser Tyr Val Leu 
                325                 330                 335     


Thr Leu Phe Gly Gln Gly Trp Asp Gln His Arg Phe Arg Phe Thr Thr 
            340                 345                 350         


Gln Gly Glu Leu Ile Pro Asn Trp Gly Gln Glu Cys Ala Ser Ser Gln 
        355                 360                 365             


<210> 191
<211> 984
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 191
atgaaaaaaa ttacacgctg ctgtacattg atatgcgcag caatcatgct attaaactgt     60

agcagttcag ccaaaaacga aaacaaagaa acgtccaaaa caatagtcgg caaacatggg    120

aaactatcgg taaacggcac ttccctcgtt gatgaaaaag gagagaaaat tcaactacgg    180

ggcgttagtt acggatggca caacttttgg ccccgtttct acaatgcctc tacagtaaaa    240

gtattcgtag aagattggaa atgcagtgta ctccgggcag ctataggcgt agaggaacgg    300

agaggatata tagacaacac ggcagaagcc atccgttgtg cgacagttgt cgcagatgcg    360

gccatcgaac aaggcatcta tgtgatcatc gattggcaca gccatggcat acgcacagca    420

gaagccaaac aattcttcac gcaaatggcc aatcgctaca aaggccaccc gaacgtgatc    480

tacgaaatct ttaacgaacc ggtagaagat tcctgggaag atgtaaaagc atattccatc    540

gaaatcatac agaccatccg ggcgattgac ccggataata taatattagt cggcacccca    600

cactgggatc aagacataca tctggcagcc gatagcccga tcagcggatt caacaatctg    660

atgtacacgc tacacttcta tgcagccaca catcatcagg aacttagaga ccgtggaaat    720

tatgccattc aaaaaggtct tcctatattt gtatcagagt gtggaggaat ggaagcctca    780

ggcgacggcc caatcgacca cgccgaatgg gcggcctggt taagttggat ggataaaaac    840

aacatcagct gggcagcatg gtcaatagcc gataaggacg aaacctgctc catgatgaaa    900

aaaacagctt catcggaagg gccatggaca gagaatgagc tgaaagaatg ggggacaatg    960

tgccacaacg aaatctctaa atag                                           984

<210> 192
<211> 327
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(24)

<220> 
<221> DOMAIN
<222> (47)...(297)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (19)...(22)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (45)...(48)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (76)...(79)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (161)...(170)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (285)...(288)
<223> N-glycosylation site. Prosite id = PS00001

<400> 192
Met Lys Lys Ile Thr Arg Cys Cys Thr Leu Ile Cys Ala Ala Ile Met 
1               5                   10                  15      


Leu Leu Asn Cys Ser Ser Ser Ala Lys Asn Glu Asn Lys Glu Thr Ser 
            20                  25                  30          


Lys Thr Ile Val Gly Lys His Gly Lys Leu Ser Val Asn Gly Thr Ser 
        35                  40                  45              


Leu Val Asp Glu Lys Gly Glu Lys Ile Gln Leu Arg Gly Val Ser Tyr 
    50                  55                  60                  


Gly Trp His Asn Phe Trp Pro Arg Phe Tyr Asn Ala Ser Thr Val Lys 
65                  70                  75                  80  


Val Phe Val Glu Asp Trp Lys Cys Ser Val Leu Arg Ala Ala Ile Gly 
                85                  90                  95      


Val Glu Glu Arg Arg Gly Tyr Ile Asp Asn Thr Ala Glu Ala Ile Arg 
            100                 105                 110         


Cys Ala Thr Val Val Ala Asp Ala Ala Ile Glu Gln Gly Ile Tyr Val 
        115                 120                 125             


Ile Ile Asp Trp His Ser His Gly Ile Arg Thr Ala Glu Ala Lys Gln 
    130                 135                 140                 


Phe Phe Thr Gln Met Ala Asn Arg Tyr Lys Gly His Pro Asn Val Ile 
145                 150                 155                 160 


Tyr Glu Ile Phe Asn Glu Pro Val Glu Asp Ser Trp Glu Asp Val Lys 
                165                 170                 175     


Ala Tyr Ser Ile Glu Ile Ile Gln Thr Ile Arg Ala Ile Asp Pro Asp 
            180                 185                 190         


Asn Ile Ile Leu Val Gly Thr Pro His Trp Asp Gln Asp Ile His Leu 
        195                 200                 205             


Ala Ala Asp Ser Pro Ile Ser Gly Phe Asn Asn Leu Met Tyr Thr Leu 
    210                 215                 220                 


His Phe Tyr Ala Ala Thr His His Gln Glu Leu Arg Asp Arg Gly Asn 
225                 230                 235                 240 


Tyr Ala Ile Gln Lys Gly Leu Pro Ile Phe Val Ser Glu Cys Gly Gly 
                245                 250                 255     


Met Glu Ala Ser Gly Asp Gly Pro Ile Asp His Ala Glu Trp Ala Ala 
            260                 265                 270         


Trp Leu Ser Trp Met Asp Lys Asn Asn Ile Ser Trp Ala Ala Trp Ser 
        275                 280                 285             


Ile Ala Asp Lys Asp Glu Thr Cys Ser Met Met Lys Lys Thr Ala Ser 
    290                 295                 300                 


Ser Glu Gly Pro Trp Thr Glu Asn Glu Leu Lys Glu Trp Gly Thr Met 
305                 310                 315                 320 


Cys His Asn Glu Ile Ser Lys 
                325         


<210> 193
<211> 993
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 193
atgcgaaaac ccgcatgcgc caccctggct gtcatgatga gtttgctgtt cactcctttc     60

tctcaggcag gtcaggcctg ggagagttac aaagcgcgct tctttaagcc ggatgggcga    120

atcgtcgata ccggcaacgg taacgtttcc cataccgaag gccagggttt tgccatgctg    180

atggcggtgg caaacgatga taaagcgacc tttgataaac tctggcactg gacgagcagt    240

acgctgaaga ataaagagaa cggtcttttt tactggcgtt ataatcccgc tcaggcagac    300

ccgattgccg acaaaaacaa tgcctccgat ggcgatgtgc tcattgcctg ggctctgtta    360

aaagcgaacg cgcgctggca tgacaaaggc tacagcacgg catcggatgc cattaccaaa    420

gcgctgcttg cccataacgt tatccgctat gcgggttacc gcgtgatggt gcccggctca    480

cacgggttta agcaggacaa taacgttgtg cttaatcctt cctacttcgt atttcctgcc    540

tggcaggcat ttgctgagcg cagccatttg caaatatggc gacagctcgc gcaggacgga    600

cagcggctgc tgaagaaaat ggggacgggt aaagccaatc tgccaactga ctgggtctct    660

cttgacacga aagggacgct ggcccccgca aacgcctggc cgccccgcat gagttatgac    720

gccatccgca ttccgctgta catcagctgg tccaacgcga aaagcccctt gctgaccccg    780

tggcgcgcct ggttcgctca gtttccgcgt gaacaaacac ccgcatgggt taacgtcacg    840

acgaatgaat acgcccccta catgatggct ggcggtctgc tggcagtgcg tgatttaacg    900

atgggacaga gggtcggcga gcccgacatt actgcaaatg atgattacta ttcggcaagt    960

ctgaaaatgc tggtgtggat ctcagaacaa taa                                 993

<210> 194
<211> 330
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (1)...(329)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (48)...(51)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (108)...(111)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (109)...(127)
<223> Glycosyl hydrolases family 8 signature. Prosite id = PS00812

<220> 
<221> SITE
<222> (282)...(285)
<223> N-glycosylation site. Prosite id = PS00001

<400> 194
Met Arg Lys Pro Ala Cys Ala Thr Leu Ala Val Met Met Ser Leu Leu 
1               5                   10                  15      


Phe Thr Pro Phe Ser Gln Ala Gly Gln Ala Trp Glu Ser Tyr Lys Ala 
            20                  25                  30          


Arg Phe Phe Lys Pro Asp Gly Arg Ile Val Asp Thr Gly Asn Gly Asn 
        35                  40                  45              


Val Ser His Thr Glu Gly Gln Gly Phe Ala Met Leu Met Ala Val Ala 
    50                  55                  60                  


Asn Asp Asp Lys Ala Thr Phe Asp Lys Leu Trp His Trp Thr Ser Ser 
65                  70                  75                  80  


Thr Leu Lys Asn Lys Glu Asn Gly Leu Phe Tyr Trp Arg Tyr Asn Pro 
                85                  90                  95      


Ala Gln Ala Asp Pro Ile Ala Asp Lys Asn Asn Ala Ser Asp Gly Asp 
            100                 105                 110         


Val Leu Ile Ala Trp Ala Leu Leu Lys Ala Asn Ala Arg Trp His Asp 
        115                 120                 125             


Lys Gly Tyr Ser Thr Ala Ser Asp Ala Ile Thr Lys Ala Leu Leu Ala 
    130                 135                 140                 


His Asn Val Ile Arg Tyr Ala Gly Tyr Arg Val Met Val Pro Gly Ser 
145                 150                 155                 160 


His Gly Phe Lys Gln Asp Asn Asn Val Val Leu Asn Pro Ser Tyr Phe 
                165                 170                 175     


Val Phe Pro Ala Trp Gln Ala Phe Ala Glu Arg Ser His Leu Gln Ile 
            180                 185                 190         


Trp Arg Gln Leu Ala Gln Asp Gly Gln Arg Leu Leu Lys Lys Met Gly 
        195                 200                 205             


Thr Gly Lys Ala Asn Leu Pro Thr Asp Trp Val Ser Leu Asp Thr Lys 
    210                 215                 220                 


Gly Thr Leu Ala Pro Ala Asn Ala Trp Pro Pro Arg Met Ser Tyr Asp 
225                 230                 235                 240 


Ala Ile Arg Ile Pro Leu Tyr Ile Ser Trp Ser Asn Ala Lys Ser Pro 
                245                 250                 255     


Leu Leu Thr Pro Trp Arg Ala Trp Phe Ala Gln Phe Pro Arg Glu Gln 
            260                 265                 270         


Thr Pro Ala Trp Val Asn Val Thr Thr Asn Glu Tyr Ala Pro Tyr Met 
        275                 280                 285             


Met Ala Gly Gly Leu Leu Ala Val Arg Asp Leu Thr Met Gly Gln Arg 
    290                 295                 300                 


Val Gly Glu Pro Asp Ile Thr Ala Asn Asp Asp Tyr Tyr Ser Ala Ser 
305                 310                 315                 320 


Leu Lys Met Leu Val Trp Ile Ser Glu Gln 
                325                 330 


<210> 195
<211> 1545
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 195
atgaaacgtt ccatctctgt cttcatcgcc tgttttatgg tagcggcgct tggcatcagc     60

ggtatcattg caccgaaagc ggctgccgct tctaaaacac ccgttgctgt aaacggacag    120

cttaccttaa aaggtacgca gctcgtcaat caaaacggaa aagcggttca gctgaaagga    180

atcagctccc acggtctaca gtggtatggc gattatgtca acaaagactc gttaaaatgg    240

ctgagagacg actggggcat caatgtcttc cgcgcggcca tgtatacagc tgaaggcggc    300

tatattgaca atccgtcggt taaaaacaaa gtgaaggaag ccgtcgaagc ggcaaaagaa    360

ctcggaatct atgtgatcat tgactggcac atactgagcg atggcaatcc aaaccaaaac    420

aaagcgaaag caaaagaatt ttttaacgaa atgtcaagac tttatggcaa gacgccaaac    480

gtcatttttg aaattgccaa cgagccgaac ggcgatgtca actggaatcg cgacattaaa    540

ccttacgccg aagaaatcct gtccgtgatt cgcaaaaact ctccgaaaaa tattgtgatt    600

gtcggaacag gcacctggag ccaggatgtc aatgatgcgg cggacaatca gctgaaagac    660

ggcaatgtca tgtacgcgct ccatttttat gcgggcacgc acggtcagtc tttgcgggat    720

aaagccgatt atgcactcag caaaggagcg ccgattttcg tcacagaatg gggaacgagc    780

gatgcttcag gaaacggcgg ggtctacctt gaccaatcca gggagtggct gaaatattta    840

gacagcaaaa aaatcagctg ggtaaactgg aacttatccg acaaacaaga gtcgtcagca    900

gctttaaacc caggcgcctc taaaaacgga ggatggtcgc aatccgactt gtccccatca    960

ggcaaattcg tcagggataa catccgcagc gggtcaaacg gttcgtcagg agactctgga   1020

tcgaattcga aagggtcaga tcaaaaagac caaaaaaagg atcaggataa accaggtcaa   1080

gacagcggcg ctgcagccaa cacgatagca gtacaataca gagcggggga caacaatgta   1140

aacggcaacc aaatccgccc tcagctcaac attaaaaaca acagcaaaaa aaccgtgtct   1200

ttaaatcgaa tcaccgtccg ctactggtat aaaacgaatc gcaaaggaca aaattttgac   1260

tgcgactatg cccaaatcgg ctgcagcaaa atcacgcaca aattcgttca attaaaaaaa   1320

gcggtaaacg gagcagacac gtatcttgaa gtaggattta aaaatggtac attggcgccg   1380

ggggctgata ctggcgaaat ccagatccgt cttcacaatg acggctggag caattatgcc   1440

caaagcggcg actattcatt ttttaattca aacacgttta aaaatacgaa aaaaatcacg   1500

ttgtatgaga acggaaagct gatttggggc actgaaccta aataa                   1545

<210> 196
<211> 514
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (46)...(300)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (371)...(452)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (163)...(172)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (295)...(298)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (338)...(341)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (399)...(402)
<223> N-glycosylation site. Prosite id = PS00001

<400> 196
Met Lys Arg Ser Ile Ser Val Phe Ile Ala Cys Phe Met Val Ala Ala 
1               5                   10                  15      


Leu Gly Ile Ser Gly Ile Ile Ala Pro Lys Ala Ala Ala Ala Ser Lys 
            20                  25                  30          


Thr Pro Val Ala Val Asn Gly Gln Leu Thr Leu Lys Gly Thr Gln Leu 
        35                  40                  45              


Val Asn Gln Asn Gly Lys Ala Val Gln Leu Lys Gly Ile Ser Ser His 
    50                  55                  60                  


Gly Leu Gln Trp Tyr Gly Asp Tyr Val Asn Lys Asp Ser Leu Lys Trp 
65                  70                  75                  80  


Leu Arg Asp Asp Trp Gly Ile Asn Val Phe Arg Ala Ala Met Tyr Thr 
                85                  90                  95      


Ala Glu Gly Gly Tyr Ile Asp Asn Pro Ser Val Lys Asn Lys Val Lys 
            100                 105                 110         


Glu Ala Val Glu Ala Ala Lys Glu Leu Gly Ile Tyr Val Ile Ile Asp 
        115                 120                 125             


Trp His Ile Leu Ser Asp Gly Asn Pro Asn Gln Asn Lys Ala Lys Ala 
    130                 135                 140                 


Lys Glu Phe Phe Asn Glu Met Ser Arg Leu Tyr Gly Lys Thr Pro Asn 
145                 150                 155                 160 


Val Ile Phe Glu Ile Ala Asn Glu Pro Asn Gly Asp Val Asn Trp Asn 
                165                 170                 175     


Arg Asp Ile Lys Pro Tyr Ala Glu Glu Ile Leu Ser Val Ile Arg Lys 
            180                 185                 190         


Asn Ser Pro Lys Asn Ile Val Ile Val Gly Thr Gly Thr Trp Ser Gln 
        195                 200                 205             


Asp Val Asn Asp Ala Ala Asp Asn Gln Leu Lys Asp Gly Asn Val Met 
    210                 215                 220                 


Tyr Ala Leu His Phe Tyr Ala Gly Thr His Gly Gln Ser Leu Arg Asp 
225                 230                 235                 240 


Lys Ala Asp Tyr Ala Leu Ser Lys Gly Ala Pro Ile Phe Val Thr Glu 
                245                 250                 255     


Trp Gly Thr Ser Asp Ala Ser Gly Asn Gly Gly Val Tyr Leu Asp Gln 
            260                 265                 270         


Ser Arg Glu Trp Leu Lys Tyr Leu Asp Ser Lys Lys Ile Ser Trp Val 
        275                 280                 285             


Asn Trp Asn Leu Ser Asp Lys Gln Glu Ser Ser Ala Ala Leu Asn Pro 
    290                 295                 300                 


Gly Ala Ser Lys Asn Gly Gly Trp Ser Gln Ser Asp Leu Ser Pro Ser 
305                 310                 315                 320 


Gly Lys Phe Val Arg Asp Asn Ile Arg Ser Gly Ser Asn Gly Ser Ser 
                325                 330                 335     


Gly Asp Ser Gly Ser Asn Ser Lys Gly Ser Asp Gln Lys Asp Gln Lys 
            340                 345                 350         


Lys Asp Gln Asp Lys Pro Gly Gln Asp Ser Gly Ala Ala Ala Asn Thr 
        355                 360                 365             


Ile Ala Val Gln Tyr Arg Ala Gly Asp Asn Asn Val Asn Gly Asn Gln 
    370                 375                 380                 


Ile Arg Pro Gln Leu Asn Ile Lys Asn Asn Ser Lys Lys Thr Val Ser 
385                 390                 395                 400 


Leu Asn Arg Ile Thr Val Arg Tyr Trp Tyr Lys Thr Asn Arg Lys Gly 
                405                 410                 415     


Gln Asn Phe Asp Cys Asp Tyr Ala Gln Ile Gly Cys Ser Lys Ile Thr 
            420                 425                 430         


His Lys Phe Val Gln Leu Lys Lys Ala Val Asn Gly Ala Asp Thr Tyr 
        435                 440                 445             


Leu Glu Val Gly Phe Lys Asn Gly Thr Leu Ala Pro Gly Ala Asp Thr 
    450                 455                 460                 


Gly Glu Ile Gln Ile Arg Leu His Asn Asp Gly Trp Ser Asn Tyr Ala 
465                 470                 475                 480 


Gln Ser Gly Asp Tyr Ser Phe Phe Asn Ser Asn Thr Phe Lys Asn Thr 
                485                 490                 495     


Lys Lys Ile Thr Leu Tyr Glu Asn Gly Lys Leu Ile Trp Gly Thr Glu 
            500                 505                 510         


Pro Lys 
        


<210> 197
<211> 972
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 197
atggaacagt cagttgctga aagtgatagc aactcagcat ttgaatacaa caaaatggta     60

ggtaaaggag taaatattgg aaatgcttta gaagctcctt tcgaaggagc ttggggagta    120

agaattgagg atgaatattt tgagataata aagaaaaggg gatttgattc tgttaggatt    180

cccataagat ggtcagcaca tatatccgaa aagccaccat atgatattga caggaatttc    240

ctcgaaagag ttaaccatgt tgtcgatagg gctcttgaga ataatttaac agtaatcatc    300

aatacgcacc attttgaaga actctatcaa gaaccggata aatacggcga tgttttggtg    360

gaaatttgga gacagattgc aaaattcttt aaagattacc cggaaaatct gttctttgaa    420

atctacaacg agcctgctca gaacttgaca gctgaaaaat ggaacgcact ttatccaaaa    480

gtgctcaaag ttatcaggga gagcaatcca acccggattg tcattatcga tgctccaaac    540

tgggcacact atagcgcagt gagaagtcta aaattagtca acgacaaacg catcattgtt    600

tccttccatt actacgaacc tttcaaattc acacatcagg gtgccgaatg ggttaatccc    660

atcccacctg ttagggttaa gtggaatggc gaggaatggg aaattaacca aatcagaagt    720

catttcaaat acgtgagtga ctgggcaaag caaaataacg taccaatctt tcttggtgaa    780

ttcggtgctt attcaaaagc agacatggac tcaagggtta agtggaccga aagtgtgaga    840

aaaatggcgg aagaatttgg attttcatac gcgtattggg aattttgtgc aggatttggc    900

atatacgata gatggtctca aaactggatc gaaccattgg caacagctgt ggttggcaca    960

ggcaaagagt aa                                                        972

<210> 198
<211> 323
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (27)...(303)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (96)...(99)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (150)...(153)
<223> N-glycosylation site. Prosite id = PS00001

<400> 198
Met Glu Gln Ser Val Ala Glu Ser Asp Ser Asn Ser Ala Phe Glu Tyr 
1               5                   10                  15      


Asn Lys Met Val Gly Lys Gly Val Asn Ile Gly Asn Ala Leu Glu Ala 
            20                  25                  30          


Pro Phe Glu Gly Ala Trp Gly Val Arg Ile Glu Asp Glu Tyr Phe Glu 
        35                  40                  45              


Ile Ile Lys Lys Arg Gly Phe Asp Ser Val Arg Ile Pro Ile Arg Trp 
    50                  55                  60                  


Ser Ala His Ile Ser Glu Lys Pro Pro Tyr Asp Ile Asp Arg Asn Phe 
65                  70                  75                  80  


Leu Glu Arg Val Asn His Val Val Asp Arg Ala Leu Glu Asn Asn Leu 
                85                  90                  95      


Thr Val Ile Ile Asn Thr His His Phe Glu Glu Leu Tyr Gln Glu Pro 
            100                 105                 110         


Asp Lys Tyr Gly Asp Val Leu Val Glu Ile Trp Arg Gln Ile Ala Lys 
        115                 120                 125             


Phe Phe Lys Asp Tyr Pro Glu Asn Leu Phe Phe Glu Ile Tyr Asn Glu 
    130                 135                 140                 


Pro Ala Gln Asn Leu Thr Ala Glu Lys Trp Asn Ala Leu Tyr Pro Lys 
145                 150                 155                 160 


Val Leu Lys Val Ile Arg Glu Ser Asn Pro Thr Arg Ile Val Ile Ile 
                165                 170                 175     


Asp Ala Pro Asn Trp Ala His Tyr Ser Ala Val Arg Ser Leu Lys Leu 
            180                 185                 190         


Val Asn Asp Lys Arg Ile Ile Val Ser Phe His Tyr Tyr Glu Pro Phe 
        195                 200                 205             


Lys Phe Thr His Gln Gly Ala Glu Trp Val Asn Pro Ile Pro Pro Val 
    210                 215                 220                 


Arg Val Lys Trp Asn Gly Glu Glu Trp Glu Ile Asn Gln Ile Arg Ser 
225                 230                 235                 240 


His Phe Lys Tyr Val Ser Asp Trp Ala Lys Gln Asn Asn Val Pro Ile 
                245                 250                 255     


Phe Leu Gly Glu Phe Gly Ala Tyr Ser Lys Ala Asp Met Asp Ser Arg 
            260                 265                 270         


Val Lys Trp Thr Glu Ser Val Arg Lys Met Ala Glu Glu Phe Gly Phe 
        275                 280                 285             


Ser Tyr Ala Tyr Trp Glu Phe Cys Ala Gly Phe Gly Ile Tyr Asp Arg 
    290                 295                 300                 


Trp Ser Gln Asn Trp Ile Glu Pro Leu Ala Thr Ala Val Val Gly Thr 
305                 310                 315                 320 


Gly Lys Glu 
            


<210> 199
<211> 978
<212> DNA
<213> Aquifex aeolicus

<400> 199
gtgaagttct ttactgttct tttgtttttc ctttcattcg ttttttcggc gagtattgac     60

gtgtggaaat tgtgggaaca ttataaaaag acctttataa gtaaagaggg gtacgtggta    120

gatccttaca acaattacag ggttacttcg gaagctcaag gctatacact tctcatatcc    180

gccctcatag gggataagga aaccttctac agggtatgga actggacaaa ggaaaatctc    240

aaaaggaagg ataatttatt ctcctggctc tggataaacg gacacgtagt tgacagaaac    300

aacgcaacgg acgccgacct gttcatagct tacgctctgc taatcgcttc tcaaaagtgg    360

aaggattata cgctcctgag cgaggcaaag aggataaagg attccgtcaa ggaactcgtt    420

gtgcccgtgt gcaacggaag aagggattac ctttttatac ccgcaaagga aggttacatt    480

aaaaacaata tagtgagttt aaacgtagtc tactacgtcc cattcatctt cagaaagttc    540

tacgaatctt tcggagagga cgtgtggaaa aacctctata ggtacaccta cgacatttat    600

acgattagga atatatcaac gcaccttaca tacgacctat tcaaaaagga attgagaaag    660

gggaatttta tagacataga cggcatgcgt ttcctgatat acgcttacgt ggacgacaaa    720

aggagtctcc tttacatgag gaacgcagta gagggtattc tgaagtttta cagagaaaag    780

ggttacattc ctttgaagta taattacgta accggtggag caagcaagtt aaaagctccc    840

ttttgttttt actacgtatt cagtaagctc cttccttccg ataaaaactt agaaaaggag    900

ttcagaaatg ggcttgagta cgacaagaag aattactact gttacgctct gcttcttatt    960

gctctcctac acgattag                                                  978

<210> 200
<211> 325
<212> PRT
<213> Aquifex aeolicus

<220> 
<221> SIGNAL
<222> (1)...(16)

<220> 
<221> DOMAIN
<222> (1)...(324)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (75)...(78)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (102)...(105)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (103)...(121)
<223> Glycosyl hydrolases family 8 signature. Prosite id = PS00812

<220> 
<221> SITE
<222> (207)...(210)
<223> N-glycosylation site. Prosite id = PS00001

<400> 200
Met Lys Phe Phe Thr Val Leu Leu Phe Phe Leu Ser Phe Val Phe Ser 
1               5                   10                  15      


Ala Ser Ile Asp Val Trp Lys Leu Trp Glu His Tyr Lys Lys Thr Phe 
            20                  25                  30          


Ile Ser Lys Glu Gly Tyr Val Val Asp Pro Tyr Asn Asn Tyr Arg Val 
        35                  40                  45              


Thr Ser Glu Ala Gln Gly Tyr Thr Leu Leu Ile Ser Ala Leu Ile Gly 
    50                  55                  60                  


Asp Lys Glu Thr Phe Tyr Arg Val Trp Asn Trp Thr Lys Glu Asn Leu 
65                  70                  75                  80  


Lys Arg Lys Asp Asn Leu Phe Ser Trp Leu Trp Ile Asn Gly His Val 
                85                  90                  95      


Val Asp Arg Asn Asn Ala Thr Asp Ala Asp Leu Phe Ile Ala Tyr Ala 
            100                 105                 110         


Leu Leu Ile Ala Ser Gln Lys Trp Lys Asp Tyr Thr Leu Leu Ser Glu 
        115                 120                 125             


Ala Lys Arg Ile Lys Asp Ser Val Lys Glu Leu Val Val Pro Val Cys 
    130                 135                 140                 


Asn Gly Arg Arg Asp Tyr Leu Phe Ile Pro Ala Lys Glu Gly Tyr Ile 
145                 150                 155                 160 


Lys Asn Asn Ile Val Ser Leu Asn Val Val Tyr Tyr Val Pro Phe Ile 
                165                 170                 175     


Phe Arg Lys Phe Tyr Glu Ser Phe Gly Glu Asp Val Trp Lys Asn Leu 
            180                 185                 190         


Tyr Arg Tyr Thr Tyr Asp Ile Tyr Thr Ile Arg Asn Ile Ser Thr His 
        195                 200                 205             


Leu Thr Tyr Asp Leu Phe Lys Lys Glu Leu Arg Lys Gly Asn Phe Ile 
    210                 215                 220                 


Asp Ile Asp Gly Met Arg Phe Leu Ile Tyr Ala Tyr Val Asp Asp Lys 
225                 230                 235                 240 


Arg Ser Leu Leu Tyr Met Arg Asn Ala Val Glu Gly Ile Leu Lys Phe 
                245                 250                 255     


Tyr Arg Glu Lys Gly Tyr Ile Pro Leu Lys Tyr Asn Tyr Val Thr Gly 
            260                 265                 270         


Gly Ala Ser Lys Leu Lys Ala Pro Phe Cys Phe Tyr Tyr Val Phe Ser 
        275                 280                 285             


Lys Leu Leu Pro Ser Asp Lys Asn Leu Glu Lys Glu Phe Arg Asn Gly 
    290                 295                 300                 


Leu Glu Tyr Asp Lys Lys Asn Tyr Tyr Cys Tyr Ala Leu Leu Leu Ile 
305                 310                 315                 320 


Ala Leu Leu His Asp 
                325 


<210> 201
<211> 1311
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 201
atgtttccac gtctttcacc aagccgcttc aggcaagtta ccttaacctt gctcacgctc     60

ggccttgtgt cactgaccgg ttgtgcaggt aacagcaagc cggatgcaga caccagtact    120

gctggtgccg ttgctaccgg cgagtaccgc aatctgtttg ccgaaatcgg aaaaagcgaa    180

atagacatcc agcgcaaaat tgacgaggcg tttcagcact tgttttatgg cgacgcgaaa    240

gatgcagctg tctactatca agcgggtgga aacgagaatg gtccactcgc atatgtttac    300

gatgtgaaca gcaatgacgt gcgctcagaa ggcatgagct acggcatgat gattactgtt    360

caaatggaca aaaaagccga gttcgatgca atctggaact gggcgaaaac ctatatgtat    420

caagactccc ccacgcatcc agcgtttggt tactttgcct ggtccatgcg ccgcgatggt    480

gtcgccaatg acgatatgcc agcgccagat ggcgaggaat atttcgtgac cgctctctat    540

ttcgccgccg cccgctgggg taatggcgaa ggtattttca actaccaaca ggaagcggac    600

accattttga gccgcatgcg ccaccgccag gtgatcaccg gcccaaccaa tcgcggagta    660

atgactgcga ccaatctgtt ccacccggaa gaggcgcaag tgcgcttcac gcccgacatc    720

aataatgctg atcatacaga cgcgtcttac catctgccct cgttctatga aatttgggca    780

cgtgtcgcgc cgcaagaaga tcgcgcgttt tgggccaaag cggccgatgt gagccgcgac    840

tattttgcca aagccgccca ccctgtcact gcgttaacac cggactacgg taattttgat    900

ggcaccccgt gggcggcatc ctggcggccg gagtcggtag attttcgata cgatgcctgg    960

cgttccgtca tgaactggtc catggactat gcctggtggg gcaaagattc aggcgcacct   1020

gcgcgcagtg ataaattact cgcgttcttc gaaacccagg aaggcaaaat gaaccacctc   1080

tatagcctgg atggcaaacc gctgggtggt ggaccgaccc tcggcctaat ttccatgaat   1140

gcaacggcag ctatggcagc tactgatccc cgctggcaca attttgtgga aaagctctgg   1200

caacaacaac cccccacagg gcaataccgg tactacgacg gtgttctata cctgatggcg   1260

ctgctacatt gcgctgggga gtacaaagcg tggatccccg acggggaata a            1311

<210> 202
<211> 436
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (67)...(426)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (385)...(388)
<223> N-glycosylation site. Prosite id = PS00001

<400> 202
Met Phe Pro Arg Leu Ser Pro Ser Arg Phe Arg Gln Val Thr Leu Thr 
1               5                   10                  15      


Leu Leu Thr Leu Gly Leu Val Ser Leu Thr Gly Cys Ala Gly Asn Ser 
            20                  25                  30          


Lys Pro Asp Ala Asp Thr Ser Thr Ala Gly Ala Val Ala Thr Gly Glu 
        35                  40                  45              


Tyr Arg Asn Leu Phe Ala Glu Ile Gly Lys Ser Glu Ile Asp Ile Gln 
    50                  55                  60                  


Arg Lys Ile Asp Glu Ala Phe Gln His Leu Phe Tyr Gly Asp Ala Lys 
65                  70                  75                  80  


Asp Ala Ala Val Tyr Tyr Gln Ala Gly Gly Asn Glu Asn Gly Pro Leu 
                85                  90                  95      


Ala Tyr Val Tyr Asp Val Asn Ser Asn Asp Val Arg Ser Glu Gly Met 
            100                 105                 110         


Ser Tyr Gly Met Met Ile Thr Val Gln Met Asp Lys Lys Ala Glu Phe 
        115                 120                 125             


Asp Ala Ile Trp Asn Trp Ala Lys Thr Tyr Met Tyr Gln Asp Ser Pro 
    130                 135                 140                 


Thr His Pro Ala Phe Gly Tyr Phe Ala Trp Ser Met Arg Arg Asp Gly 
145                 150                 155                 160 


Val Ala Asn Asp Asp Met Pro Ala Pro Asp Gly Glu Glu Tyr Phe Val 
                165                 170                 175     


Thr Ala Leu Tyr Phe Ala Ala Ala Arg Trp Gly Asn Gly Glu Gly Ile 
            180                 185                 190         


Phe Asn Tyr Gln Gln Glu Ala Asp Thr Ile Leu Ser Arg Met Arg His 
        195                 200                 205             


Arg Gln Val Ile Thr Gly Pro Thr Asn Arg Gly Val Met Thr Ala Thr 
    210                 215                 220                 


Asn Leu Phe His Pro Glu Glu Ala Gln Val Arg Phe Thr Pro Asp Ile 
225                 230                 235                 240 


Asn Asn Ala Asp His Thr Asp Ala Ser Tyr His Leu Pro Ser Phe Tyr 
                245                 250                 255     


Glu Ile Trp Ala Arg Val Ala Pro Gln Glu Asp Arg Ala Phe Trp Ala 
            260                 265                 270         


Lys Ala Ala Asp Val Ser Arg Asp Tyr Phe Ala Lys Ala Ala His Pro 
        275                 280                 285             


Val Thr Ala Leu Thr Pro Asp Tyr Gly Asn Phe Asp Gly Thr Pro Trp 
    290                 295                 300                 


Ala Ala Ser Trp Arg Pro Glu Ser Val Asp Phe Arg Tyr Asp Ala Trp 
305                 310                 315                 320 


Arg Ser Val Met Asn Trp Ser Met Asp Tyr Ala Trp Trp Gly Lys Asp 
                325                 330                 335     


Ser Gly Ala Pro Ala Arg Ser Asp Lys Leu Leu Ala Phe Phe Glu Thr 
            340                 345                 350         


Gln Glu Gly Lys Met Asn His Leu Tyr Ser Leu Asp Gly Lys Pro Leu 
        355                 360                 365             


Gly Gly Gly Pro Thr Leu Gly Leu Ile Ser Met Asn Ala Thr Ala Ala 
    370                 375                 380                 


Met Ala Ala Thr Asp Pro Arg Trp His Asn Phe Val Glu Lys Leu Trp 
385                 390                 395                 400 


Gln Gln Gln Pro Pro Thr Gly Gln Tyr Arg Tyr Tyr Asp Gly Val Leu 
                405                 410                 415     


Tyr Leu Met Ala Leu Leu His Cys Ala Gly Glu Tyr Lys Ala Trp Ile 
            420                 425                 430         


Pro Asp Gly Glu 
        435     


<210> 203
<211> 990
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 203
atgtggcagc gaagtaaaac cctggttctg gtgctgggac tgttgttaag tcatcaggcc     60

tttgccggcc ccgcctggga cagctacaaa gcgcgatttt tgatggcgga tggtcgcatc    120

attgataccg gcaacaacag tgtgagccac actgaggggc agggcttcgc catgatgatg    180

gcggtgcgta acaacgatcg cgccggtttt gacaaaatct ggaactggac gaaaaagaac    240

ctgcagaatc ccgagaccgg actgttctac tggcgttata acccggtggc accggacccg    300

attgccgaca gaaacaatgc caccgacggc gataccttta tcgcctgggc gctgttaaaa    360

gcgggtaccc agtggaatga caacagctat ctgaacgcct cggacgccat cactaaatcg    420

ctgctggccc gcaacgtaat cagctttgcc ggttaccgcg tgatgctgcc tggcgcgaag    480

ggctttaacc tcaacagcta cgtgaacctt aatccgtcct atttcatctt ccctgcgtgg    540

gaggatttcg cgaagcgcag tcatctgacg gtgtggcgtg acctgatcaa tgacggacag    600

aagctgctgg tgaagatgcg cttcggcaat acccagcttc ctgcagactg ggtttccctg    660

tacgccgacg gtcgcgtgac cccggccaaa gagtggccgg cgcgctttag ctacgatgcg    720

atccgcgtcc cgctgtatat caagtgggcg aatgcttcca gcccgctgat ggccccttac    780

accgcgtact ggggacggtt tgcccgtacc cagacgccag cctgggtgaa tgtcaccacc    840

ggcgatccgg cgccgtatat gatggcgggt gggttgctgg cggtgcgtga tttggcgctg    900

gggcaacttc cggcgggcga tccgcagatc acgacgcagg aggattacta ttctgcgagt    960

ctgaagatgc tggtgtcgtt agcgaaataa                                     990

<210> 204
<211> 329
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (1)...(329)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (45)...(48)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (76)...(79)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (107)...(110)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (108)...(126)
<223> Glycosyl hydrolases family 8 signature. Prosite id = PS00812

<220> 
<221> SITE
<222> (134)...(137)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (254)...(257)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (281)...(284)
<223> N-glycosylation site. Prosite id = PS00001

<400> 204
Met Trp Gln Arg Ser Lys Thr Leu Val Leu Val Leu Gly Leu Leu Leu 
1               5                   10                  15      


Ser His Gln Ala Phe Ala Gly Pro Ala Trp Asp Ser Tyr Lys Ala Arg 
            20                  25                  30          


Phe Leu Met Ala Asp Gly Arg Ile Ile Asp Thr Gly Asn Asn Ser Val 
        35                  40                  45              


Ser His Thr Glu Gly Gln Gly Phe Ala Met Met Met Ala Val Arg Asn 
    50                  55                  60                  


Asn Asp Arg Ala Gly Phe Asp Lys Ile Trp Asn Trp Thr Lys Lys Asn 
65                  70                  75                  80  


Leu Gln Asn Pro Glu Thr Gly Leu Phe Tyr Trp Arg Tyr Asn Pro Val 
                85                  90                  95      


Ala Pro Asp Pro Ile Ala Asp Arg Asn Asn Ala Thr Asp Gly Asp Thr 
            100                 105                 110         


Phe Ile Ala Trp Ala Leu Leu Lys Ala Gly Thr Gln Trp Asn Asp Asn 
        115                 120                 125             


Ser Tyr Leu Asn Ala Ser Asp Ala Ile Thr Lys Ser Leu Leu Ala Arg 
    130                 135                 140                 


Asn Val Ile Ser Phe Ala Gly Tyr Arg Val Met Leu Pro Gly Ala Lys 
145                 150                 155                 160 


Gly Phe Asn Leu Asn Ser Tyr Val Asn Leu Asn Pro Ser Tyr Phe Ile 
                165                 170                 175     


Phe Pro Ala Trp Glu Asp Phe Ala Lys Arg Ser His Leu Thr Val Trp 
            180                 185                 190         


Arg Asp Leu Ile Asn Asp Gly Gln Lys Leu Leu Val Lys Met Arg Phe 
        195                 200                 205             


Gly Asn Thr Gln Leu Pro Ala Asp Trp Val Ser Leu Tyr Ala Asp Gly 
    210                 215                 220                 


Arg Val Thr Pro Ala Lys Glu Trp Pro Ala Arg Phe Ser Tyr Asp Ala 
225                 230                 235                 240 


Ile Arg Val Pro Leu Tyr Ile Lys Trp Ala Asn Ala Ser Ser Pro Leu 
                245                 250                 255     


Met Ala Pro Tyr Thr Ala Tyr Trp Gly Arg Phe Ala Arg Thr Gln Thr 
            260                 265                 270         


Pro Ala Trp Val Asn Val Thr Thr Gly Asp Pro Ala Pro Tyr Met Met 
        275                 280                 285             


Ala Gly Gly Leu Leu Ala Val Arg Asp Leu Ala Leu Gly Gln Leu Pro 
    290                 295                 300                 


Ala Gly Asp Pro Gln Ile Thr Thr Gln Glu Asp Tyr Tyr Ser Ala Ser 
305                 310                 315                 320 


Leu Lys Met Leu Val Ser Leu Ala Lys 
                325                 


<210> 205
<211> 3033
<212> DNA
<213> Bacteria

<400> 205
atgggaacat ctcttatgat caaatctaca ctgacaggta tgattactgc tgttgccgcc     60

gcagttttca ccacctctgc agctttcgcg gatgtacctc cgttgacagt gagcggaaat    120

caggttttaa gtggcggtga agcaaaaagc ttcgctggta acagcttctt ttggagcaat    180

accggatggg gccaggaacg tttttacaac gcagaaactg tgcgttggtt gaaagacgac    240

tggaacgcaa ccattgtccg cgccgctatg ggcgtagact ttgatggcag ctatatcccc    300

gagcatgaag acgccgaccc cgagggtaac gtcgctcgcg tacgtgcatt ggtggatgca    360

gccatcgcag aagacatgta cgtgattatc gattttcaca ctcaccacgc agaagattac    420

caagccgaat ctatcgagtt cttcgaagaa atggccacac tgtacggtgg gtacgacaat    480

gttatttatg aaatctataa cgagcccctg caaatcagct gggacaatgt tattaaacct    540

tatgcagaat cggtgattgg cgctatccgc gcaatcgacc cggacaacct gattatcgtc    600

ggcacgccca cttggtcaca ggacgtggac gccgctgcgc gcaatccaat caccagctac    660

agcaatattg cgtacaccct gcacttttac gcaggcactc acggttcatg gttgcgcgat    720

aaagcgcgta acgctatgaa cagtggtatt gcgctgtttg tgactgagtg gggcaccgtt    780

aatgcagatg gcgatggtgc gcctgcagtt aacgaaactc agcaatggat ggacttcctc    840

aagcagaaca atatctctca cttgaactgg tccgtgagtg ataaattgga aggtgcgtct    900

atcgtacaac ctggcacgcc cattagcggc tggaacgctt ctgaccttac ggcctccggc    960

acactggtta agaacatcgt ttccaactgg ggcaccacaa tcggtaacgg cagctcctca   1020

agttcatcca gctcctcttc cagctcttca agcagttctt cttcgagcag ttcctcctcc   1080

agcagctctt cctcgtcaag cagctccgga tcaactggtg gcggcaactg tgctggagtg   1140

aatgtgtacc cgaactggac cgcgcgtgac tggtctggcg gcgcctacaa ccatgcgaac   1200

gctggcgacc aaatggtcta tcaaaacagc ctgtatcgtg ccaactggta caccaacagc   1260

gtgcctggca gcgacgcctc ctggactagc cttggcgcct gcggaggcaa cggaagtacg   1320

acctcatcca gctcaagcag ctcctcgtca agcagcagct cttcttccag cagctcctcg   1380

tctactggcg gtggctccag ctcctccagc agttcatctt cttcatcgtc gtcttccagc   1440

agctctagca gcactggtgg cggtcaatgt accgaagtgt gcaactggta cggtcaggga   1500

acctacccac tgtgtaacaa caccagtggt tggggttggg aaaacaatca gagctgtatc   1560

ggccgtcaaa cctgtgagtc acagaacggt ggcgctggcg gcgtggtgag caactgcacc   1620

ggttcgagta catccagcag ctcctcttcc agcagtagtt cttcctcaag tagcagctcc   1680

agttcatcca gcagctcttc atctggcact ggtagcagta catcttccag cagcagctct   1740

tccagcagct ccagctcaag taccggttcc tccggtatgc ctggaccacg cgtggacaac   1800

cccttcgccg ctgcgcagaa gtggtacata aacccaatgt ggtcagcgag tgctgcaaac   1860

gaacccggcg gctctgtcat tgccaacgaa ccctcgtttg tatggatgga ccgtatcggc   1920

gcaatcgaag ggcctgctga cggtatgggc ctgcgcgacc acttgaacga agcccttgca   1980

caaggcgccg acctgttcat gtttgttgtg tacgacctgc caaaccgtga ctgtgctgca   2040

ctcgcctcca acggtgaact gcgcatctcc gaagatggct tcaacatcta caagtccgac   2100

tacatcgcac ctatcgttga aatcatcagc gaccctgcat acgcaggtat caaaatcgct   2160

gcggttatcg aggtggactc actgcctaac ctggttacca atctgagcga acctgactgt   2220

caggaagcaa atggtcctgg cggctaccgc gacggcattc gtcacgctat cactgaactg   2280

ggcaaaatcc ccaacgtata ctcctacgtg gatattgcac actcaggctg gctgggctgg   2340

aacgacaact tcgcgcaagg cgttaacctg atttatgaag tggttgccaa cctcggttcc   2400

ggcattaacc caatcgccgg tttcgtcagt aactccgcta actacacgcc tgtggaagaa   2460

cccttcttgc cagacgccaa cctgcaggtc ggtggtcagc ccgttcgctc ttccgatttc   2520

tatgagtgga acagctacct ggcagagaaa cccttcgtga ccgattggcg ttctgccatg   2580

atctcgaaag gtatgccaag ctccatcggt atgctgatcg ataccgcacg taacggctgg   2640

ggtggccctg agcgtccaac tgcgcagtct acctccaaca acctgaacac cttcgttaac   2700

gaatcacgta tcgaccgtcg tgagcaccgc ggcaactggt gtaaccagcc tggtggtgtc   2760

ggctaccgtc caaccgctgc accttctcca ggtattgatg cctacgtttg ggtgaaacca   2820

cagggtgagt ctgacggtgt ttccgatcct aacttcgaga tcgatcctaa cgacccgaac   2880

aaacagcacg acccaatgtg tgatccgttc gccagcaact cgtccaacag tgcatacggc   2940

accggcgcta tgccaaatgc tccgcacgct ggtcgctggt tccctgaagc cttccagtta   3000

ctgcttgaaa acgcttaccc accaattaac tag                                3033

<210> 206
<211> 1010
<212> PRT
<213> Bacteria

<220> 
<221> SIGNAL
<222> (1)...(30)

<220> 
<221> DOMAIN
<222> (39)...(300)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (393)...(428)
<223> Carbohydrate binding domain

<220> 
<221> DOMAIN
<222> (493)...(521)
<223> Cellulose or protein binding domain

<220> 
<221> DOMAIN
<222> (610)...(959)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (83)...(86)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (163)...(172)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (275)...(278)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (288)...(291)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (293)...(296)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (316)...(319)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (341)...(344)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (390)...(393)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (443)...(446)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (513)...(516)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (523)...(526)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (546)...(549)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (679)...(695)
<223> Glycosyl hydrolases family 6 signature 1. Prosite id = PS00655

<220> 
<221> SITE
<222> (745)...(748)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (913)...(916)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (987)...(990)
<223> N-glycosylation site. Prosite id = PS00001

<400> 206
Met Gly Thr Ser Leu Met Ile Lys Ser Thr Leu Thr Gly Met Ile Thr 
1               5                   10                  15      


Ala Val Ala Ala Ala Val Phe Thr Thr Ser Ala Ala Phe Ala Asp Val 
            20                  25                  30          


Pro Pro Leu Thr Val Ser Gly Asn Gln Val Leu Ser Gly Gly Glu Ala 
        35                  40                  45              


Lys Ser Phe Ala Gly Asn Ser Phe Phe Trp Ser Asn Thr Gly Trp Gly 
    50                  55                  60                  


Gln Glu Arg Phe Tyr Asn Ala Glu Thr Val Arg Trp Leu Lys Asp Asp 
65                  70                  75                  80  


Trp Asn Ala Thr Ile Val Arg Ala Ala Met Gly Val Asp Phe Asp Gly 
                85                  90                  95      


Ser Tyr Ile Pro Glu His Glu Asp Ala Asp Pro Glu Gly Asn Val Ala 
            100                 105                 110         


Arg Val Arg Ala Leu Val Asp Ala Ala Ile Ala Glu Asp Met Tyr Val 
        115                 120                 125             


Ile Ile Asp Phe His Thr His His Ala Glu Asp Tyr Gln Ala Glu Ser 
    130                 135                 140                 


Ile Glu Phe Phe Glu Glu Met Ala Thr Leu Tyr Gly Gly Tyr Asp Asn 
145                 150                 155                 160 


Val Ile Tyr Glu Ile Tyr Asn Glu Pro Leu Gln Ile Ser Trp Asp Asn 
                165                 170                 175     


Val Ile Lys Pro Tyr Ala Glu Ser Val Ile Gly Ala Ile Arg Ala Ile 
            180                 185                 190         


Asp Pro Asp Asn Leu Ile Ile Val Gly Thr Pro Thr Trp Ser Gln Asp 
        195                 200                 205             


Val Asp Ala Ala Ala Arg Asn Pro Ile Thr Ser Tyr Ser Asn Ile Ala 
    210                 215                 220                 


Tyr Thr Leu His Phe Tyr Ala Gly Thr His Gly Ser Trp Leu Arg Asp 
225                 230                 235                 240 


Lys Ala Arg Asn Ala Met Asn Ser Gly Ile Ala Leu Phe Val Thr Glu 
                245                 250                 255     


Trp Gly Thr Val Asn Ala Asp Gly Asp Gly Ala Pro Ala Val Asn Glu 
            260                 265                 270         


Thr Gln Gln Trp Met Asp Phe Leu Lys Gln Asn Asn Ile Ser His Leu 
        275                 280                 285             


Asn Trp Ser Val Ser Asp Lys Leu Glu Gly Ala Ser Ile Val Gln Pro 
    290                 295                 300                 


Gly Thr Pro Ile Ser Gly Trp Asn Ala Ser Asp Leu Thr Ala Ser Gly 
305                 310                 315                 320 


Thr Leu Val Lys Asn Ile Val Ser Asn Trp Gly Thr Thr Ile Gly Asn 
                325                 330                 335     


Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
            340                 345                 350         


Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
        355                 360                 365             


Ser Gly Ser Thr Gly Gly Gly Asn Cys Ala Gly Val Asn Val Tyr Pro 
    370                 375                 380                 


Asn Trp Thr Ala Arg Asp Trp Ser Gly Gly Ala Tyr Asn His Ala Asn 
385                 390                 395                 400 


Ala Gly Asp Gln Met Val Tyr Gln Asn Ser Leu Tyr Arg Ala Asn Trp 
                405                 410                 415     


Tyr Thr Asn Ser Val Pro Gly Ser Asp Ala Ser Trp Thr Ser Leu Gly 
            420                 425                 430         


Ala Cys Gly Gly Asn Gly Ser Thr Thr Ser Ser Ser Ser Ser Ser Ser 
        435                 440                 445             


Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Gly Gly 
    450                 455                 460                 


Gly Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
465                 470                 475                 480 


Ser Ser Ser Ser Thr Gly Gly Gly Gln Cys Thr Glu Val Cys Asn Trp 
                485                 490                 495     


Tyr Gly Gln Gly Thr Tyr Pro Leu Cys Asn Asn Thr Ser Gly Trp Gly 
            500                 505                 510         


Trp Glu Asn Asn Gln Ser Cys Ile Gly Arg Gln Thr Cys Glu Ser Gln 
        515                 520                 525             


Asn Gly Gly Ala Gly Gly Val Val Ser Asn Cys Thr Gly Ser Ser Thr 
    530                 535                 540                 


Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
545                 550                 555                 560 


Ser Ser Ser Ser Ser Ser Ser Ser Gly Thr Gly Ser Ser Thr Ser Ser 
                565                 570                 575     


Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Thr Gly Ser Ser Gly 
            580                 585                 590         


Met Pro Gly Pro Arg Val Asp Asn Pro Phe Ala Ala Ala Gln Lys Trp 
        595                 600                 605             


Tyr Ile Asn Pro Met Trp Ser Ala Ser Ala Ala Asn Glu Pro Gly Gly 
    610                 615                 620                 


Ser Val Ile Ala Asn Glu Pro Ser Phe Val Trp Met Asp Arg Ile Gly 
625                 630                 635                 640 


Ala Ile Glu Gly Pro Ala Asp Gly Met Gly Leu Arg Asp His Leu Asn 
                645                 650                 655     


Glu Ala Leu Ala Gln Gly Ala Asp Leu Phe Met Phe Val Val Tyr Asp 
            660                 665                 670         


Leu Pro Asn Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu Leu Arg 
        675                 680                 685             


Ile Ser Glu Asp Gly Phe Asn Ile Tyr Lys Ser Asp Tyr Ile Ala Pro 
    690                 695                 700                 


Ile Val Glu Ile Ile Ser Asp Pro Ala Tyr Ala Gly Ile Lys Ile Ala 
705                 710                 715                 720 


Ala Val Ile Glu Val Asp Ser Leu Pro Asn Leu Val Thr Asn Leu Ser 
                725                 730                 735     


Glu Pro Asp Cys Gln Glu Ala Asn Gly Pro Gly Gly Tyr Arg Asp Gly 
            740                 745                 750         


Ile Arg His Ala Ile Thr Glu Leu Gly Lys Ile Pro Asn Val Tyr Ser 
        755                 760                 765             


Tyr Val Asp Ile Ala His Ser Gly Trp Leu Gly Trp Asn Asp Asn Phe 
    770                 775                 780                 


Ala Gln Gly Val Asn Leu Ile Tyr Glu Val Val Ala Asn Leu Gly Ser 
785                 790                 795                 800 


Gly Ile Asn Pro Ile Ala Gly Phe Val Ser Asn Ser Ala Asn Tyr Thr 
                805                 810                 815     


Pro Val Glu Glu Pro Phe Leu Pro Asp Ala Asn Leu Gln Val Gly Gly 
            820                 825                 830         


Gln Pro Val Arg Ser Ser Asp Phe Tyr Glu Trp Asn Ser Tyr Leu Ala 
        835                 840                 845             


Glu Lys Pro Phe Val Thr Asp Trp Arg Ser Ala Met Ile Ser Lys Gly 
    850                 855                 860                 


Met Pro Ser Ser Ile Gly Met Leu Ile Asp Thr Ala Arg Asn Gly Trp 
865                 870                 875                 880 


Gly Gly Pro Glu Arg Pro Thr Ala Gln Ser Thr Ser Asn Asn Leu Asn 
                885                 890                 895     


Thr Phe Val Asn Glu Ser Arg Ile Asp Arg Arg Glu His Arg Gly Asn 
            900                 905                 910         


Trp Cys Asn Gln Pro Gly Gly Val Gly Tyr Arg Pro Thr Ala Ala Pro 
        915                 920                 925             


Ser Pro Gly Ile Asp Ala Tyr Val Trp Val Lys Pro Gln Gly Glu Ser 
    930                 935                 940                 


Asp Gly Val Ser Asp Pro Asn Phe Glu Ile Asp Pro Asn Asp Pro Asn 
945                 950                 955                 960 


Lys Gln His Asp Pro Met Cys Asp Pro Phe Ala Ser Asn Ser Ser Asn 
                965                 970                 975     


Ser Ala Tyr Gly Thr Gly Ala Met Pro Asn Ala Pro His Ala Gly Arg 
            980                 985                 990         


Trp Phe Pro Glu Ala Phe Gln Leu  Leu Leu Glu Asn Ala  Tyr Pro Pro 
        995                 1000                 1005             


Ile Asn  
    1010 


<210> 207
<211> 984
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 207
gtgattgacc cgagcgacgc gcgcaaaatc accacctccg aagggcaaag ctacgccttg     60

ttcttcgcgc tggcggcgaa cgatcgtgaa gccttcgctt cgctgtttga ctggacgcaa    120

aacaacctgg cggaagggga tttaaaagcc catcttcccg cctggctgtg gggtaaaaag    180

gcggacgaca gctggaccgt gctcgacggt aactccgcgt cagatgccga tatctggata    240

gcctggtcgc tgctcgaagc cgggcggctg tggaaaatgc cgcagtacag cgaaaccggt    300

aaagccctgc tgtctcgcat cgccaaagag gaagtggtga aagtgcccgg tctgggctct    360

atgctgctac cgggtaaagt gggctttgtt gatgacgccg gctggcgctt taacccaagc    420

tatttgcctc cgcagattgc ggcttacttc gcgcgttttg gcgagccgtg gagcgcgatt    480

caggccacca accttcgcct gctgcaggaa acggcaccta aaggctactc gcctaactgg    540

gtacgttttg acaacaaaaa aggctggcag ctaaagcagg ataaatcgct gctcggcagc    600

tacgacgcga tccgcgtgta tctttgggtc ggcatgctca acgacgccga tccgcagaaa    660

ccgcggctgc tgaaaaagtt caggccaatg gcgatgcaaa ccgccaaagc gggcgcggtg    720

ccggagaaaa tcgatattgc caccgggaag gtggagggcg atggcccggt cggtttttct    780

gcctcgctgc tgccgttttt acaggaccgg gacgctcagg ccgttcagcg ccagcgcgtc    840

gccgaccgtt tccccggcaa tgacgcctat tacagctatg ttttgaccct gttcggacaa    900

ggatgggatc agcatcgttt tcgcttcacc gctcgcggtg aacttctacc tgactggggc    960

caggaatgcg taagttctca ctaa                                           984

<210> 208
<211> 327
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(305)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (74)...(92)
<223> Glycosyl hydrolases family 8 signature. Prosite id = PS00812

<400> 208
Met Ile Asp Pro Ser Asp Ala Arg Lys Ile Thr Thr Ser Glu Gly Gln 
1               5                   10                  15      


Ser Tyr Ala Leu Phe Phe Ala Leu Ala Ala Asn Asp Arg Glu Ala Phe 
            20                  25                  30          


Ala Ser Leu Phe Asp Trp Thr Gln Asn Asn Leu Ala Glu Gly Asp Leu 
        35                  40                  45              


Lys Ala His Leu Pro Ala Trp Leu Trp Gly Lys Lys Ala Asp Asp Ser 
    50                  55                  60                  


Trp Thr Val Leu Asp Gly Asn Ser Ala Ser Asp Ala Asp Ile Trp Ile 
65                  70                  75                  80  


Ala Trp Ser Leu Leu Glu Ala Gly Arg Leu Trp Lys Met Pro Gln Tyr 
                85                  90                  95      


Ser Glu Thr Gly Lys Ala Leu Leu Ser Arg Ile Ala Lys Glu Glu Val 
            100                 105                 110         


Val Lys Val Pro Gly Leu Gly Ser Met Leu Leu Pro Gly Lys Val Gly 
        115                 120                 125             


Phe Val Asp Asp Ala Gly Trp Arg Phe Asn Pro Ser Tyr Leu Pro Pro 
    130                 135                 140                 


Gln Ile Ala Ala Tyr Phe Ala Arg Phe Gly Glu Pro Trp Ser Ala Ile 
145                 150                 155                 160 


Gln Ala Thr Asn Leu Arg Leu Leu Gln Glu Thr Ala Pro Lys Gly Tyr 
                165                 170                 175     


Ser Pro Asn Trp Val Arg Phe Asp Asn Lys Lys Gly Trp Gln Leu Lys 
            180                 185                 190         


Gln Asp Lys Ser Leu Leu Gly Ser Tyr Asp Ala Ile Arg Val Tyr Leu 
        195                 200                 205             


Trp Val Gly Met Leu Asn Asp Ala Asp Pro Gln Lys Pro Arg Leu Leu 
    210                 215                 220                 


Lys Lys Phe Arg Pro Met Ala Met Gln Thr Ala Lys Ala Gly Ala Val 
225                 230                 235                 240 


Pro Glu Lys Ile Asp Ile Ala Thr Gly Lys Val Glu Gly Asp Gly Pro 
                245                 250                 255     


Val Gly Phe Ser Ala Ser Leu Leu Pro Phe Leu Gln Asp Arg Asp Ala 
            260                 265                 270         


Gln Ala Val Gln Arg Gln Arg Val Ala Asp Arg Phe Pro Gly Asn Asp 
        275                 280                 285             


Ala Tyr Tyr Ser Tyr Val Leu Thr Leu Phe Gly Gln Gly Trp Asp Gln 
    290                 295                 300                 


His Arg Phe Arg Phe Thr Ala Arg Gly Glu Leu Leu Pro Asp Trp Gly 
305                 310                 315                 320 


Gln Glu Cys Val Ser Ser His 
                325         


<210> 209
<211> 990
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 209
atgctggtgc gactgctgat agcgatgacc gtgctgtttt ctgcctttgc ccacgccgat     60

gcgtgggaaa gctataaatc ccggtttgtc atgcccgacg ggcgcgtggt ggataccggc    120

aatggcaacg tttcgcatac cgaaggccag gggtttgcca tgctgatggc ggtcagcctg    180

aatgaccgcc agacgtttga taagctttgg cagtggacca atgcgacgct aaaaaacaaa    240

gataacggcc tgttttactg gcgctataac ccgaccgcgg cggaccctat caccgataaa    300

aacgatgcca ccgatggcga tatgatgatc gcctgggcct tgctaaaggc gcagaaacag    360

tggcgcgaga acagctacgg tatcgcctcg gacgagataa cccgcgcgct actgaaacac    420

acggtaataa gctacgcggg ctacagggtg atgctgccgg gcgcgcacgg cttcaatctc    480

aacacccgca tcaacctgaa cccttcctac tttattttcc cggcctggca ggctttcgcc    540

gaccgcacgc atctggtggt gtggcgcgat ctgatgcgcg atggcaaaaa gctcatcggc    600

aaaatggggt ggggtaatgc caatcttccg accgactggg ttgcgctgag cgctgacggc    660

aaaatgaagc ccgccgacga gtggaagccg cgtatgagct atgacgccgt gcgcattccg    720

ctgtacatcc actggcagga cgcgcaaagc ccactgttag cgccgtggaa agcgttctgg    780

cagcgctatc agcgtgaaca aacgcctgcg tgggtcaacg tgatgaccaa cgaaacctct    840

ccttacccga tgaacggtgg cctgctggcg attcgcgatt acaccctggg catcaacacc    900

ggcgagccgc aaattaccgc gcaggacgac tactattccg cgagcctgaa aatgctgacg    960

tggctggcgg agcagtctgc gggacgctaa                                     990

<210> 210
<211> 329
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (1)...(326)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (43)...(46)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (75)...(78)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (104)...(122)
<223> Glycosyl hydrolases family 8 signature. Prosite id = PS00812

<220> 
<221> SITE
<222> (281)...(284)
<223> N-glycosylation site. Prosite id = PS00001

<400> 210
Met Leu Val Arg Leu Leu Ile Ala Met Thr Val Leu Phe Ser Ala Phe 
1               5                   10                  15      


Ala His Ala Asp Ala Trp Glu Ser Tyr Lys Ser Arg Phe Val Met Pro 
            20                  25                  30          


Asp Gly Arg Val Val Asp Thr Gly Asn Gly Asn Val Ser His Thr Glu 
        35                  40                  45              


Gly Gln Gly Phe Ala Met Leu Met Ala Val Ser Leu Asn Asp Arg Gln 
    50                  55                  60                  


Thr Phe Asp Lys Leu Trp Gln Trp Thr Asn Ala Thr Leu Lys Asn Lys 
65                  70                  75                  80  


Asp Asn Gly Leu Phe Tyr Trp Arg Tyr Asn Pro Thr Ala Ala Asp Pro 
                85                  90                  95      


Ile Thr Asp Lys Asn Asp Ala Thr Asp Gly Asp Met Met Ile Ala Trp 
            100                 105                 110         


Ala Leu Leu Lys Ala Gln Lys Gln Trp Arg Glu Asn Ser Tyr Gly Ile 
        115                 120                 125             


Ala Ser Asp Glu Ile Thr Arg Ala Leu Leu Lys His Thr Val Ile Ser 
    130                 135                 140                 


Tyr Ala Gly Tyr Arg Val Met Leu Pro Gly Ala His Gly Phe Asn Leu 
145                 150                 155                 160 


Asn Thr Arg Ile Asn Leu Asn Pro Ser Tyr Phe Ile Phe Pro Ala Trp 
                165                 170                 175     


Gln Ala Phe Ala Asp Arg Thr His Leu Val Val Trp Arg Asp Leu Met 
            180                 185                 190         


Arg Asp Gly Lys Lys Leu Ile Gly Lys Met Gly Trp Gly Asn Ala Asn 
        195                 200                 205             


Leu Pro Thr Asp Trp Val Ala Leu Ser Ala Asp Gly Lys Met Lys Pro 
    210                 215                 220                 


Ala Asp Glu Trp Lys Pro Arg Met Ser Tyr Asp Ala Val Arg Ile Pro 
225                 230                 235                 240 


Leu Tyr Ile His Trp Gln Asp Ala Gln Ser Pro Leu Leu Ala Pro Trp 
                245                 250                 255     


Lys Ala Phe Trp Gln Arg Tyr Gln Arg Glu Gln Thr Pro Ala Trp Val 
            260                 265                 270         


Asn Val Met Thr Asn Glu Thr Ser Pro Tyr Pro Met Asn Gly Gly Leu 
        275                 280                 285             


Leu Ala Ile Arg Asp Tyr Thr Leu Gly Ile Asn Thr Gly Glu Pro Gln 
    290                 295                 300                 


Ile Thr Ala Gln Asp Asp Tyr Tyr Ser Ala Ser Leu Lys Met Leu Thr 
305                 310                 315                 320 


Trp Leu Ala Glu Gln Ser Ala Gly Arg 
                325                 


<210> 211
<211> 1293
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 211
atgttgacac ggagagaatt gattgcagcg actgccctgg ggctcgccgc tagtacgaaa     60

cttgttgctg ctgccacgag taaaaccagg gatgatttcc tttggggggt ggccacggcg    120

ccacaccaaa tcgagggcaa caacctgaat gccgacctgt ggctggccga acaggtgaag    180

ccgacgatct ttcacgagcc ttcgggcgat gcctgtgaca gctaccatcg ctatgaggag    240

gatatcgagc tggtggcggc gctcggcttc aattgttacc gcttcggcat cgaatgggcc    300

cgcatcgagc ccgttcccgg ccagttttcg atcgccgagt tggaccacta ccggcgcatg    360

ctggaggctt gccacgcgca cgggctgacg ccggtggtca cgtacaacca tttcaccgcg    420

ccgcgctggt ttgcagccaa gggcggtttc acccagccgg aggctgccgg cctgttcgcc    480

cgctatgccg gcaaggccac ggagcaccta ggtgacctga tcggcatggc gctgacgttc    540

aacgaggcca atatccggcg cattgtcagc ttgctcctgg gcagtcccga ggcgctccag    600

ctgatcgacg cgatgtttgc cgagtgcgcg cgcctcagcg gatccgaccg gttctggtcg    660

atggtctttt cgccttacgg tgaggccgag ccaataatgg tggacgccca tgccaaggcc    720

agcgcggcga tcaaggccgg ccccggggac tttcccgtcg ggctcacgct gtcgatgcag    780

gacgtgcaag gcatcggcga gggcaaccag gccgaggctt tgatccaggc actttacggc    840

ccatggctcg aggtagcggc cgcgtccgat ttcgttggcg tgcagaccta cacccggata    900

cgcgtgggac cggaaggccg tatcccgccc gacgagggtg ccgaacggac cgatacgggc    960

ttcgagttct acccggcggc gttgggcggc gcgatccgat ttgcgcacga gcgcatcgga   1020

cgccccatct acgtgaccga gaacggcgtg gccgcgaatg acgacagccg gcgcattgtc   1080

tacatagacg gcgcattgaa agcgatgaag gactgcatcg acgagggcct ggacgtacgt   1140

ggctacatgc actggtcgct gttggacaac ttcgaatggt tcgaaggcta tgccaagaga   1200

tacggcctgg tggaggttga tcgcgaaacc ttcgagcgcc gtccgaagcc aagcgcccgg   1260

cacctgggtg cgattgcccg ctcgggcctc taa                                1293

<210> 212
<211> 430
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (26)...(430)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (348)...(356)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 212
Met Leu Thr Arg Arg Glu Leu Ile Ala Ala Thr Ala Leu Gly Leu Ala 
1               5                   10                  15      


Ala Ser Thr Lys Leu Val Ala Ala Ala Thr Ser Lys Thr Arg Asp Asp 
            20                  25                  30          


Phe Leu Trp Gly Val Ala Thr Ala Pro His Gln Ile Glu Gly Asn Asn 
        35                  40                  45              


Leu Asn Ala Asp Leu Trp Leu Ala Glu Gln Val Lys Pro Thr Ile Phe 
    50                  55                  60                  


His Glu Pro Ser Gly Asp Ala Cys Asp Ser Tyr His Arg Tyr Glu Glu 
65                  70                  75                  80  


Asp Ile Glu Leu Val Ala Ala Leu Gly Phe Asn Cys Tyr Arg Phe Gly 
                85                  90                  95      


Ile Glu Trp Ala Arg Ile Glu Pro Val Pro Gly Gln Phe Ser Ile Ala 
            100                 105                 110         


Glu Leu Asp His Tyr Arg Arg Met Leu Glu Ala Cys His Ala His Gly 
        115                 120                 125             


Leu Thr Pro Val Val Thr Tyr Asn His Phe Thr Ala Pro Arg Trp Phe 
    130                 135                 140                 


Ala Ala Lys Gly Gly Phe Thr Gln Pro Glu Ala Ala Gly Leu Phe Ala 
145                 150                 155                 160 


Arg Tyr Ala Gly Lys Ala Thr Glu His Leu Gly Asp Leu Ile Gly Met 
                165                 170                 175     


Ala Leu Thr Phe Asn Glu Ala Asn Ile Arg Arg Ile Val Ser Leu Leu 
            180                 185                 190         


Leu Gly Ser Pro Glu Ala Leu Gln Leu Ile Asp Ala Met Phe Ala Glu 
        195                 200                 205             


Cys Ala Arg Leu Ser Gly Ser Asp Arg Phe Trp Ser Met Val Phe Ser 
    210                 215                 220                 


Pro Tyr Gly Glu Ala Glu Pro Ile Met Val Asp Ala His Ala Lys Ala 
225                 230                 235                 240 


Ser Ala Ala Ile Lys Ala Gly Pro Gly Asp Phe Pro Val Gly Leu Thr 
                245                 250                 255     


Leu Ser Met Gln Asp Val Gln Gly Ile Gly Glu Gly Asn Gln Ala Glu 
            260                 265                 270         


Ala Leu Ile Gln Ala Leu Tyr Gly Pro Trp Leu Glu Val Ala Ala Ala 
        275                 280                 285             


Ser Asp Phe Val Gly Val Gln Thr Tyr Thr Arg Ile Arg Val Gly Pro 
    290                 295                 300                 


Glu Gly Arg Ile Pro Pro Asp Glu Gly Ala Glu Arg Thr Asp Thr Gly 
305                 310                 315                 320 


Phe Glu Phe Tyr Pro Ala Ala Leu Gly Gly Ala Ile Arg Phe Ala His 
                325                 330                 335     


Glu Arg Ile Gly Arg Pro Ile Tyr Val Thr Glu Asn Gly Val Ala Ala 
            340                 345                 350         


Asn Asp Asp Ser Arg Arg Ile Val Tyr Ile Asp Gly Ala Leu Lys Ala 
        355                 360                 365             


Met Lys Asp Cys Ile Asp Glu Gly Leu Asp Val Arg Gly Tyr Met His 
    370                 375                 380                 


Trp Ser Leu Leu Asp Asn Phe Glu Trp Phe Glu Gly Tyr Ala Lys Arg 
385                 390                 395                 400 


Tyr Gly Leu Val Glu Val Asp Arg Glu Thr Phe Glu Arg Arg Pro Lys 
                405                 410                 415     


Pro Ser Ala Arg His Leu Gly Ala Ile Ala Arg Ser Gly Leu 
            420                 425                 430 


<210> 213
<211> 1995
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 213
atggttcagc tctacatcaa aaacggcttt ccggccggtt acgaacacgg acccgtcgct     60

acctttaacg ttccgatggt agtcaacggc accctgataa accagacctt tgagctttac    120

gacgttgtcg ctgatcccgg ctggaggttc ctgacgttta aatcaaccaa aaactacgtc    180

aacgcgagcg ttgtttttga ctacacccac ttcatcgaga tggccaacga ctatctaaac    240

gactcgctga acagtactta cctcatgtcc ctggagtttg gaaccgagat ctacaccaac    300

gggataagtt ccttcccagg gaccgttgat gtagaatgga gcttaaacga ctattactac    360

gccctggcac ccaggggact cacggcggag gatgctttgg gaatgtgggc agctcttatt    420

cccaacaccg gaaacacctc caacggagga acggaagaaa gtggtgaaga taacaccggt    480

ggcaccaatg gaacaaccag caccaccgaa agcgaaaccg ttgtgctaac ctatccggat    540

gacggccagt ggccgacggc ttacattgac gggaacaaga acggcgtccc cgattacgtt    600

atggagatca acccctggaa cattcagagt gccgatggaa aagcggtgat gcggtacaac    660

tactcgaccg gttacctgca ctactcacag gatctcagta acatagtaat aagaaacgcc    720

ggcggctggg ttcatggata tccagagatc tggtacggga acaagccatg gaacgagtac    780

aacgccaccg atggagaggt gcctttgcca ggacaagttg ggaagctcaa caacttctac    840

gtaactgtaa actacaacct cacccacgaa aacggtctgc ctgttgacct tgcgatagag    900

tcgtggctga cgagggataa gtggcgctca accgggataa agagcgatga gcaggagctc    960

atgatatggc tctactacga tggcctccag ccagccggtt caaaggtggg ggagataacg   1020

gttcccataa tagttaacgg agttgagaaa aacgcaacct tcgaagtttg gaaaggcaac   1080

attggatggg agtacgtggc cttcaggata aaaacgccgc ttccttcggc caacgtaacc   1140

cttccctacg gtcccttcat cagtgccgcc atgaacgtga catcgctgaa ggattacgcc   1200

tcgctttacc ttgaggacgt ggaggttgga accgagttcg gaagtccaag cgtcacatct   1260

gcaaaacttg agtggacttt ctatgagttc aaactaacct acacagacgg gccattgata   1320

accaacgttt cggtatctgc aggtggctcc aactcgggca attccaacgg tattaatgga   1380

aaccagtctg ccacctccgg aaacctgacg gtaagcctta agaacgcctg gggaggtggc   1440

gctcagtaca gcgttaacat aaccctcgat ggaaaggctg tgtggagggt tctcctcagg   1500

ataaaggacg gcagtgttgt tgattcatgg ggtggaaagg tcactggaat gaacggaagc   1560

tacgttgtta ttgaggccga ggactacaac ctggggccaa aggcggtttt gggctttgtt   1620

acgtccggag ctgcaccaat agtgcaggaa gccctcctgc ttgtcaacgg ggaagtcgtg   1680

gcaaggtggg aggcacctgt tgagaggcct gcgctggaag tggatggata cgttgacagt   1740

gaatggaacg acgggttcgt gtacaaaata aggataacca acaggggaag cgctccagtt   1800

gcagggtgga ctttgacact ctcaatgacg agcgagatag tgagcatctg gggagcgacc   1860

tacgagagac ttcctgacgg aacgatagtc cttaaaccga tggactacac ctcggtaata   1920

gctccgggac agagcgttga gataggcttt caggctaaga aggcaggaga tataccatat   1980

ccaaggatag tttaa                                                    1995

<210> 214
<211> 664
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (265)...(435)
<223> Glycosyl hydrolase family 12

<220> 
<221> DOMAIN
<222> (572)...(662)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (29)...(32)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (34)...(37)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (61)...(64)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (81)...(84)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (85)...(88)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (147)...(150)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (165)...(168)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (223)...(226)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (265)...(268)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (290)...(293)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (356)...(359)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (383)...(386)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (398)...(401)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (448)...(451)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (468)...(471)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (475)...(478)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (493)...(496)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (525)...(528)
<223> N-glycosylation site. Prosite id = PS00001

<400> 214
Met Val Gln Leu Tyr Ile Lys Asn Gly Phe Pro Ala Gly Tyr Glu His 
1               5                   10                  15      


Gly Pro Val Ala Thr Phe Asn Val Pro Met Val Val Asn Gly Thr Leu 
            20                  25                  30          


Ile Asn Gln Thr Phe Glu Leu Tyr Asp Val Val Ala Asp Pro Gly Trp 
        35                  40                  45              


Arg Phe Leu Thr Phe Lys Ser Thr Lys Asn Tyr Val Asn Ala Ser Val 
    50                  55                  60                  


Val Phe Asp Tyr Thr His Phe Ile Glu Met Ala Asn Asp Tyr Leu Asn 
65                  70                  75                  80  


Asp Ser Leu Asn Ser Thr Tyr Leu Met Ser Leu Glu Phe Gly Thr Glu 
                85                  90                  95      


Ile Tyr Thr Asn Gly Ile Ser Ser Phe Pro Gly Thr Val Asp Val Glu 
            100                 105                 110         


Trp Ser Leu Asn Asp Tyr Tyr Tyr Ala Leu Ala Pro Arg Gly Leu Thr 
        115                 120                 125             


Ala Glu Asp Ala Leu Gly Met Trp Ala Ala Leu Ile Pro Asn Thr Gly 
    130                 135                 140                 


Asn Thr Ser Asn Gly Gly Thr Glu Glu Ser Gly Glu Asp Asn Thr Gly 
145                 150                 155                 160 


Gly Thr Asn Gly Thr Thr Ser Thr Thr Glu Ser Glu Thr Val Val Leu 
                165                 170                 175     


Thr Tyr Pro Asp Asp Gly Gln Trp Pro Thr Ala Tyr Ile Asp Gly Asn 
            180                 185                 190         


Lys Asn Gly Val Pro Asp Tyr Val Met Glu Ile Asn Pro Trp Asn Ile 
        195                 200                 205             


Gln Ser Ala Asp Gly Lys Ala Val Met Arg Tyr Asn Tyr Ser Thr Gly 
    210                 215                 220                 


Tyr Leu His Tyr Ser Gln Asp Leu Ser Asn Ile Val Ile Arg Asn Ala 
225                 230                 235                 240 


Gly Gly Trp Val His Gly Tyr Pro Glu Ile Trp Tyr Gly Asn Lys Pro 
                245                 250                 255     


Trp Asn Glu Tyr Asn Ala Thr Asp Gly Glu Val Pro Leu Pro Gly Gln 
            260                 265                 270         


Val Gly Lys Leu Asn Asn Phe Tyr Val Thr Val Asn Tyr Asn Leu Thr 
        275                 280                 285             


His Glu Asn Gly Leu Pro Val Asp Leu Ala Ile Glu Ser Trp Leu Thr 
    290                 295                 300                 


Arg Asp Lys Trp Arg Ser Thr Gly Ile Lys Ser Asp Glu Gln Glu Leu 
305                 310                 315                 320 


Met Ile Trp Leu Tyr Tyr Asp Gly Leu Gln Pro Ala Gly Ser Lys Val 
                325                 330                 335     


Gly Glu Ile Thr Val Pro Ile Ile Val Asn Gly Val Glu Lys Asn Ala 
            340                 345                 350         


Thr Phe Glu Val Trp Lys Gly Asn Ile Gly Trp Glu Tyr Val Ala Phe 
        355                 360                 365             


Arg Ile Lys Thr Pro Leu Pro Ser Ala Asn Val Thr Leu Pro Tyr Gly 
    370                 375                 380                 


Pro Phe Ile Ser Ala Ala Met Asn Val Thr Ser Leu Lys Asp Tyr Ala 
385                 390                 395                 400 


Ser Leu Tyr Leu Glu Asp Val Glu Val Gly Thr Glu Phe Gly Ser Pro 
                405                 410                 415     


Ser Val Thr Ser Ala Lys Leu Glu Trp Thr Phe Tyr Glu Phe Lys Leu 
            420                 425                 430         


Thr Tyr Thr Asp Gly Pro Leu Ile Thr Asn Val Ser Val Ser Ala Gly 
        435                 440                 445             


Gly Ser Asn Ser Gly Asn Ser Asn Gly Ile Asn Gly Asn Gln Ser Ala 
    450                 455                 460                 


Thr Ser Gly Asn Leu Thr Val Ser Leu Lys Asn Ala Trp Gly Gly Gly 
465                 470                 475                 480 


Ala Gln Tyr Ser Val Asn Ile Thr Leu Asp Gly Lys Ala Val Trp Arg 
                485                 490                 495     


Val Leu Leu Arg Ile Lys Asp Gly Ser Val Val Asp Ser Trp Gly Gly 
            500                 505                 510         


Lys Val Thr Gly Met Asn Gly Ser Tyr Val Val Ile Glu Ala Glu Asp 
        515                 520                 525             


Tyr Asn Leu Gly Pro Lys Ala Val Leu Gly Phe Val Thr Ser Gly Ala 
    530                 535                 540                 


Ala Pro Ile Val Gln Glu Ala Leu Leu Leu Val Asn Gly Glu Val Val 
545                 550                 555                 560 


Ala Arg Trp Glu Ala Pro Val Glu Arg Pro Ala Leu Glu Val Asp Gly 
                565                 570                 575     


Tyr Val Asp Ser Glu Trp Asn Asp Gly Phe Val Tyr Lys Ile Arg Ile 
            580                 585                 590         


Thr Asn Arg Gly Ser Ala Pro Val Ala Gly Trp Thr Leu Thr Leu Ser 
        595                 600                 605             


Met Thr Ser Glu Ile Val Ser Ile Trp Gly Ala Thr Tyr Glu Arg Leu 
    610                 615                 620                 


Pro Asp Gly Thr Ile Val Leu Lys Pro Met Asp Tyr Thr Ser Val Ile 
625                 630                 635                 640 


Ala Pro Gly Gln Ser Val Glu Ile Gly Phe Gln Ala Lys Lys Ala Gly 
                645                 650                 655     


Asp Ile Pro Tyr Pro Arg Ile Val 
            660                 


<210> 215
<211> 1716
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 215
ttgaacgaga ccacgacatt ctacatatac gcctggaacc agaacggctc agttaacaag     60

acggtaactg taactgtggt agagcaggga cagcagccgg gtgcttcagg cacaggggag    120

atagtcatat catatcctga gtcacagtgg cctaccgccg agatcgatat tgatgacgac    180

cgtgtaaacg actatgttat agagataaac ccgtggaaca ttgagagcgc aaccggctcg    240

gcagtcatga catactacct ggataacggt acactatact actcacagtc gctagacaac    300

atagtactaa ggaatccagg agcctgggtc cacggctacc cagagatatt ctacggcaac    360

aagccgtgga atacttacag cgctacaggc ggccaagtac cattgccagg catggtgtca    420

gagctaaaca gcttctacgt gacggttagc tataagctaa acccggagcc ggggcttcca    480

gtgaatcttg caatagaatc atggctgacg agagagcagt ggaggactac cgggatcaac    540

ccagacgaac aagaactaat gatatggctc tactatgacg gcctccagcc cgcaggctct    600

aagataggcg agatagtggt gcccatatac gtgaacggca cccaggtcaa cgccacattc    660

gaggtttgga ggggcttcat aggctgggag tacatagcct tcaggataaa aacccccatg    720

aaatccgcta cagtgacaat accctattcg ccattcataa gcgcggcggc gaatataacc    780

gatcttacag actacgcgag cctataccta gaggacgtcg aggtaggaac agagtatgga    840

agcccatcca ctagctcagc acatatggaa tggtggatct acagcttcaa tctaacctac    900

acagaccagc ccctactaac agctccaccg ccaccacccg gtccgggaga cggttcagga    960

ggcggtagcg ggggcgaaga aggcggtagt ggaggagtag cgtcaaatgg caccctgcaa   1020

gtaagcctag ccagctcctg gggtagcggc gcccagtatg atatagtagt cttgctagac   1080

caacagtctg agtggagcct gctagtcaag gttgccaacg gctatataag cgattcctgg   1140

ggcgctacgc tcaacggtac tacacaggac ggctacatag tactagtctc agaaccctgg   1200

aacctagggc ccacagcaac agcaggcttc ataacctcag ggtcaaaccc gctagtggaa   1260

gaagttatcc tcatagcagg cggccaagag ctagacaggt ggacggcccc ccagccagat   1320

gcaggagcac tcgacgttag actagtaata gaaagcgact gggataccgg cttcgtagcc   1380

aagatctacg tggcaaacaa cggggacacg ccaatatcgg gatggtacat cgaagtacag   1440

atgacaagca ccataagcag tatatggggc gcaaaagccc agcctgcagg agacgatacc   1500

tacattctca caccagtaga ctatactgca gtaatatctc caggtcagac aatagaagta   1560

ggattcgtag cagaaaaagc aggcaatcac ccatacccca caatagtaga ctacgggatc   1620

acaggctccg ctagcacaca gacgggagca ggaatcgcga ttctactagc aacagtagcc   1680

atcgtgacac taaaccataa aagaagacca gtttaa                             1716

<210> 216
<211> 571
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (131)...(301)
<223> Glycosyl hydrolase family 12

<220> 
<221> DOMAIN
<222> (444)...(534)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (15)...(18)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (19)...(22)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (90)...(93)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (215)...(218)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (220)...(223)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (261)...(264)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (301)...(304)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (341)...(344)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (390)...(393)
<223> N-glycosylation site. Prosite id = PS00001

<400> 216
Met Asn Glu Thr Thr Thr Phe Tyr Ile Tyr Ala Trp Asn Gln Asn Gly 
1               5                   10                  15      


Ser Val Asn Lys Thr Val Thr Val Thr Val Val Glu Gln Gly Gln Gln 
            20                  25                  30          


Pro Gly Ala Ser Gly Thr Gly Glu Ile Val Ile Ser Tyr Pro Glu Ser 
        35                  40                  45              


Gln Trp Pro Thr Ala Glu Ile Asp Ile Asp Asp Asp Arg Val Asn Asp 
    50                  55                  60                  


Tyr Val Ile Glu Ile Asn Pro Trp Asn Ile Glu Ser Ala Thr Gly Ser 
65                  70                  75                  80  


Ala Val Met Thr Tyr Tyr Leu Asp Asn Gly Thr Leu Tyr Tyr Ser Gln 
                85                  90                  95      


Ser Leu Asp Asn Ile Val Leu Arg Asn Pro Gly Ala Trp Val His Gly 
            100                 105                 110         


Tyr Pro Glu Ile Phe Tyr Gly Asn Lys Pro Trp Asn Thr Tyr Ser Ala 
        115                 120                 125             


Thr Gly Gly Gln Val Pro Leu Pro Gly Met Val Ser Glu Leu Asn Ser 
    130                 135                 140                 


Phe Tyr Val Thr Val Ser Tyr Lys Leu Asn Pro Glu Pro Gly Leu Pro 
145                 150                 155                 160 


Val Asn Leu Ala Ile Glu Ser Trp Leu Thr Arg Glu Gln Trp Arg Thr 
                165                 170                 175     


Thr Gly Ile Asn Pro Asp Glu Gln Glu Leu Met Ile Trp Leu Tyr Tyr 
            180                 185                 190         


Asp Gly Leu Gln Pro Ala Gly Ser Lys Ile Gly Glu Ile Val Val Pro 
        195                 200                 205             


Ile Tyr Val Asn Gly Thr Gln Val Asn Ala Thr Phe Glu Val Trp Arg 
    210                 215                 220                 


Gly Phe Ile Gly Trp Glu Tyr Ile Ala Phe Arg Ile Lys Thr Pro Met 
225                 230                 235                 240 


Lys Ser Ala Thr Val Thr Ile Pro Tyr Ser Pro Phe Ile Ser Ala Ala 
                245                 250                 255     


Ala Asn Ile Thr Asp Leu Thr Asp Tyr Ala Ser Leu Tyr Leu Glu Asp 
            260                 265                 270         


Val Glu Val Gly Thr Glu Tyr Gly Ser Pro Ser Thr Ser Ser Ala His 
        275                 280                 285             


Met Glu Trp Trp Ile Tyr Ser Phe Asn Leu Thr Tyr Thr Asp Gln Pro 
    290                 295                 300                 


Leu Leu Thr Ala Pro Pro Pro Pro Pro Gly Pro Gly Asp Gly Ser Gly 
305                 310                 315                 320 


Gly Gly Ser Gly Gly Glu Glu Gly Gly Ser Gly Gly Val Ala Ser Asn 
                325                 330                 335     


Gly Thr Leu Gln Val Ser Leu Ala Ser Ser Trp Gly Ser Gly Ala Gln 
            340                 345                 350         


Tyr Asp Ile Val Val Leu Leu Asp Gln Gln Ser Glu Trp Ser Leu Leu 
        355                 360                 365             


Val Lys Val Ala Asn Gly Tyr Ile Ser Asp Ser Trp Gly Ala Thr Leu 
    370                 375                 380                 


Asn Gly Thr Thr Gln Asp Gly Tyr Ile Val Leu Val Ser Glu Pro Trp 
385                 390                 395                 400 


Asn Leu Gly Pro Thr Ala Thr Ala Gly Phe Ile Thr Ser Gly Ser Asn 
                405                 410                 415     


Pro Leu Val Glu Glu Val Ile Leu Ile Ala Gly Gly Gln Glu Leu Asp 
            420                 425                 430         


Arg Trp Thr Ala Pro Gln Pro Asp Ala Gly Ala Leu Asp Val Arg Leu 
        435                 440                 445             


Val Ile Glu Ser Asp Trp Asp Thr Gly Phe Val Ala Lys Ile Tyr Val 
    450                 455                 460                 


Ala Asn Asn Gly Asp Thr Pro Ile Ser Gly Trp Tyr Ile Glu Val Gln 
465                 470                 475                 480 


Met Thr Ser Thr Ile Ser Ser Ile Trp Gly Ala Lys Ala Gln Pro Ala 
                485                 490                 495     


Gly Asp Asp Thr Tyr Ile Leu Thr Pro Val Asp Tyr Thr Ala Val Ile 
            500                 505                 510         


Ser Pro Gly Gln Thr Ile Glu Val Gly Phe Val Ala Glu Lys Ala Gly 
        515                 520                 525             


Asn His Pro Tyr Pro Thr Ile Val Asp Tyr Gly Ile Thr Gly Ser Ala 
    530                 535                 540                 


Ser Thr Gln Thr Gly Ala Gly Ile Ala Ile Leu Leu Ala Thr Val Ala 
545                 550                 555                 560 


Ile Val Thr Leu Asn His Lys Arg Arg Pro Val 
                565                 570     


<210> 217
<211> 1065
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 217
atgcggttca gaaagaattt cgcagttctc atgttaatcg ttttgatttc aacgctgttt     60

ctatcaactc agtgtaaagg taacataaag aacgtagaaa gcaaaaagga aggggagaca    120

gttcacgtgt caaagtctgc gtttgattac aacaaaatga ttgggagagg cataaatatg    180

gggaacgcgc ttgaagcccc atttgaggga gcttggggtg taacaattga agatgagtac    240

ttcaaattaa taaaagaacg cggttttgat tctgtgagaa tacctgtcag gtggtcagca    300

catgtttcag aagagccacc ttacaaaatc aacgaagatt ttctaaatag agtgaagcac    360

gttgtcgatg aagcactgaa gaataactta acggtaatta ttaacactca tcacttcgaa    420

gagctgtatg ctgatccgga caagaacggt cctatattga ttgaaatatg gcgtcaggtc    480

gcagagtttt tcaaagatta tcctgataat ctgtttttcg agatttacaa cgaacctgct    540

caaaatttga cagcggaaaa atggaacgaa ctctacccaa aggttctcga agtaattaga    600

aagacaaatc ctacaaggat agtaataatc gacgttccaa actgggctca ttacagcgct    660

ataagatcac taaaacttgt tgatgacaag aatataatcg tgtcgttcca ttattatgaa    720

cccttcaact tcacccatca aggtgccgaa tgggttacgc caaggctccc agttggcgtt    780

gaatggaaag gtgaagaatg ggaagtgaat accataagaa accatttcaa atatgtaagc    840

gactgggcaa agaagaacaa tgttccagtt tttcttggag aattcggtgc ttattcaaaa    900

gcagacatga actcaagagt gagatggact gaaactgtga gaaaaatagc tgaagaattc    960

ggattttctt atgcatattg ggaattctgc gcagggtttg gtatctatga tcgctggtcg   1020

cagaaatgga tcgaaccgct cgcaacttct gttgttggga ggtaa                   1065

<210> 218
<211> 354
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (61)...(337)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (184)...(187)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (246)...(249)
<223> N-glycosylation site. Prosite id = PS00001

<400> 218
Met Arg Phe Arg Lys Asn Phe Ala Val Leu Met Leu Ile Val Leu Ile 
1               5                   10                  15      


Ser Thr Leu Phe Leu Ser Thr Gln Cys Lys Gly Asn Ile Lys Asn Val 
            20                  25                  30          


Glu Ser Lys Lys Glu Gly Glu Thr Val His Val Ser Lys Ser Ala Phe 
        35                  40                  45              


Asp Tyr Asn Lys Met Ile Gly Arg Gly Ile Asn Met Gly Asn Ala Leu 
    50                  55                  60                  


Glu Ala Pro Phe Glu Gly Ala Trp Gly Val Thr Ile Glu Asp Glu Tyr 
65                  70                  75                  80  


Phe Lys Leu Ile Lys Glu Arg Gly Phe Asp Ser Val Arg Ile Pro Val 
                85                  90                  95      


Arg Trp Ser Ala His Val Ser Glu Glu Pro Pro Tyr Lys Ile Asn Glu 
            100                 105                 110         


Asp Phe Leu Asn Arg Val Lys His Val Val Asp Glu Ala Leu Lys Asn 
        115                 120                 125             


Asn Leu Thr Val Ile Ile Asn Thr His His Phe Glu Glu Leu Tyr Ala 
    130                 135                 140                 


Asp Pro Asp Lys Asn Gly Pro Ile Leu Ile Glu Ile Trp Arg Gln Val 
145                 150                 155                 160 


Ala Glu Phe Phe Lys Asp Tyr Pro Asp Asn Leu Phe Phe Glu Ile Tyr 
                165                 170                 175     


Asn Glu Pro Ala Gln Asn Leu Thr Ala Glu Lys Trp Asn Glu Leu Tyr 
            180                 185                 190         


Pro Lys Val Leu Glu Val Ile Arg Lys Thr Asn Pro Thr Arg Ile Val 
        195                 200                 205             


Ile Ile Asp Val Pro Asn Trp Ala His Tyr Ser Ala Ile Arg Ser Leu 
    210                 215                 220                 


Lys Leu Val Asp Asp Lys Asn Ile Ile Val Ser Phe His Tyr Tyr Glu 
225                 230                 235                 240 


Pro Phe Asn Phe Thr His Gln Gly Ala Glu Trp Val Thr Pro Arg Leu 
                245                 250                 255     


Pro Val Gly Val Glu Trp Lys Gly Glu Glu Trp Glu Val Asn Thr Ile 
            260                 265                 270         


Arg Asn His Phe Lys Tyr Val Ser Asp Trp Ala Lys Lys Asn Asn Val 
        275                 280                 285             


Pro Val Phe Leu Gly Glu Phe Gly Ala Tyr Ser Lys Ala Asp Met Asn 
    290                 295                 300                 


Ser Arg Val Arg Trp Thr Glu Thr Val Arg Lys Ile Ala Glu Glu Phe 
305                 310                 315                 320 


Gly Phe Ser Tyr Ala Tyr Trp Glu Phe Cys Ala Gly Phe Gly Ile Tyr 
                325                 330                 335     


Asp Arg Trp Ser Gln Lys Trp Ile Glu Pro Leu Ala Thr Ser Val Val 
            340                 345                 350         


Gly Arg 
        


<210> 219
<211> 1104
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 219
atggacgaat tggaatcatc ctgcgctttt cctatgtccc gctacttgct cctatgggtc     60

tgggtcatgc tttcgagctc ggcattcgcc caaccgcaaa aagttgatga cgtagtgatc    120

gacaccacgc acgcaccagc ctccccgcct cgcacgtatt cacttccttt cgtgcgtgtt    180

gaaggcaacc gttttgtcga cgaaaacggc cgcacgctcg ttttccgagg cgtatcgatt    240

gccgatcctg accgactgga gcgcattggc aaatggaata aaacgctttt tgaagtgatc    300

aaaaaagact ggaatgccaa cattgtccgc attccggtcc acccccaagc ttggcgcgag    360

cgtggcgcgt cggcgtacct aaaacttctc gaccaagcgg ttgcgtgggc taatgcgctg    420

cagctttact tgatcattga ctggcatagc attggtaact tgcgcacaga gctctttcag    480

cacccgatgt ataacaccac caaaacggag actttccgtt tctggaaaac cattgcagag    540

cactttcgcc ataatccgat cgtggctttc tatgagcttt ttaacgaacc cacccatttt    600

aacggtacgc taggtcgcat gtcttgggaa gagtacaagg ccattataga agaaatcatc    660

tacatcatct acgcgcacga ccagaccgtc atccccctgg tagggggctt tgattgggcc    720

tacgacctca ctccggttcg tgagcatccc attaactttc ccggggttgc ctacacggcg    780

catccctatc ctcaaaaacg ccagccccct tgggaagaaa aatgggagca agactgggga    840

tttgtggcaa acacgtatcc ggtgtttgtt accgaactgg ggttcatgag cgcagacggc    900

cctggtgccc acgttccagt aattggcgat gagacgtatg gtgaagcaat ccttcgctac    960

atggaacaaa aggggatttc ttggacggca tgggtatttg atcccgtttg gtccccgcaa   1020

ctcatcgcca attgggattt tgaacctaca cctcagggcc gatttttccg tgacaaaatg   1080

cgccagctta acccacgaaa ctaa                                          1104

<210> 220
<211> 367
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(30)

<220> 
<221> DOMAIN
<222> (63)...(341)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (94)...(97)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (167)...(170)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (204)...(207)
<223> N-glycosylation site. Prosite id = PS00001

<400> 220
Met Asp Glu Leu Glu Ser Ser Cys Ala Phe Pro Met Ser Arg Tyr Leu 
1               5                   10                  15      


Leu Leu Trp Val Trp Val Met Leu Ser Ser Ser Ala Phe Ala Gln Pro 
            20                  25                  30          


Gln Lys Val Asp Asp Val Val Ile Asp Thr Thr His Ala Pro Ala Ser 
        35                  40                  45              


Pro Pro Arg Thr Tyr Ser Leu Pro Phe Val Arg Val Glu Gly Asn Arg 
    50                  55                  60                  


Phe Val Asp Glu Asn Gly Arg Thr Leu Val Phe Arg Gly Val Ser Ile 
65                  70                  75                  80  


Ala Asp Pro Asp Arg Leu Glu Arg Ile Gly Lys Trp Asn Lys Thr Leu 
                85                  90                  95      


Phe Glu Val Ile Lys Lys Asp Trp Asn Ala Asn Ile Val Arg Ile Pro 
            100                 105                 110         


Val His Pro Gln Ala Trp Arg Glu Arg Gly Ala Ser Ala Tyr Leu Lys 
        115                 120                 125             


Leu Leu Asp Gln Ala Val Ala Trp Ala Asn Ala Leu Gln Leu Tyr Leu 
    130                 135                 140                 


Ile Ile Asp Trp His Ser Ile Gly Asn Leu Arg Thr Glu Leu Phe Gln 
145                 150                 155                 160 


His Pro Met Tyr Asn Thr Thr Lys Thr Glu Thr Phe Arg Phe Trp Lys 
                165                 170                 175     


Thr Ile Ala Glu His Phe Arg His Asn Pro Ile Val Ala Phe Tyr Glu 
            180                 185                 190         


Leu Phe Asn Glu Pro Thr His Phe Asn Gly Thr Leu Gly Arg Met Ser 
        195                 200                 205             


Trp Glu Glu Tyr Lys Ala Ile Ile Glu Glu Ile Ile Tyr Ile Ile Tyr 
    210                 215                 220                 


Ala His Asp Gln Thr Val Ile Pro Leu Val Gly Gly Phe Asp Trp Ala 
225                 230                 235                 240 


Tyr Asp Leu Thr Pro Val Arg Glu His Pro Ile Asn Phe Pro Gly Val 
                245                 250                 255     


Ala Tyr Thr Ala His Pro Tyr Pro Gln Lys Arg Gln Pro Pro Trp Glu 
            260                 265                 270         


Glu Lys Trp Glu Gln Asp Trp Gly Phe Val Ala Asn Thr Tyr Pro Val 
        275                 280                 285             


Phe Val Thr Glu Leu Gly Phe Met Ser Ala Asp Gly Pro Gly Ala His 
    290                 295                 300                 


Val Pro Val Ile Gly Asp Glu Thr Tyr Gly Glu Ala Ile Leu Arg Tyr 
305                 310                 315                 320 


Met Glu Gln Lys Gly Ile Ser Trp Thr Ala Trp Val Phe Asp Pro Val 
                325                 330                 335     


Trp Ser Pro Gln Leu Ile Ala Asn Trp Asp Phe Glu Pro Thr Pro Gln 
            340                 345                 350         


Gly Arg Phe Phe Arg Asp Lys Met Arg Gln Leu Asn Pro Arg Asn 
        355                 360                 365         


<210> 221
<211> 1587
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 221
atgtctagcg tcgcttcgtt gttgtccttg accttgctgc aggcgcaagc ggttaccgtc     60

ggcgtggatc tcgcgaaaga tctgcgcccg atctctccat acgtctacgg cacgaacacg    120

tccgactgga gcggccgtac gaaatacctg acgatgtggc gctggggcgg taaccggacg    180

accgcctaca actgggaaaa caacgcgagc cacgccggac gcgattgggc acaccagaac    240

gactcgttcc ttggtggcgg ggacattccc ggcgaggcgc ttcgcgcacc tctgaccgcg    300

gcggtggccg ccggcgcggc ggcgctcgtg accgtcccct gcatcggcta cgtggccgcg    360

gataagaacg gcggcgggga cgtcaatcag acgccgaact acctcgacgt tcggttccat    420

ccgtcgtacg ccacgaaagg ctccgcgttc agcaaccctc ccaacgtgaa cgaccacgcc    480

gtctatcagg acgagttcgt ggcttggctg ctgaaccggg tcgtcacgga tcggccgatc    540

tggtttgcgc tggacaacga gcccgacctc tgggcggaga cccacccgcg cattcagacc    600

cagaagccga cgtacgcggg gatcatggag atatcgaagc gctacgcgca ggcgatcaaa    660

tccgtcgcgc ccgattccct cgtattcggc cccgcgagct acggctggag cggttacgac    720

tcgttccaag ccgcgtccga cgcgaacggc cggttcttcc tcgacttcta cctgcagaac    780

ttccgcacgc tgcaggagca gaccggaaag cgctacctcg acgtcctcga cctccactgg    840

tatcccgagg cgcgcggcgc gggcaagcgc atcacggaag acggaacgga gcagggcctc    900

tacaccgcac ggatgcaggc cccccggtcg ctctgggacc caacctacac cgaagactct    960

tggatcgccc agtggggcac ccagggaccg attcggctgg taccgcgcat gctggagaag   1020

atcgcgggca actatccggg caccaagctc gcattcaccg agtggaatta cggcggcggc   1080

ggccacatca ccgggggcat cgcggcggcg gacgtcctcg gcattttcgg ccgtgaaggc   1140

gtgttcgccg cgaactactg ggacatcaag gacgacgagt cgttcgccta cggcggtttc   1200

gccatgttcc ggaacttcga cggtcaaggc gcgcggttcg gggacatttc cgttcgcgcc   1260

gccagcggcg acgtagccaa ggtgaccgcc tacgccagcc gggactcgct gtacaaaaac   1320

gaagtcaccg tcgtgctgat caacaagcag tttacgagca cccccgtcaa cctcacgctg   1380

gccggcgcgg acggattcca gccggtcatc gccgctcggc tctcgagcgc gtcgccccgc   1440

ccgagcccga tcgctctccc ggccgtctca ggttcgaccg tgaacctgac cttgccggcg   1500

ctcacggtca cgaccttgcg cttcaatcgc tttacgggtc ctcgaccggg gggccccatc   1560

atgctcccga agccgaaggg gcggtag                                       1587

<210> 222
<211> 528
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(17)

<220> 
<221> SITE
<222> (39)...(42)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (58)...(61)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (69)...(72)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (81)...(84)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (464)...(467)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (502)...(505)
<223> N-glycosylation site. Prosite id = PS00001

<400> 222
Met Ser Ser Val Ala Ser Leu Leu Ser Leu Thr Leu Leu Gln Ala Gln 
1               5                   10                  15      


Ala Val Thr Val Gly Val Asp Leu Ala Lys Asp Leu Arg Pro Ile Ser 
            20                  25                  30          


Pro Tyr Val Tyr Gly Thr Asn Thr Ser Asp Trp Ser Gly Arg Thr Lys 
        35                  40                  45              


Tyr Leu Thr Met Trp Arg Trp Gly Gly Asn Arg Thr Thr Ala Tyr Asn 
    50                  55                  60                  


Trp Glu Asn Asn Ala Ser His Ala Gly Arg Asp Trp Ala His Gln Asn 
65                  70                  75                  80  


Asp Ser Phe Leu Gly Gly Gly Asp Ile Pro Gly Glu Ala Leu Arg Ala 
                85                  90                  95      


Pro Leu Thr Ala Ala Val Ala Ala Gly Ala Ala Ala Leu Val Thr Val 
            100                 105                 110         


Pro Cys Ile Gly Tyr Val Ala Ala Asp Lys Asn Gly Gly Gly Asp Val 
        115                 120                 125             


Asn Gln Thr Pro Asn Tyr Leu Asp Val Arg Phe His Pro Ser Tyr Ala 
    130                 135                 140                 


Thr Lys Gly Ser Ala Phe Ser Asn Pro Pro Asn Val Asn Asp His Ala 
145                 150                 155                 160 


Val Tyr Gln Asp Glu Phe Val Ala Trp Leu Leu Asn Arg Val Val Thr 
                165                 170                 175     


Asp Arg Pro Ile Trp Phe Ala Leu Asp Asn Glu Pro Asp Leu Trp Ala 
            180                 185                 190         


Glu Thr His Pro Arg Ile Gln Thr Gln Lys Pro Thr Tyr Ala Gly Ile 
        195                 200                 205             


Met Glu Ile Ser Lys Arg Tyr Ala Gln Ala Ile Lys Ser Val Ala Pro 
    210                 215                 220                 


Asp Ser Leu Val Phe Gly Pro Ala Ser Tyr Gly Trp Ser Gly Tyr Asp 
225                 230                 235                 240 


Ser Phe Gln Ala Ala Ser Asp Ala Asn Gly Arg Phe Phe Leu Asp Phe 
                245                 250                 255     


Tyr Leu Gln Asn Phe Arg Thr Leu Gln Glu Gln Thr Gly Lys Arg Tyr 
            260                 265                 270         


Leu Asp Val Leu Asp Leu His Trp Tyr Pro Glu Ala Arg Gly Ala Gly 
        275                 280                 285             


Lys Arg Ile Thr Glu Asp Gly Thr Glu Gln Gly Leu Tyr Thr Ala Arg 
    290                 295                 300                 


Met Gln Ala Pro Arg Ser Leu Trp Asp Pro Thr Tyr Thr Glu Asp Ser 
305                 310                 315                 320 


Trp Ile Ala Gln Trp Gly Thr Gln Gly Pro Ile Arg Leu Val Pro Arg 
                325                 330                 335     


Met Leu Glu Lys Ile Ala Gly Asn Tyr Pro Gly Thr Lys Leu Ala Phe 
            340                 345                 350         


Thr Glu Trp Asn Tyr Gly Gly Gly Gly His Ile Thr Gly Gly Ile Ala 
        355                 360                 365             


Ala Ala Asp Val Leu Gly Ile Phe Gly Arg Glu Gly Val Phe Ala Ala 
    370                 375                 380                 


Asn Tyr Trp Asp Ile Lys Asp Asp Glu Ser Phe Ala Tyr Gly Gly Phe 
385                 390                 395                 400 


Ala Met Phe Arg Asn Phe Asp Gly Gln Gly Ala Arg Phe Gly Asp Ile 
                405                 410                 415     


Ser Val Arg Ala Ala Ser Gly Asp Val Ala Lys Val Thr Ala Tyr Ala 
            420                 425                 430         


Ser Arg Asp Ser Leu Tyr Lys Asn Glu Val Thr Val Val Leu Ile Asn 
        435                 440                 445             


Lys Gln Phe Thr Ser Thr Pro Val Asn Leu Thr Leu Ala Gly Ala Asp 
    450                 455                 460                 


Gly Phe Gln Pro Val Ile Ala Ala Arg Leu Ser Ser Ala Ser Pro Arg 
465                 470                 475                 480 


Pro Ser Pro Ile Ala Leu Pro Ala Val Ser Gly Ser Thr Val Asn Leu 
                485                 490                 495     


Thr Leu Pro Ala Leu Thr Val Thr Thr Leu Arg Phe Asn Arg Phe Thr 
            500                 505                 510         


Gly Pro Arg Pro Gly Gly Pro Ile Met Leu Pro Lys Pro Lys Gly Arg 
        515                 520                 525             


<210> 223
<211> 966
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 223
atgatgcgca cgctcgtcac gtcggcattc gcgtgcctgc tcctgcccct cggcaccggg     60

caggccgatg ccggcgtcct ggccgatccg atcggcatga ccagcgggtt ctacacggac    120

ccgaactcga atccggccgc atgggtcgcg gcgaaccccg gcgacggacg ggcgccggcg    180

atccgggaca acatcgcctc gcgccccatg gcccggtggt tcggttcctg gagcggtgac    240

atcggggccg cggtgggctc ctacgtggga gccgcggacg ccgccgacaa gcttcccgtc    300

ctgatcgcct acaacatccc cggccgggac gcctgtggcg gccactccgg cggcggggcg    360

ggttctccgg cggcgtaccg cacctggatc tccgccttcg cctcggccat cggcgggcga    420

ccggcgctgg tggtgatcga gcccgactcc ctcggtgatt acagctgtct gacccagcag    480

cagatcgatg agcgcaacgc catgctcaaa gacgccctgg cgcagttctc cgcgcacgcg    540

cccaacacgt ggacgtacct ggatgcggga aaccccgcct ggatcgacgc ggccaccatg    600

gcccggcatc tcgacggcgc cggggcccgg caggcgcacg gcttctcgtc gaacatctcg    660

aactactacg gcaacagccg gaacatcagt tacggcaacg ccatcaactc ggcgctgtcg    720

gcctcctacg gttacacgaa gcctttcgtc atcgacacca gccgcaacgg caacgattcc    780

aacggcgagt ggtgcaaccc cgcagggcgg aggatcgggg ccgtcagcca gacgggcggt    840

ggagccgaga tgctgctgtg gctgaagacc cccggcgagt ccgacggcaa ctgcggagtc    900

ggcgccggct ccgtggccgg tcagttcctc ccggaagtcg cgtacaagat gatctacgga    960

tactga                                                               966

<210> 224
<211> 321
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(24)

<220> 
<221> DOMAIN
<222> (39)...(311)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (145)...(154)
<223> Glycosyl hydrolases family 6 signature 2. Prosite id = PS00656

<220> 
<221> SITE
<222> (221)...(224)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (231)...(234)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (261)...(264)
<223> N-glycosylation site. Prosite id = PS00001

<400> 224
Met Met Arg Thr Leu Val Thr Ser Ala Phe Ala Cys Leu Leu Leu Pro 
1               5                   10                  15      


Leu Gly Thr Gly Gln Ala Asp Ala Gly Val Leu Ala Asp Pro Ile Gly 
            20                  25                  30          


Met Thr Ser Gly Phe Tyr Thr Asp Pro Asn Ser Asn Pro Ala Ala Trp 
        35                  40                  45              


Val Ala Ala Asn Pro Gly Asp Gly Arg Ala Pro Ala Ile Arg Asp Asn 
    50                  55                  60                  


Ile Ala Ser Arg Pro Met Ala Arg Trp Phe Gly Ser Trp Ser Gly Asp 
65                  70                  75                  80  


Ile Gly Ala Ala Val Gly Ser Tyr Val Gly Ala Ala Asp Ala Ala Asp 
                85                  90                  95      


Lys Leu Pro Val Leu Ile Ala Tyr Asn Ile Pro Gly Arg Asp Ala Cys 
            100                 105                 110         


Gly Gly His Ser Gly Gly Gly Ala Gly Ser Pro Ala Ala Tyr Arg Thr 
        115                 120                 125             


Trp Ile Ser Ala Phe Ala Ser Ala Ile Gly Gly Arg Pro Ala Leu Val 
    130                 135                 140                 


Val Ile Glu Pro Asp Ser Leu Gly Asp Tyr Ser Cys Leu Thr Gln Gln 
145                 150                 155                 160 


Gln Ile Asp Glu Arg Asn Ala Met Leu Lys Asp Ala Leu Ala Gln Phe 
                165                 170                 175     


Ser Ala His Ala Pro Asn Thr Trp Thr Tyr Leu Asp Ala Gly Asn Pro 
            180                 185                 190         


Ala Trp Ile Asp Ala Ala Thr Met Ala Arg His Leu Asp Gly Ala Gly 
        195                 200                 205             


Ala Arg Gln Ala His Gly Phe Ser Ser Asn Ile Ser Asn Tyr Tyr Gly 
    210                 215                 220                 


Asn Ser Arg Asn Ile Ser Tyr Gly Asn Ala Ile Asn Ser Ala Leu Ser 
225                 230                 235                 240 


Ala Ser Tyr Gly Tyr Thr Lys Pro Phe Val Ile Asp Thr Ser Arg Asn 
                245                 250                 255     


Gly Asn Asp Ser Asn Gly Glu Trp Cys Asn Pro Ala Gly Arg Arg Ile 
            260                 265                 270         


Gly Ala Val Ser Gln Thr Gly Gly Gly Ala Glu Met Leu Leu Trp Leu 
        275                 280                 285             


Lys Thr Pro Gly Glu Ser Asp Gly Asn Cys Gly Val Gly Ala Gly Ser 
    290                 295                 300                 


Val Ala Gly Gln Phe Leu Pro Glu Val Ala Tyr Lys Met Ile Tyr Gly 
305                 310                 315                 320 


Tyr 
    


<210> 225
<211> 2007
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 225
atgaattgca ccatgaaacc gatgacccgc gcggtcgccg gcggccttgc cgcgcttgcc     60

cttgccgcct gcggcagcag tgacagcgac agccccggat acacccagcc ggtgttcggc    120

aacacgacct acaatatagt caaggttgat ggctatacct tcaaggacat gaaccgcaac    180

ggcaagatcg acccttacga ggattggcgc ctgaccgccg aggcgcgcgc cgacgacctg    240

atcggtcgca tgtcgctcga tgaaaaagcc ggcctgatga tgcatggcac ggcgcccacg    300

gtggccgacc cgaccggcat tggccagggc agcggctatg acctgacggc cttgcaggag    360

ctgatcgtca agcagtacgt caacaccttc atcacccgca tggcgggcga ttcggccaat    420

atggcggcgc agtacaacaa ggtgcaggcg ctgagcgaaa cctcgcgcca cggcattccg    480

gtgtcgatct ccaccgaccc gcgccatcat ttccagtaca cggtgggcgc cagcgcgtcg    540

accaacggct tttcgcagtg gcctgaaacc ctgggcctgg cagcgattgg cgacgatgcg    600

ctggtgcgcc gcttcggcga catcgcgcgc caggaatacc tggccgtcgg catcacgcag    660

gcgctgtcgc cgcaggccga cctggccacc gagccgcgct ggtcgcgcat caacggcacg    720

tttggcgaag acgccgacct ggccaagcgc atggtgcaga actatatcga gggcttccag    780

gatggcaata ccggcctgca cgacggcagc gtggtggccg ttgtcaagca ctgggtcggc    840

tatggcgcga ccaaagaggg ttttgacggc cacaattact atggccgcta catgacctac    900

ccgggcaaca acttcgccta tcacgtgaaa ccgttcgaag gcgcgttcaa cgccaaggcc    960

gcctccgtga tgccgaccta cgccctaccg gacggcaata tcaccatcga cggcatcacg   1020

ctggaacagg tggcggccgg cttcagcaag accatgctga ccgatctgct gcgcggcaaa   1080

tacggcttcg aaggcgtgat cctgtccgac tggggcatca cctccgactg cgacgccaac   1140

tgccgcaacg gcacgcccgc cggcgtggcg ccctcgttta tcggtttcgg cacgccgtgg   1200

ggcatggaaa acgccaccaa ggccgagcgc tacgtgaaag ccgtcaatgc cggcatggac   1260

cagtttggcg gcgtgacgga agcgccttac ctgacgcagg cggtgcagcg cggccagctg   1320

acggaagcgc gcatcaacgc ctcggcacgc cgcatcctgg tacagaaatt caagcagggc   1380

ctgttcgagc atccgttcgt cgacacggca aaagccgccg ccacggtggg caaggccgac   1440

ttcgtcgaag cgggcctgga agcccagcgc cgttcgctgg tgctgctgga aaacaaggac   1500

aaggtgctgc cgctggccac caccgtcaag aaggtgtatt tgtacggcat cgacgcggcc   1560

gtcgcccgcc agtacggcta caccgtggtg gcgacgccgc aggaagccga cgtggcgctg   1620

ctgcgcgtgg ccgcgccgta tgaaatcctg cacccgaact acatcttcgg cagcatgcag   1680

cacgaaggcc gcctcaattt cgtcgacggc gacgccgact acgaggcgat caagaatgcc   1740

gcccgcctgg cgccgaaaac cgtggtcacc gtctacctgg accgcccggc catcctcggc   1800

aacgtgcagg acaaggccag cgccattatc ggcaacttcg gcgtcagcga cggcgcgctg   1860

ttcgacgtct tgaccggcaa ggccaagccg caaggcaagc tgccattcga gctgccgtcg   1920

tcgatggctg aagtgcaggt gcagaagtcc gacgtgccat atgacacggc gcggccgctg   1980

tacaagttcg gctacggtct ggcgtac                                       2007

<210> 226
<211> 669
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (151)...(382)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (493)...(669)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (41)...(44)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (241)...(244)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (338)...(341)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (361)...(378)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (410)...(413)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (452)...(455)
<223> N-glycosylation site. Prosite id = PS00001

<400> 226
Met Asn Cys Thr Met Lys Pro Met Thr Arg Ala Val Ala Gly Gly Leu 
1               5                   10                  15      


Ala Ala Leu Ala Leu Ala Ala Cys Gly Ser Ser Asp Ser Asp Ser Pro 
            20                  25                  30          


Gly Tyr Thr Gln Pro Val Phe Gly Asn Thr Thr Tyr Asn Ile Val Lys 
        35                  40                  45              


Val Asp Gly Tyr Thr Phe Lys Asp Met Asn Arg Asn Gly Lys Ile Asp 
    50                  55                  60                  


Pro Tyr Glu Asp Trp Arg Leu Thr Ala Glu Ala Arg Ala Asp Asp Leu 
65                  70                  75                  80  


Ile Gly Arg Met Ser Leu Asp Glu Lys Ala Gly Leu Met Met His Gly 
                85                  90                  95      


Thr Ala Pro Thr Val Ala Asp Pro Thr Gly Ile Gly Gln Gly Ser Gly 
            100                 105                 110         


Tyr Asp Leu Thr Ala Leu Gln Glu Leu Ile Val Lys Gln Tyr Val Asn 
        115                 120                 125             


Thr Phe Ile Thr Arg Met Ala Gly Asp Ser Ala Asn Met Ala Ala Gln 
    130                 135                 140                 


Tyr Asn Lys Val Gln Ala Leu Ser Glu Thr Ser Arg His Gly Ile Pro 
145                 150                 155                 160 


Val Ser Ile Ser Thr Asp Pro Arg His His Phe Gln Tyr Thr Val Gly 
                165                 170                 175     


Ala Ser Ala Ser Thr Asn Gly Phe Ser Gln Trp Pro Glu Thr Leu Gly 
            180                 185                 190         


Leu Ala Ala Ile Gly Asp Asp Ala Leu Val Arg Arg Phe Gly Asp Ile 
        195                 200                 205             


Ala Arg Gln Glu Tyr Leu Ala Val Gly Ile Thr Gln Ala Leu Ser Pro 
    210                 215                 220                 


Gln Ala Asp Leu Ala Thr Glu Pro Arg Trp Ser Arg Ile Asn Gly Thr 
225                 230                 235                 240 


Phe Gly Glu Asp Ala Asp Leu Ala Lys Arg Met Val Gln Asn Tyr Ile 
                245                 250                 255     


Glu Gly Phe Gln Asp Gly Asn Thr Gly Leu His Asp Gly Ser Val Val 
            260                 265                 270         


Ala Val Val Lys His Trp Val Gly Tyr Gly Ala Thr Lys Glu Gly Phe 
        275                 280                 285             


Asp Gly His Asn Tyr Tyr Gly Arg Tyr Met Thr Tyr Pro Gly Asn Asn 
    290                 295                 300                 


Phe Ala Tyr His Val Lys Pro Phe Glu Gly Ala Phe Asn Ala Lys Ala 
305                 310                 315                 320 


Ala Ser Val Met Pro Thr Tyr Ala Leu Pro Asp Gly Asn Ile Thr Ile 
                325                 330                 335     


Asp Gly Ile Thr Leu Glu Gln Val Ala Ala Gly Phe Ser Lys Thr Met 
            340                 345                 350         


Leu Thr Asp Leu Leu Arg Gly Lys Tyr Gly Phe Glu Gly Val Ile Leu 
        355                 360                 365             


Ser Asp Trp Gly Ile Thr Ser Asp Cys Asp Ala Asn Cys Arg Asn Gly 
    370                 375                 380                 


Thr Pro Ala Gly Val Ala Pro Ser Phe Ile Gly Phe Gly Thr Pro Trp 
385                 390                 395                 400 


Gly Met Glu Asn Ala Thr Lys Ala Glu Arg Tyr Val Lys Ala Val Asn 
                405                 410                 415     


Ala Gly Met Asp Gln Phe Gly Gly Val Thr Glu Ala Pro Tyr Leu Thr 
            420                 425                 430         


Gln Ala Val Gln Arg Gly Gln Leu Thr Glu Ala Arg Ile Asn Ala Ser 
        435                 440                 445             


Ala Arg Arg Ile Leu Val Gln Lys Phe Lys Gln Gly Leu Phe Glu His 
    450                 455                 460                 


Pro Phe Val Asp Thr Ala Lys Ala Ala Ala Thr Val Gly Lys Ala Asp 
465                 470                 475                 480 


Phe Val Glu Ala Gly Leu Glu Ala Gln Arg Arg Ser Leu Val Leu Leu 
                485                 490                 495     


Glu Asn Lys Asp Lys Val Leu Pro Leu Ala Thr Thr Val Lys Lys Val 
            500                 505                 510         


Tyr Leu Tyr Gly Ile Asp Ala Ala Val Ala Arg Gln Tyr Gly Tyr Thr 
        515                 520                 525             


Val Val Ala Thr Pro Gln Glu Ala Asp Val Ala Leu Leu Arg Val Ala 
    530                 535                 540                 


Ala Pro Tyr Glu Ile Leu His Pro Asn Tyr Ile Phe Gly Ser Met Gln 
545                 550                 555                 560 


His Glu Gly Arg Leu Asn Phe Val Asp Gly Asp Ala Asp Tyr Glu Ala 
                565                 570                 575     


Ile Lys Asn Ala Ala Arg Leu Ala Pro Lys Thr Val Val Thr Val Tyr 
            580                 585                 590         


Leu Asp Arg Pro Ala Ile Leu Gly Asn Val Gln Asp Lys Ala Ser Ala 
        595                 600                 605             


Ile Ile Gly Asn Phe Gly Val Ser Asp Gly Ala Leu Phe Asp Val Leu 
    610                 615                 620                 


Thr Gly Lys Ala Lys Pro Gln Gly Lys Leu Pro Phe Glu Leu Pro Ser 
625                 630                 635                 640 


Ser Met Ala Glu Val Gln Val Gln Lys Ser Asp Val Pro Tyr Asp Thr 
                645                 650                 655     


Ala Arg Pro Leu Tyr Lys Phe Gly Tyr Gly Leu Ala Tyr 
            660                 665                 


<210> 227
<211> 1314
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 227
atgaccgatc gtgatgtttc gcgccgtgcg ctgctgtccc tggcggcggt cgccgccgcg     60

accccggccg tagccgctgg ccaggccaag ccagccggca acatgcccaa ggacttcctg    120

tggggcgcgg ccaccgccgg tcaccaggtc gagggcaaca acgtcaacag cgacatctgg    180

ctgctggagc agttgaagcc cggcccgttc atggagccgt ccggcgacgc ctgcgaccac    240

tatcatcgct atgccgacga catcgccatg ttggcgaagc tgggcttcaa cacctatcgc    300

ttctcgctgg aatgggcgcg gatcgagccg gccaagggcc agttctcggc cgccgagctg    360

aaccactacc gccaggtcgc cgccacttgc cgcaagcatg gcgtgacgcc ggtggtcacc    420

ttcaaccact tcaccgtgcc gcgctggttc gcggcccagg gcggctggga gaacccggag    480

tcgccggcgc tgttcgcgcg ctattgcgac tacgcggtca agggcatcgg cgacctgatc    540

ggcgtggcgg cgaccttcaa cgagccgaac atcggcatgc tgctggtgtg gatgctgccg    600

ccgttcatcc tggaccagat gaagcaggcc atggccgacg cggccaaggc ctgcggcagc    660

gccaccttct ccagcgccca gttcggtcgc caggacgtca tgatgcccaa tctgatcgag    720

gctcaccgcc tgggttatgc ggcgatcaag gccggccccg gcgacttccc cgtcggcgcc    780

accgtggcga tgatggacga ccaggcggtc ggcccaggca gccgccgcga cgagaagcgg    840

gcccagtgct acacgccgtg gctcgacatg ctgaaggcca ccggcgactt cgtgggcgtg    900

cagacctaca gccgcgcccg cgtcgacgcc aagggccaga tgcacccgcc cgagggcgcc    960

gaactgaccc agaccggcga ggagttctgg ccccaggcgc tggagcagac catccgctac   1020

gcccacgccg ccacgggcaa gccgatctat gtcaccgaga acggcgtctc gaccgagaac   1080

gacgcccgcc gcgtcgccta catccagacc gctctgcagg gcgtgaaggc ctgcctggcc   1140

gacggcgtgc cggtgcgcgg ctacatccac tggtcgctga tggacaattt cgagtgggtg   1200

ttcggctacg ggcccaagtt cggcctggtc gccgtcgacc gcgccaccca ggtccgcacg   1260

cccaagccca gcgccgtgct gctgggctcc atcgcccgcg ccaaccggat ctga         1314

<210> 228
<211> 437
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (31)...(437)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (354)...(362)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 228
Met Thr Asp Arg Asp Val Ser Arg Arg Ala Leu Leu Ser Leu Ala Ala 
1               5                   10                  15      


Val Ala Ala Ala Thr Pro Ala Val Ala Ala Gly Gln Ala Lys Pro Ala 
            20                  25                  30          


Gly Asn Met Pro Lys Asp Phe Leu Trp Gly Ala Ala Thr Ala Gly His 
        35                  40                  45              


Gln Val Glu Gly Asn Asn Val Asn Ser Asp Ile Trp Leu Leu Glu Gln 
    50                  55                  60                  


Leu Lys Pro Gly Pro Phe Met Glu Pro Ser Gly Asp Ala Cys Asp His 
65                  70                  75                  80  


Tyr His Arg Tyr Ala Asp Asp Ile Ala Met Leu Ala Lys Leu Gly Phe 
                85                  90                  95      


Asn Thr Tyr Arg Phe Ser Leu Glu Trp Ala Arg Ile Glu Pro Ala Lys 
            100                 105                 110         


Gly Gln Phe Ser Ala Ala Glu Leu Asn His Tyr Arg Gln Val Ala Ala 
        115                 120                 125             


Thr Cys Arg Lys His Gly Val Thr Pro Val Val Thr Phe Asn His Phe 
    130                 135                 140                 


Thr Val Pro Arg Trp Phe Ala Ala Gln Gly Gly Trp Glu Asn Pro Glu 
145                 150                 155                 160 


Ser Pro Ala Leu Phe Ala Arg Tyr Cys Asp Tyr Ala Val Lys Gly Ile 
                165                 170                 175     


Gly Asp Leu Ile Gly Val Ala Ala Thr Phe Asn Glu Pro Asn Ile Gly 
            180                 185                 190         


Met Leu Leu Val Trp Met Leu Pro Pro Phe Ile Leu Asp Gln Met Lys 
        195                 200                 205             


Gln Ala Met Ala Asp Ala Ala Lys Ala Cys Gly Ser Ala Thr Phe Ser 
    210                 215                 220                 


Ser Ala Gln Phe Gly Arg Gln Asp Val Met Met Pro Asn Leu Ile Glu 
225                 230                 235                 240 


Ala His Arg Leu Gly Tyr Ala Ala Ile Lys Ala Gly Pro Gly Asp Phe 
                245                 250                 255     


Pro Val Gly Ala Thr Val Ala Met Met Asp Asp Gln Ala Val Gly Pro 
            260                 265                 270         


Gly Ser Arg Arg Asp Glu Lys Arg Ala Gln Cys Tyr Thr Pro Trp Leu 
        275                 280                 285             


Asp Met Leu Lys Ala Thr Gly Asp Phe Val Gly Val Gln Thr Tyr Ser 
    290                 295                 300                 


Arg Ala Arg Val Asp Ala Lys Gly Gln Met His Pro Pro Glu Gly Ala 
305                 310                 315                 320 


Glu Leu Thr Gln Thr Gly Glu Glu Phe Trp Pro Gln Ala Leu Glu Gln 
                325                 330                 335     


Thr Ile Arg Tyr Ala His Ala Ala Thr Gly Lys Pro Ile Tyr Val Thr 
            340                 345                 350         


Glu Asn Gly Val Ser Thr Glu Asn Asp Ala Arg Arg Val Ala Tyr Ile 
        355                 360                 365             


Gln Thr Ala Leu Gln Gly Val Lys Ala Cys Leu Ala Asp Gly Val Pro 
    370                 375                 380                 


Val Arg Gly Tyr Ile His Trp Ser Leu Met Asp Asn Phe Glu Trp Val 
385                 390                 395                 400 


Phe Gly Tyr Gly Pro Lys Phe Gly Leu Val Ala Val Asp Arg Ala Thr 
                405                 410                 415     


Gln Val Arg Thr Pro Lys Pro Ser Ala Val Leu Leu Gly Ser Ile Ala 
            420                 425                 430         


Arg Ala Asn Arg Ile 
        435         


<210> 229
<211> 1455
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 229
atgaatcgcc gcgaactgct cgcctccacc ctggccttca gtgccgcctc ggccctgcct     60

gccgccgccc gtgaaaaggc gttcgtgccg ggcaccttcc cgccgggctt cctgtggggc    120

gccgccaccg ccgcctacca ggtcgagggc gcctataacg aggatggcaa aggcgaatcc    180

gtctgggacc gcttcgtgct ggcgccgggc aagatcaagg aaggccacac cggcaacgtc    240

gcctgcgaca gctaccacaa atatgccgag gatatcgccc tgctgaaggc gatgaacctg    300

aagacctacc gcttctcgac cgcctggacg cgcatccagc ccgatggcac cggcccggcc    360

aatccgaagg ggctcgacta ctattcgcgc ctgaccgacg ccttgctgga ggccggcatt    420

cgtccgatgc cgacgctcta tcactgggat ctgccgcaga acctcgaaga cctcggtggc    480

tggcccaacc gcgacacggc ctggcgctat gccgactatg ccgatatcat ggtgcgagcg    540

ctgggtgacc gtatcgagaa ctggtcgctg ttcaacgaat ccaagacctt caccggcctg    600

ggctacaatg tcggcatctt cgcgccggga cggaaagacc cttacgcctt tatccgctcg    660

acccacacgg tcaacctggc gcacggcctg ggctacaaag cgatcaaggc ggccaatccg    720

aagctgaagg tcggcagcgt ctatgacgtc acgccgatga tcccggcctc gcaatccgag    780

gccgatgtcc gcgccgccga tatctgggac aagtggcaga acctgtggtt cgtcaacacc    840

accctgaccg ggcagtatcc ggaaggcgtc ttcccggtgg acaaacagga agccctgctc    900

ggctaccagg ccggcgacga tgcgatcatg aaggccgatc tcgatttcgt cgggctcaac    960

tactattccg gcacgctgtg ttcgtatggc aaggaggcca gcggcgtgcc ctttatcgac   1020

gtcaacgccc agtggcccta cgccacgccg gacaaggata tgggccggac cgacttcaac   1080

tgggcggtct atccgcaggg gttctacgat atcctgacgc gcatgcacaa ggtcaccggc   1140

gaccggccga tcgagatcac cgaaaacggc tgtgcctata atgtcggtcc cgacgctacc   1200

ggccagatcc acgatccgaa gcgcatcgat ttctacaagg gccacttcga ggcgatgtcg   1260

cgcgccatcc atgacggcgt gccggtgcgc ggctatcacg catggagcct gatggacaat   1320

ttcgagtggg cttcgggtta caccatgcgt ttcggcctga cctatgtcga tttcgacaac   1380

ggccagacgc gcacgatcaa ggattccggc aagtggtacg cccaggtcgc caaggccaat   1440

tcgagcaatc tgtaa                                                    1455

<210> 230
<211> 484
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (29)...(482)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (37)...(51)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (189)...(192)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (194)...(197)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (283)...(286)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (487)...(490)
<223> N-glycosylation site. Prosite id = PS00001

<400> 230
Met Asn Arg Arg Glu Leu Leu Ala Ser Thr Leu Ala Phe Ser Ala Ala 
1               5                   10                  15      


Ser Ala Leu Pro Ala Ala Ala Arg Glu Lys Ala Phe Val Pro Gly Thr 
            20                  25                  30          


Phe Pro Pro Gly Phe Leu Trp Gly Ala Ala Thr Ala Ala Tyr Gln Val 
        35                  40                  45              


Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Glu Ser Val Trp Asp Arg 
    50                  55                  60                  


Phe Val Leu Ala Pro Gly Lys Ile Lys Glu Gly His Thr Gly Asn Val 
65                  70                  75                  80  


Ala Cys Asp Ser Tyr His Lys Tyr Ala Glu Asp Ile Ala Leu Leu Lys 
                85                  90                  95      


Ala Met Asn Leu Lys Thr Tyr Arg Phe Ser Thr Ala Trp Thr Arg Ile 
            100                 105                 110         


Gln Pro Asp Gly Thr Gly Pro Ala Asn Pro Lys Gly Leu Asp Tyr Tyr 
        115                 120                 125             


Ser Arg Leu Thr Asp Ala Leu Leu Glu Ala Gly Ile Arg Pro Met Pro 
    130                 135                 140                 


Thr Leu Tyr His Trp Asp Leu Pro Gln Asn Leu Glu Asp Leu Gly Gly 
145                 150                 155                 160 


Trp Pro Asn Arg Asp Thr Ala Trp Arg Tyr Ala Asp Tyr Ala Asp Ile 
                165                 170                 175     


Met Val Arg Ala Leu Gly Asp Arg Ile Glu Asn Trp Ser Leu Phe Asn 
            180                 185                 190         


Glu Ser Lys Thr Phe Thr Gly Leu Gly Tyr Asn Val Gly Ile Phe Ala 
        195                 200                 205             


Pro Gly Arg Lys Asp Pro Tyr Ala Phe Ile Arg Ser Thr His Thr Val 
    210                 215                 220                 


Asn Leu Ala His Gly Leu Gly Tyr Lys Ala Ile Lys Ala Ala Asn Pro 
225                 230                 235                 240 


Lys Leu Lys Val Gly Ser Val Tyr Asp Val Thr Pro Met Ile Pro Ala 
                245                 250                 255     


Ser Gln Ser Glu Ala Asp Val Arg Ala Ala Asp Ile Trp Asp Lys Trp 
            260                 265                 270         


Gln Asn Leu Trp Phe Val Asn Thr Thr Leu Thr Gly Gln Tyr Pro Glu 
        275                 280                 285             


Gly Val Phe Pro Val Asp Lys Gln Glu Ala Leu Leu Gly Tyr Gln Ala 
    290                 295                 300                 


Gly Asp Asp Ala Ile Met Lys Ala Asp Leu Asp Phe Val Gly Leu Asn 
305                 310                 315                 320 


Tyr Tyr Ser Gly Thr Leu Cys Ser Tyr Gly Lys Glu Ala Ser Gly Val 
                325                 330                 335     


Pro Phe Ile Asp Val Asn Ala Gln Trp Pro Tyr Ala Thr Pro Asp Lys 
            340                 345                 350         


Asp Met Gly Arg Thr Asp Phe Asn Trp Ala Val Tyr Pro Gln Gly Phe 
        355                 360                 365             


Tyr Asp Ile Leu Thr Arg Met His Lys Val Thr Gly Asp Arg Pro Ile 
    370                 375                 380                 


Glu Ile Thr Glu Asn Gly Cys Ala Tyr Asn Val Gly Pro Asp Ala Thr 
385                 390                 395                 400 


Gly Gln Ile His Asp Pro Lys Arg Ile Asp Phe Tyr Lys Gly His Phe 
                405                 410                 415     


Glu Ala Met Ser Arg Ala Ile His Asp Gly Val Pro Val Arg Gly Tyr 
            420                 425                 430         


His Ala Trp Ser Leu Met Asp Asn Phe Glu Trp Ala Ser Gly Tyr Thr 
        435                 440                 445             


Met Arg Phe Gly Leu Thr Tyr Val Asp Phe Asp Asn Gly Gln Thr Arg 
    450                 455                 460                 


Thr Ile Lys Asp Ser Gly Lys Trp Tyr Ala Gln Val Ala Lys Ala Asn 
465                 470                 475                 480 


Ser Ser Asn Leu 
                


<210> 231
<211> 1128
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 231
atgaccgacc acaacgcttc cgaaaccagc ctgttcgaac agtgcggcta cagccgcgag     60

gccatccagg cccgcctgga gcgcaactgg tatgagatgt tcgaaggccc ggacaagatt    120

tactgggaga acgacgaagg cctggggtac gtgatggaca ccggcaacca cgacgtgcgc    180

accgagggca tgagctacgc gatgatgatc gccgtgcagt acggccgcaa ggacgtgttc    240

gacaagctgt ggggttgggt catgaaatac atgttcatga ccgagggcct gcaccagggc    300

tacttcgcct ggtctgtgga ccccagcggc gtaccgaacg ccgacggtcc ggccccggac    360

ggcgaggaat acttcgcgat ggacctgttc ctggcctccg cgcgatgggg cgacggcgaa    420

ggcgtgtacg agtactcccg ccacgcccgc tcgatcctcc acacctgcgt gcaccagggc    480

gaggacggtg aaggctatcc gatgtggaac ccggagaacc atctgatcaa gttcatcccg    540

gaaaccgaat ggaccgaccc gtcctaccat ctgccgcact tctacgaggt gttcgccgag    600

cgcgccgacg aggccgaccg tccgttctgg gcgcaggccg ccaaggcgag ccgcgagtac    660

ctggtcaccg cctgccaccc gcagaccggc atgaaccccg aatactcaaa ctatgatggc    720

acgccgcacg tcgacgagcg cgaccactgg catttctact ccgacgccta ccgcaccgcc    780

ggcaacatcg ggctggactg cctgtggaac ggcgtcgtgc cggaactgtg cgatgcgaat    840

gcgcgtctgc agcgtttctt cctcgaacac gaccgcacct gcgtgtatgc gatcgacggc    900

acgccggtgg acgagaccgt gctgcacccg gtcggcttca tcgccgccac cgccgaaggc    960

tcgctcgccg cgatgcactc gcaggagccg gacgcgctcg acaacgcgat ccgctgggtg   1020

cgcctgctgt gggacacccc gatccgcacc ggcacgcgcc gctactacga caacttcctc   1080

tacgccttcg cgttcctggc gctggcgggg gagtaccgca cctggtga                1128

<210> 232
<211> 375
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (22)...(370)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (5)...(8)
<223> N-glycosylation site. Prosite id = PS00001

<400> 232
Met Thr Asp His Asn Ala Ser Glu Thr Ser Leu Phe Glu Gln Cys Gly 
1               5                   10                  15      


Tyr Ser Arg Glu Ala Ile Gln Ala Arg Leu Glu Arg Asn Trp Tyr Glu 
            20                  25                  30          


Met Phe Glu Gly Pro Asp Lys Ile Tyr Trp Glu Asn Asp Glu Gly Leu 
        35                  40                  45              


Gly Tyr Val Met Asp Thr Gly Asn His Asp Val Arg Thr Glu Gly Met 
    50                  55                  60                  


Ser Tyr Ala Met Met Ile Ala Val Gln Tyr Gly Arg Lys Asp Val Phe 
65                  70                  75                  80  


Asp Lys Leu Trp Gly Trp Val Met Lys Tyr Met Phe Met Thr Glu Gly 
                85                  90                  95      


Leu His Gln Gly Tyr Phe Ala Trp Ser Val Asp Pro Ser Gly Val Pro 
            100                 105                 110         


Asn Ala Asp Gly Pro Ala Pro Asp Gly Glu Glu Tyr Phe Ala Met Asp 
        115                 120                 125             


Leu Phe Leu Ala Ser Ala Arg Trp Gly Asp Gly Glu Gly Val Tyr Glu 
    130                 135                 140                 


Tyr Ser Arg His Ala Arg Ser Ile Leu His Thr Cys Val His Gln Gly 
145                 150                 155                 160 


Glu Asp Gly Glu Gly Tyr Pro Met Trp Asn Pro Glu Asn His Leu Ile 
                165                 170                 175     


Lys Phe Ile Pro Glu Thr Glu Trp Thr Asp Pro Ser Tyr His Leu Pro 
            180                 185                 190         


His Phe Tyr Glu Val Phe Ala Glu Arg Ala Asp Glu Ala Asp Arg Pro 
        195                 200                 205             


Phe Trp Ala Gln Ala Ala Lys Ala Ser Arg Glu Tyr Leu Val Thr Ala 
    210                 215                 220                 


Cys His Pro Gln Thr Gly Met Asn Pro Glu Tyr Ser Asn Tyr Asp Gly 
225                 230                 235                 240 


Thr Pro His Val Asp Glu Arg Asp His Trp His Phe Tyr Ser Asp Ala 
                245                 250                 255     


Tyr Arg Thr Ala Gly Asn Ile Gly Leu Asp Cys Leu Trp Asn Gly Val 
            260                 265                 270         


Val Pro Glu Leu Cys Asp Ala Asn Ala Arg Leu Gln Arg Phe Phe Leu 
        275                 280                 285             


Glu His Asp Arg Thr Cys Val Tyr Ala Ile Asp Gly Thr Pro Val Asp 
    290                 295                 300                 


Glu Thr Val Leu His Pro Val Gly Phe Ile Ala Ala Thr Ala Glu Gly 
305                 310                 315                 320 


Ser Leu Ala Ala Met His Ser Gln Glu Pro Asp Ala Leu Asp Asn Ala 
                325                 330                 335     


Ile Arg Trp Val Arg Leu Leu Trp Asp Thr Pro Ile Arg Thr Gly Thr 
            340                 345                 350         


Arg Arg Tyr Tyr Asp Asn Phe Leu Tyr Ala Phe Ala Phe Leu Ala Leu 
        355                 360                 365             


Ala Gly Glu Tyr Arg Thr Trp 
    370                 375 


<210> 233
<211> 993
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 233
gtgtttgcta ctaccgaacc cgccaagaaa ctgctggctt atctatggtc gcagtatggc     60

agcaagacca tctcgagcgt catggccgag gtgaactgga atcatcgtct ggcagactat    120

gtgaaaggcg tgaccggtaa atatccggcc atgaactgct atgacttcat ccagatctat    180

gttcccgaga acaactggat caactacaac gatataacgc ctgtcaggga gtggttcgac    240

gccggtggta tcgtccagct catgtggcac ttcaacgtgc ccctcaccga gagcaccgtg    300

cccggtagcg atggctcagg cgtaacctgt acgcctagtc agaccacttt caaggcatcc    360

aatgccctga ccagcggcac gtgggaaaac cagtggttct atggccagat ggacaaggtg    420

attgacgtct tgctcaagtt gcaggatgca ggcatcgccg ccatgtggcg ccccttccac    480

gagggcgcag gcaatgcgtg tgccaagcaa caggcctcct ggaccacggc ttggttctgg    540

tggggatatg acggcgccga tacctacaag aagttgtggg ttgccatgtt tgactacttc    600

aagcagaagg gcgtgaaaaa cctcatttgg gtatggacca ctcagaatta caacggcgac    660

agctctaaat acaaccagga caccgattgg tatccaggca acgaatatgt cgatatcatt    720

gcccgcgacc tgtacggcta caacgccgcc cagaacaagc aggagtttga tgagatccag    780

gctacctata gcggcaagat ggtgatcctg ggcgagtgtg gagctgacgg taacaccagc    840

ttcgccaaca tcgccgacgt ctggggtgcc ggagccaagt ggggttcgtt catggtgtgg    900

tatggcatca acttttgcag cgacgcctgg tggaagaacg ccatgagcaa tgcctatgtc    960

attacccgcg agcagctgcc cgacctcaag taa                                 993

<210> 234
<211> 330
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (5)...(325)
<223> Glycosyl hydrolase family 26

<220> 
<221> SITE
<222> (282)...(285)
<223> N-glycosylation site. Prosite id = PS00001

<400> 234
Met Phe Ala Thr Thr Glu Pro Ala Lys Lys Leu Leu Ala Tyr Leu Trp 
1               5                   10                  15      


Ser Gln Tyr Gly Ser Lys Thr Ile Ser Ser Val Met Ala Glu Val Asn 
            20                  25                  30          


Trp Asn His Arg Leu Ala Asp Tyr Val Lys Gly Val Thr Gly Lys Tyr 
        35                  40                  45              


Pro Ala Met Asn Cys Tyr Asp Phe Ile Gln Ile Tyr Val Pro Glu Asn 
    50                  55                  60                  


Asn Trp Ile Asn Tyr Asn Asp Ile Thr Pro Val Arg Glu Trp Phe Asp 
65                  70                  75                  80  


Ala Gly Gly Ile Val Gln Leu Met Trp His Phe Asn Val Pro Leu Thr 
                85                  90                  95      


Glu Ser Thr Val Pro Gly Ser Asp Gly Ser Gly Val Thr Cys Thr Pro 
            100                 105                 110         


Ser Gln Thr Thr Phe Lys Ala Ser Asn Ala Leu Thr Ser Gly Thr Trp 
        115                 120                 125             


Glu Asn Gln Trp Phe Tyr Gly Gln Met Asp Lys Val Ile Asp Val Leu 
    130                 135                 140                 


Leu Lys Leu Gln Asp Ala Gly Ile Ala Ala Met Trp Arg Pro Phe His 
145                 150                 155                 160 


Glu Gly Ala Gly Asn Ala Cys Ala Lys Gln Gln Ala Ser Trp Thr Thr 
                165                 170                 175     


Ala Trp Phe Trp Trp Gly Tyr Asp Gly Ala Asp Thr Tyr Lys Lys Leu 
            180                 185                 190         


Trp Val Ala Met Phe Asp Tyr Phe Lys Gln Lys Gly Val Lys Asn Leu 
        195                 200                 205             


Ile Trp Val Trp Thr Thr Gln Asn Tyr Asn Gly Asp Ser Ser Lys Tyr 
    210                 215                 220                 


Asn Gln Asp Thr Asp Trp Tyr Pro Gly Asn Glu Tyr Val Asp Ile Ile 
225                 230                 235                 240 


Ala Arg Asp Leu Tyr Gly Tyr Asn Ala Ala Gln Asn Lys Gln Glu Phe 
                245                 250                 255     


Asp Glu Ile Gln Ala Thr Tyr Ser Gly Lys Met Val Ile Leu Gly Glu 
            260                 265                 270         


Cys Gly Ala Asp Gly Asn Thr Ser Phe Ala Asn Ile Ala Asp Val Trp 
        275                 280                 285             


Gly Ala Gly Ala Lys Trp Gly Ser Phe Met Val Trp Tyr Gly Ile Asn 
    290                 295                 300                 


Phe Cys Ser Asp Ala Trp Trp Lys Asn Ala Met Ser Asn Ala Tyr Val 
305                 310                 315                 320 


Ile Thr Arg Glu Gln Leu Pro Asp Leu Lys 
                325                 330 


<210> 235
<211> 999
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 235
atgagacctg tcatccttgc tgccatcacc atggctttat ccctctttgt ctcctgctca     60

tccggcgaag gctgggtaaa ggtggaagga aataaattca tagaccctca gggaaaggaa    120

ctggtgttcc gcggcctctg cttctcggat cctgtgaaac tggtccgtga cgggcagtgg    180

aatgagcgtt atttcgcgga ggcggcggcc tggggagcca atgtggtccg tttcgccgtc    240

catcccacca acctgaactc catgggctgg gaagagacct tccaggccat ggaccagggc    300

attgcgtggg ccaaacagca tgggatgtat gttatcatgg actggcacac catcggcaac    360

ctgaaggagg agaagtttac ttctcccatg tacaggacca cccgggagga gacgttcaag    420

ttctggcgca ccgtagccag gcgttacaag gacgagccgg cggtggcgct gtacgaactc    480

ttcaacgaac ccaccgtcac cgcagagggg gttggttcct gcacctggac cgaatggaaa    540

gaactgcagg agcagatcat agataccgtg cgcacatata acccccgggc cgtgtgcctt    600

tgcgccggat tcaattgggc gtacgacctt acgccggtgg cttcggagcc catagcccgt    660

cccaacgtgg cctatgtatc ccacccgtat cccatgaaac gggaacagcc ctgggaagga    720

cagtgggaga aagacttcgg ctatgtggcc gatacttacc ccgttatctg cacggagatt    780

ggattctgcc tgccggatga acccggtgcc catatcccgg tgatgtccac ggaggtgtat    840

gggcagcaca taacccaata ctttgaaaag aagggcatct ccttcacggt atggtgcttt    900

gataccagct gggcgcccac gctaatcagt gactgggact ttacgcccac cacccagggc    960

cgcttcttca aagactatct gcagaataag cagccctag                           999

<210> 236
<211> 332
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(21)

<220> 
<221> DOMAIN
<222> (31)...(308)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 236
Met Arg Pro Val Ile Leu Ala Ala Ile Thr Met Ala Leu Ser Leu Phe 
1               5                   10                  15      


Val Ser Cys Ser Ser Gly Glu Gly Trp Val Lys Val Glu Gly Asn Lys 
            20                  25                  30          


Phe Ile Asp Pro Gln Gly Lys Glu Leu Val Phe Arg Gly Leu Cys Phe 
        35                  40                  45              


Ser Asp Pro Val Lys Leu Val Arg Asp Gly Gln Trp Asn Glu Arg Tyr 
    50                  55                  60                  


Phe Ala Glu Ala Ala Ala Trp Gly Ala Asn Val Val Arg Phe Ala Val 
65                  70                  75                  80  


His Pro Thr Asn Leu Asn Ser Met Gly Trp Glu Glu Thr Phe Gln Ala 
                85                  90                  95      


Met Asp Gln Gly Ile Ala Trp Ala Lys Gln His Gly Met Tyr Val Ile 
            100                 105                 110         


Met Asp Trp His Thr Ile Gly Asn Leu Lys Glu Glu Lys Phe Thr Ser 
        115                 120                 125             


Pro Met Tyr Arg Thr Thr Arg Glu Glu Thr Phe Lys Phe Trp Arg Thr 
    130                 135                 140                 


Val Ala Arg Arg Tyr Lys Asp Glu Pro Ala Val Ala Leu Tyr Glu Leu 
145                 150                 155                 160 


Phe Asn Glu Pro Thr Val Thr Ala Glu Gly Val Gly Ser Cys Thr Trp 
                165                 170                 175     


Thr Glu Trp Lys Glu Leu Gln Glu Gln Ile Ile Asp Thr Val Arg Thr 
            180                 185                 190         


Tyr Asn Pro Arg Ala Val Cys Leu Cys Ala Gly Phe Asn Trp Ala Tyr 
        195                 200                 205             


Asp Leu Thr Pro Val Ala Ser Glu Pro Ile Ala Arg Pro Asn Val Ala 
    210                 215                 220                 


Tyr Val Ser His Pro Tyr Pro Met Lys Arg Glu Gln Pro Trp Glu Gly 
225                 230                 235                 240 


Gln Trp Glu Lys Asp Phe Gly Tyr Val Ala Asp Thr Tyr Pro Val Ile 
                245                 250                 255     


Cys Thr Glu Ile Gly Phe Cys Leu Pro Asp Glu Pro Gly Ala His Ile 
            260                 265                 270         


Pro Val Met Ser Thr Glu Val Tyr Gly Gln His Ile Thr Gln Tyr Phe 
        275                 280                 285             


Glu Lys Lys Gly Ile Ser Phe Thr Val Trp Cys Phe Asp Thr Ser Trp 
    290                 295                 300                 


Ala Pro Thr Leu Ile Ser Asp Trp Asp Phe Thr Pro Thr Thr Gln Gly 
305                 310                 315                 320 


Arg Phe Phe Lys Asp Tyr Leu Gln Asn Lys Gln Pro 
                325                 330         


<210> 237
<211> 2712
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 237
atggcgacag ctagagcacg agcagatata tctaccacac cagtcacagc ctcgacagat     60

gctgccaaga acctgtatgc ctatttcctg gaccagtatg gcaagaagac gatttccagc    120

gtcatggcca atgtcaactg gaacaacact tgtgccgaga aagtctataa actcacgggc    180

aagtatcctg ccatgaactg ctacgacttc atccacatct gtttctcgcc agccaactgg    240

attgactaca ccgacatcac tcctgccaag gaatggcacg atgcgggcgg tatcgtacag    300

ttgatgtggc atttcaatgt gcctaagagc cagggggcaa cagatgttac ctgcacgccc    360

agcgagacca cctttaaggc ttccaatgct ctggttagcg gcacgtggga gaacaaatgg    420

ttctacgagc agatggacaa ggtcattgcc accatcctca agttacagga cgctggcatt    480

gccgctacct ggcgaccttt ccatgaggca gcaggcaatg cttgcgccaa gcagcaggcc    540

gactggacca aagcatggtt ctggtggggc tacgacggtg ccgacaccta caagaaactg    600

tggattgcca tgtacgacta tttcaagctg aaaggcgtga acaacctcat ctggatgtgg    660

accacccaga attataatgg tgacagcagc aaatacaacc aggacaccga ctggtaccct    720

ggcgacgagt atgttgacat cgtggcccgc gacctctatg gctacaatgc cgaccagaac    780

ctgcaggagt tcagcgagat tcaggctgcc tatcccaaca agatggtggt tctgggtgaa    840

tgcggaaaag gtgatagcgg cgaccccggc aagatgtccg atgtatgggc gaaaggtgcc    900

aagtggggcc acttcatggt atggtatcaa ggcgaacaag gctctaccga cacgatgtgc    960

agcgacgact ggtggaagga tgccatgagc agcgccaacg tcatcacccg cgacaaggtg   1020

gttatccccg atgtcacttc aaccatcgag aatgccacgg atgccgtgaa gaacatggga   1080

ctggggtgga acctggggaa cgccctcgac gccaatgccc agcaatacca tgatgccacc   1140

caggacaact actggggaca gcaggacatt acctctgaga gctgctgggg tcagctaccc   1200

accaaggcag agctgatggc catgatgaaa gaagccggtt tcggagccat ccgcgttccc   1260

gtgacatggt ataaccacat ggacaaggac ggcaatgtgg atgcagcatg gatgaatcgt   1320

gtgcatgagg tggttgacta tgtcatcagc cagggaatgt actgcatcct caacgtacac   1380

cacgacacgg gtgccgacag ctacgacagc cagaagaacc tcaccggcta ccattggatc   1440

aaggccgacg aaaccaacta cgccaccaac aaggcccgct atgagaagct gtggcagcag   1500

atagcccagg agttccgcaa ctacggccag ctgctgctgt tcgagggcta taacgagatg   1560

ctcgatgcca acaactcctg gaattttgca cagagcagtt cagcctacga tgccatcaac   1620

aaatacgccc agagctttgt cgatgtcgta cgcgccaccg gtggcaacaa tgcccagcgc   1680

aacctcattg tcagcacata cggcgcctgc tcaggcaacg gcacgtggga tgcaagagtg   1740

caagacccct tgaagaaact gcagattccc acgggtgaaa gcaaccatat catcttcgag   1800

gttcacaact atccctccat cgtcaacaag gacaacgcgg gcaactacgt cagcgatcgc   1860

accatcagcg aaatcaaggc agagattgat gcatggctta agaacttaaa gacccacctc   1920

gtcagcaagg gcgctcccgt catcatcggc gaatggggca ccaacaacgt cgatgccggc   1980

ggtggcaaga cagactacga cctccataag gacctgatgt tcgaatttgt cagctacatg   2040

ataaagacca tgaagcagaa cgacattgcc accttctact ggatgggact taccgacggc   2100

gctccacgca cctaccccgc cttcacacag cccgacctgg cgctgaagat gctgcaggcc   2160

tatcacggcg actcttggaa tccctacctg cctgacgcca aggactttcc cgaaggcaaa   2220

atcacctcgg ccacggtgaa tttcaacagc caatggggcg aactgaccat ccacgatgga   2280

gctattgaca agaccgtcta tagaggtatc aaggtggagc tggaagaaaa gcctgccact   2340

ggagccctgt ctttcaaggt atatgccaac agtgagaagg caacagccat caattccaaa   2400

accccacagt tggctttctt cagttacaca ggcatccaga aaatcaacct acagtggaac   2460

atagccacca aggggagtat caaaatcaag agcgtcaacc ttatcaagca cgacgactcc   2520

acagaaccct gtagtctgaa agtggcttgg ggttgtactc tcagcgacca gaactacgcc   2580

acgggcatcg aagacattac tatcactcct gttcgtcatg acgatggaat catctacaat   2640

ctgagcggac agcctgtaac ctctcctcag cgcggcatct acatcctcaa cggaaagaaa   2700

atcatcaaat ag                                                       2712

<210> 238
<211> 903
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (15)...(339)
<223> Glycosyl hydrolase family 26

<220> 
<221> DOMAIN
<222> (366)...(705)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (48)...(51)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (356)...(359)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (480)...(483)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (532)...(535)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (581)...(584)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (893)...(896)
<223> N-glycosylation site. Prosite id = PS00001

<400> 238
Met Ala Thr Ala Arg Ala Arg Ala Asp Ile Ser Thr Thr Pro Val Thr 
1               5                   10                  15      


Ala Ser Thr Asp Ala Ala Lys Asn Leu Tyr Ala Tyr Phe Leu Asp Gln 
            20                  25                  30          


Tyr Gly Lys Lys Thr Ile Ser Ser Val Met Ala Asn Val Asn Trp Asn 
        35                  40                  45              


Asn Thr Cys Ala Glu Lys Val Tyr Lys Leu Thr Gly Lys Tyr Pro Ala 
    50                  55                  60                  


Met Asn Cys Tyr Asp Phe Ile His Ile Cys Phe Ser Pro Ala Asn Trp 
65                  70                  75                  80  


Ile Asp Tyr Thr Asp Ile Thr Pro Ala Lys Glu Trp His Asp Ala Gly 
                85                  90                  95      


Gly Ile Val Gln Leu Met Trp His Phe Asn Val Pro Lys Ser Gln Gly 
            100                 105                 110         


Ala Thr Asp Val Thr Cys Thr Pro Ser Glu Thr Thr Phe Lys Ala Ser 
        115                 120                 125             


Asn Ala Leu Val Ser Gly Thr Trp Glu Asn Lys Trp Phe Tyr Glu Gln 
    130                 135                 140                 


Met Asp Lys Val Ile Ala Thr Ile Leu Lys Leu Gln Asp Ala Gly Ile 
145                 150                 155                 160 


Ala Ala Thr Trp Arg Pro Phe His Glu Ala Ala Gly Asn Ala Cys Ala 
                165                 170                 175     


Lys Gln Gln Ala Asp Trp Thr Lys Ala Trp Phe Trp Trp Gly Tyr Asp 
            180                 185                 190         


Gly Ala Asp Thr Tyr Lys Lys Leu Trp Ile Ala Met Tyr Asp Tyr Phe 
        195                 200                 205             


Lys Leu Lys Gly Val Asn Asn Leu Ile Trp Met Trp Thr Thr Gln Asn 
    210                 215                 220                 


Tyr Asn Gly Asp Ser Ser Lys Tyr Asn Gln Asp Thr Asp Trp Tyr Pro 
225                 230                 235                 240 


Gly Asp Glu Tyr Val Asp Ile Val Ala Arg Asp Leu Tyr Gly Tyr Asn 
                245                 250                 255     


Ala Asp Gln Asn Leu Gln Glu Phe Ser Glu Ile Gln Ala Ala Tyr Pro 
            260                 265                 270         


Asn Lys Met Val Val Leu Gly Glu Cys Gly Lys Gly Asp Ser Gly Asp 
        275                 280                 285             


Pro Gly Lys Met Ser Asp Val Trp Ala Lys Gly Ala Lys Trp Gly His 
    290                 295                 300                 


Phe Met Val Trp Tyr Gln Gly Glu Gln Gly Ser Thr Asp Thr Met Cys 
305                 310                 315                 320 


Ser Asp Asp Trp Trp Lys Asp Ala Met Ser Ser Ala Asn Val Ile Thr 
                325                 330                 335     


Arg Asp Lys Val Val Ile Pro Asp Val Thr Ser Thr Ile Glu Asn Ala 
            340                 345                 350         


Thr Asp Ala Val Lys Asn Met Gly Leu Gly Trp Asn Leu Gly Asn Ala 
        355                 360                 365             


Leu Asp Ala Asn Ala Gln Gln Tyr His Asp Ala Thr Gln Asp Asn Tyr 
    370                 375                 380                 


Trp Gly Gln Gln Asp Ile Thr Ser Glu Ser Cys Trp Gly Gln Leu Pro 
385                 390                 395                 400 


Thr Lys Ala Glu Leu Met Ala Met Met Lys Glu Ala Gly Phe Gly Ala 
                405                 410                 415     


Ile Arg Val Pro Val Thr Trp Tyr Asn His Met Asp Lys Asp Gly Asn 
            420                 425                 430         


Val Asp Ala Ala Trp Met Asn Arg Val His Glu Val Val Asp Tyr Val 
        435                 440                 445             


Ile Ser Gln Gly Met Tyr Cys Ile Leu Asn Val His His Asp Thr Gly 
    450                 455                 460                 


Ala Asp Ser Tyr Asp Ser Gln Lys Asn Leu Thr Gly Tyr His Trp Ile 
465                 470                 475                 480 


Lys Ala Asp Glu Thr Asn Tyr Ala Thr Asn Lys Ala Arg Tyr Glu Lys 
                485                 490                 495     


Leu Trp Gln Gln Ile Ala Gln Glu Phe Arg Asn Tyr Gly Gln Leu Leu 
            500                 505                 510         


Leu Phe Glu Gly Tyr Asn Glu Met Leu Asp Ala Asn Asn Ser Trp Asn 
        515                 520                 525             


Phe Ala Gln Ser Ser Ser Ala Tyr Asp Ala Ile Asn Lys Tyr Ala Gln 
    530                 535                 540                 


Ser Phe Val Asp Val Val Arg Ala Thr Gly Gly Asn Asn Ala Gln Arg 
545                 550                 555                 560 


Asn Leu Ile Val Ser Thr Tyr Gly Ala Cys Ser Gly Asn Gly Thr Trp 
                565                 570                 575     


Asp Ala Arg Val Gln Asp Pro Leu Lys Lys Leu Gln Ile Pro Thr Gly 
            580                 585                 590         


Glu Ser Asn His Ile Ile Phe Glu Val His Asn Tyr Pro Ser Ile Val 
        595                 600                 605             


Asn Lys Asp Asn Ala Gly Asn Tyr Val Ser Asp Arg Thr Ile Ser Glu 
    610                 615                 620                 


Ile Lys Ala Glu Ile Asp Ala Trp Leu Lys Asn Leu Lys Thr His Leu 
625                 630                 635                 640 


Val Ser Lys Gly Ala Pro Val Ile Ile Gly Glu Trp Gly Thr Asn Asn 
                645                 650                 655     


Val Asp Ala Gly Gly Gly Lys Thr Asp Tyr Asp Leu His Lys Asp Leu 
            660                 665                 670         


Met Phe Glu Phe Val Ser Tyr Met Ile Lys Thr Met Lys Gln Asn Asp 
        675                 680                 685             


Ile Ala Thr Phe Tyr Trp Met Gly Leu Thr Asp Gly Ala Pro Arg Thr 
    690                 695                 700                 


Tyr Pro Ala Phe Thr Gln Pro Asp Leu Ala Leu Lys Met Leu Gln Ala 
705                 710                 715                 720 


Tyr His Gly Asp Ser Trp Asn Pro Tyr Leu Pro Asp Ala Lys Asp Phe 
                725                 730                 735     


Pro Glu Gly Lys Ile Thr Ser Ala Thr Val Asn Phe Asn Ser Gln Trp 
            740                 745                 750         


Gly Glu Leu Thr Ile His Asp Gly Ala Ile Asp Lys Thr Val Tyr Arg 
        755                 760                 765             


Gly Ile Lys Val Glu Leu Glu Glu Lys Pro Ala Thr Gly Ala Leu Ser 
    770                 775                 780                 


Phe Lys Val Tyr Ala Asn Ser Glu Lys Ala Thr Ala Ile Asn Ser Lys 
785                 790                 795                 800 


Thr Pro Gln Leu Ala Phe Phe Ser Tyr Thr Gly Ile Gln Lys Ile Asn 
                805                 810                 815     


Leu Gln Trp Asn Ile Ala Thr Lys Gly Ser Ile Lys Ile Lys Ser Val 
            820                 825                 830         


Asn Leu Ile Lys His Asp Asp Ser Thr Glu Pro Cys Ser Leu Lys Val 
        835                 840                 845             


Ala Trp Gly Cys Thr Leu Ser Asp Gln Asn Tyr Ala Thr Gly Ile Glu 
    850                 855                 860                 


Asp Ile Thr Ile Thr Pro Val Arg His Asp Asp Gly Ile Ile Tyr Asn 
865                 870                 875                 880 


Leu Ser Gly Gln Pro Val Thr Ser Pro Gln Arg Gly Ile Tyr Ile Leu 
                885                 890                 895     


Asn Gly Lys Lys Ile Ile Lys 
            900             


<210> 239
<211> 1035
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 239
atgagagata taagtcctgc agagctggtt gccgagatga caaccggatg gaatcttgga     60

aatacctttg atgcatatgg aaaaggcggt cttgatgatg agacaggctg gggaaatccc    120

tatactacta aggaaatgat tgatgtagtc tgtgaaaagg ggtttaattc tatcagaatc    180

ccaataacct gggctgatca tatgggtgct gctcctgact atacagtaga tgaggactgg    240

atgaaccgtg tagaagaggt tgtaaattat gctcttgatg acgggatgta tgtcattatc    300

aattcccacc acgaagagtc ctggagaatc cctgatgatg cacacattga tgcagtagat    360

gaacaggttg gaaagctctg ggtccagata gctgagaggt tcagggatta tggcgaccat    420

cttatttttg aggggctaaa tgagccacgt gttaagggcg gtgaaaatga gtggaatggc    480

ggaacgaccg aaggacgtaa atgcctggac agacttaatc agacttttgt agattcagta    540

agatcaacag gtggaaataa tgaaaaaaga cttgtactta taacaagctt tgcatcctca    600

cacgtaatac agacaatagg aagccttaaa attccaagcg acgatcacct tgctgtttca    660

atccatgcct atacgcctta tgattttaca tatgcctccg gcacctcctc tgagctttta    720

acctgggatg gttccagaaa aagtgatatt gcttctgtta ttggtgatgt aaaaagaatc    780

tttatagaca agggtattcc tgtccttatg acagaatatg gtgcagttga taaagatggc    840

aactccggtg atgtaagcgc ctgggtaact gagtatttaa cacgcgcaaa aaaagccggt    900

atcccatgct tttggtggga caatggcctg tatgaatcag gtgatgaaca ttttgctata    960

ttcaaccgca atgacctgac ctggtacaga gaagacgtcg ttgatgccat tatggctgtc   1020

tactatgccc aataa                                                    1035

<210> 240
<211> 344
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (20)...(316)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (1)...(4)
<223> Tubulin-beta mRNA autoregulation signal. Prosite id = PS00228

<220> 
<221> SITE
<222> (143)...(152)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (175)...(178)
<223> N-glycosylation site. Prosite id = PS00001

<400> 240
Met Arg Asp Ile Ser Pro Ala Glu Leu Val Ala Glu Met Thr Thr Gly 
1               5                   10                  15      


Trp Asn Leu Gly Asn Thr Phe Asp Ala Tyr Gly Lys Gly Gly Leu Asp 
            20                  25                  30          


Asp Glu Thr Gly Trp Gly Asn Pro Tyr Thr Thr Lys Glu Met Ile Asp 
        35                  40                  45              


Val Val Cys Glu Lys Gly Phe Asn Ser Ile Arg Ile Pro Ile Thr Trp 
    50                  55                  60                  


Ala Asp His Met Gly Ala Ala Pro Asp Tyr Thr Val Asp Glu Asp Trp 
65                  70                  75                  80  


Met Asn Arg Val Glu Glu Val Val Asn Tyr Ala Leu Asp Asp Gly Met 
                85                  90                  95      


Tyr Val Ile Ile Asn Ser His His Glu Glu Ser Trp Arg Ile Pro Asp 
            100                 105                 110         


Asp Ala His Ile Asp Ala Val Asp Glu Gln Val Gly Lys Leu Trp Val 
        115                 120                 125             


Gln Ile Ala Glu Arg Phe Arg Asp Tyr Gly Asp His Leu Ile Phe Glu 
    130                 135                 140                 


Gly Leu Asn Glu Pro Arg Val Lys Gly Gly Glu Asn Glu Trp Asn Gly 
145                 150                 155                 160 


Gly Thr Thr Glu Gly Arg Lys Cys Leu Asp Arg Leu Asn Gln Thr Phe 
                165                 170                 175     


Val Asp Ser Val Arg Ser Thr Gly Gly Asn Asn Glu Lys Arg Leu Val 
            180                 185                 190         


Leu Ile Thr Ser Phe Ala Ser Ser His Val Ile Gln Thr Ile Gly Ser 
        195                 200                 205             


Leu Lys Ile Pro Ser Asp Asp His Leu Ala Val Ser Ile His Ala Tyr 
    210                 215                 220                 


Thr Pro Tyr Asp Phe Thr Tyr Ala Ser Gly Thr Ser Ser Glu Leu Leu 
225                 230                 235                 240 


Thr Trp Asp Gly Ser Arg Lys Ser Asp Ile Ala Ser Val Ile Gly Asp 
                245                 250                 255     


Val Lys Arg Ile Phe Ile Asp Lys Gly Ile Pro Val Leu Met Thr Glu 
            260                 265                 270         


Tyr Gly Ala Val Asp Lys Asp Gly Asn Ser Gly Asp Val Ser Ala Trp 
        275                 280                 285             


Val Thr Glu Tyr Leu Thr Arg Ala Lys Lys Ala Gly Ile Pro Cys Phe 
    290                 295                 300                 


Trp Trp Asp Asn Gly Leu Tyr Glu Ser Gly Asp Glu His Phe Ala Ile 
305                 310                 315                 320 


Phe Asn Arg Asn Asp Leu Thr Trp Tyr Arg Glu Asp Val Val Asp Ala 
                325                 330                 335     


Ile Met Ala Val Tyr Tyr Ala Gln 
            340                 


<210> 241
<211> 1107
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 241
atgaatgtgt tgcgtagtgg actcgtgacg atgctgctgc tggctgcctt tagtgttcag     60

gcagcctgta cctggcctgc ctgggagcag tttaaaaagg attacatcag tcaggaaggg    120

cgcgtcatcg accccagcga cgcgcgcaaa atcaccacct ccgaagggca aagttacggc    180

atgttctttg ccctggcggc taacgaccgt gcagctttcg ataatcttct cgactggacg    240

cagaacaatc tcgctcaggg ttctttaaaa gaacatttgc ccgcctggct gtggggcaag    300

aaagagaaca gtaagtggga agtgctggac agcaattcgg cctccgatgg tgatgtctgg    360

atggcctggt cgttgctgga ggcggggcgt ttgtggaaag agcagcgtta taccgacatc    420

ggcagcgcat tgctaaaacg tatcgcgcgg gaggaagtgg tgacggtgcc tgggctgggc    480

tccatgttgt taccgggcaa agtgggtttt gctgaggata acagctggcg ttttaacccc    540

agctacctgc cgccgacgct ggcgcagtat ttcacccgct ttggcgcgcc gtggaccacg    600

ctgcgcgaaa ccaatcaacg tttattgctg gaaaccgccc cgaaaggctt ttcgccagac    660

tgggtgcgct atgagaaaga caaaggctgg cagctaaaag ccgaaaaaac attgatcagc    720

agctacgacg ctatccgcgt ttacatgtgg gtaggcatga tgcctgacag cgatccgcaa    780

aaagcgcgga tgctcaaccg gtttaaaccg atggcgacat tcactgagaa aaacggttat    840

ccgccggaaa aagtggatgt ggctacgggg aaagcgcagg gtaaaggacc ggtcggtttt    900

tctgccgcca tgctgccctt tttacaaaac cgcgatgcgc aggccgttca gcgccagcgc    960

gtggccgata actttcccgg cagcgatgcc tattacaact atgtgctgac cctgtttgga   1020

caaggctggg atcaacaccg tttccgcttc tcgacaaaag gtgagttatt acctgactgg   1080

ggccaggaat gcgcaaattc acactaa                                       1107

<210> 242
<211> 368
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(21)

<220> 
<221> DOMAIN
<222> (1)...(346)
<223> Glycosyl hydrolases family 8

<400> 242
Met Asn Val Leu Arg Ser Gly Leu Val Thr Met Leu Leu Leu Ala Ala 
1               5                   10                  15      


Phe Ser Val Gln Ala Ala Cys Thr Trp Pro Ala Trp Glu Gln Phe Lys 
            20                  25                  30          


Lys Asp Tyr Ile Ser Gln Glu Gly Arg Val Ile Asp Pro Ser Asp Ala 
        35                  40                  45              


Arg Lys Ile Thr Thr Ser Glu Gly Gln Ser Tyr Gly Met Phe Phe Ala 
    50                  55                  60                  


Leu Ala Ala Asn Asp Arg Ala Ala Phe Asp Asn Leu Leu Asp Trp Thr 
65                  70                  75                  80  


Gln Asn Asn Leu Ala Gln Gly Ser Leu Lys Glu His Leu Pro Ala Trp 
                85                  90                  95      


Leu Trp Gly Lys Lys Glu Asn Ser Lys Trp Glu Val Leu Asp Ser Asn 
            100                 105                 110         


Ser Ala Ser Asp Gly Asp Val Trp Met Ala Trp Ser Leu Leu Glu Ala 
        115                 120                 125             


Gly Arg Leu Trp Lys Glu Gln Arg Tyr Thr Asp Ile Gly Ser Ala Leu 
    130                 135                 140                 


Leu Lys Arg Ile Ala Arg Glu Glu Val Val Thr Val Pro Gly Leu Gly 
145                 150                 155                 160 


Ser Met Leu Leu Pro Gly Lys Val Gly Phe Ala Glu Asp Asn Ser Trp 
                165                 170                 175     


Arg Phe Asn Pro Ser Tyr Leu Pro Pro Thr Leu Ala Gln Tyr Phe Thr 
            180                 185                 190         


Arg Phe Gly Ala Pro Trp Thr Thr Leu Arg Glu Thr Asn Gln Arg Leu 
        195                 200                 205             


Leu Leu Glu Thr Ala Pro Lys Gly Phe Ser Pro Asp Trp Val Arg Tyr 
    210                 215                 220                 


Glu Lys Asp Lys Gly Trp Gln Leu Lys Ala Glu Lys Thr Leu Ile Ser 
225                 230                 235                 240 


Ser Tyr Asp Ala Ile Arg Val Tyr Met Trp Val Gly Met Met Pro Asp 
                245                 250                 255     


Ser Asp Pro Gln Lys Ala Arg Met Leu Asn Arg Phe Lys Pro Met Ala 
            260                 265                 270         


Thr Phe Thr Glu Lys Asn Gly Tyr Pro Pro Glu Lys Val Asp Val Ala 
        275                 280                 285             


Thr Gly Lys Ala Gln Gly Lys Gly Pro Val Gly Phe Ser Ala Ala Met 
    290                 295                 300                 


Leu Pro Phe Leu Gln Asn Arg Asp Ala Gln Ala Val Gln Arg Gln Arg 
305                 310                 315                 320 


Val Ala Asp Asn Phe Pro Gly Ser Asp Ala Tyr Tyr Asn Tyr Val Leu 
                325                 330                 335     


Thr Leu Phe Gly Gln Gly Trp Asp Gln His Arg Phe Arg Phe Ser Thr 
            340                 345                 350         


Lys Gly Glu Leu Leu Pro Asp Trp Gly Gln Glu Cys Ala Asn Ser His 
        355                 360                 365             


<210> 243
<211> 1290
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 243
atgaaaagca aagtgaaaat gttctttgcg gctgccatcg tgtggagtgc atgtagttca     60

acaggatatg ccgctgccat tgagaaggag aagcacatgt cagagcttcg gacagaggat    120

ctttttgtta aaaaagtaga ggggatgaac aaggatttta tcaaaggggc agatgtgtcc    180

agcgttattg ctttggaaaa tagcggagtc accttttata atacaagcgg aaaacgccag    240

gacatcttta caaccttaaa acaggctggg gtcaactatg ttcgcgtccg catctggaat    300

cacccgtatg attcaaatgg caacgggtat ggcgggggaa acaatgatgt tcaaaaagcc    360

atcgaaatcg gaaaaagagc gacagcgaac ggaatgaagg tgctggccga ctttcactac    420

tctgatttct gggccgatcc agcgaaacaa aaggtgccca aatcctgggt gaatctcagc    480

tttgaagcaa aaaaagggaa gttctatgag tatacgaaac aaagcctgca aaagatgatc    540

aaggaaggcg ttgacatcgg catggttcag gtcggaaatg aaacaacagg aggatttgcc    600

ggtgagactg attggacgaa gatgtgccaa ttatttaatg aaggaagccg agcggtcagg    660

gagacaaatt caaatatttt ggtcgccctg cattttacca atcctgaaac ggctggaagg    720

tattcattta ttgcggaaac actcagcaaa aacaaagtgg attatgatgt gtttgcgagc    780

tcctattatc ctttctggca tggcacatta caaaatttga cctccgtgct gaaggctgtt    840

gccaatactt acggcaaaaa agtcatggtg gcggagacat cgtacaccta taccgctgag    900

gatggcgatg ggcatggaaa tacagcacca aaaagcgggc agacgctgcc atatccaatt    960

tctgttcaag gccaggcaac tgcagtaaga gatgtaatgg aggcagtggc gaatacgggc   1020

aaagcaggac ttggtgtttt ctactgggag cctgcgtgga ttccggtcgg accgaaaaca   1080

cagatagaga aaaacaaagt gttatgggaa acatacgggt cagggtgggc ttccagctac   1140

gctgctgaat acgaccctga agacgccggg aagtggtatg ggggaagtgc tgtagataat   1200

caagctttgt ttgattttaa tggacacccg ctgccttcct tgcaggtgtt tcaatatgcg   1260

gagtcaggac atattccaaa gaaacgctga                                    1290

<210> 244
<211> 429
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(24)

<220> 
<221> DOMAIN
<222> (54)...(417)
<223> Glycosyl hydrolase family 53

<220> 
<221> DOMAIN
<222> (59)...(354)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (75)...(78)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (160)...(163)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (195)...(198)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (276)...(279)
<223> N-glycosylation site. Prosite id = PS00001

<400> 244
Met Lys Ser Lys Val Lys Met Phe Phe Ala Ala Ala Ile Val Trp Ser 
1               5                   10                  15      


Ala Cys Ser Ser Thr Gly Tyr Ala Ala Ala Ile Glu Lys Glu Lys His 
            20                  25                  30          


Met Ser Glu Leu Arg Thr Glu Asp Leu Phe Val Lys Lys Val Glu Gly 
        35                  40                  45              


Met Asn Lys Asp Phe Ile Lys Gly Ala Asp Val Ser Ser Val Ile Ala 
    50                  55                  60                  


Leu Glu Asn Ser Gly Val Thr Phe Tyr Asn Thr Ser Gly Lys Arg Gln 
65                  70                  75                  80  


Asp Ile Phe Thr Thr Leu Lys Gln Ala Gly Val Asn Tyr Val Arg Val 
                85                  90                  95      


Arg Ile Trp Asn His Pro Tyr Asp Ser Asn Gly Asn Gly Tyr Gly Gly 
            100                 105                 110         


Gly Asn Asn Asp Val Gln Lys Ala Ile Glu Ile Gly Lys Arg Ala Thr 
        115                 120                 125             


Ala Asn Gly Met Lys Val Leu Ala Asp Phe His Tyr Ser Asp Phe Trp 
    130                 135                 140                 


Ala Asp Pro Ala Lys Gln Lys Val Pro Lys Ser Trp Val Asn Leu Ser 
145                 150                 155                 160 


Phe Glu Ala Lys Lys Gly Lys Phe Tyr Glu Tyr Thr Lys Gln Ser Leu 
                165                 170                 175     


Gln Lys Met Ile Lys Glu Gly Val Asp Ile Gly Met Val Gln Val Gly 
            180                 185                 190         


Asn Glu Thr Thr Gly Gly Phe Ala Gly Glu Thr Asp Trp Thr Lys Met 
        195                 200                 205             


Cys Gln Leu Phe Asn Glu Gly Ser Arg Ala Val Arg Glu Thr Asn Ser 
    210                 215                 220                 


Asn Ile Leu Val Ala Leu His Phe Thr Asn Pro Glu Thr Ala Gly Arg 
225                 230                 235                 240 


Tyr Ser Phe Ile Ala Glu Thr Leu Ser Lys Asn Lys Val Asp Tyr Asp 
                245                 250                 255     


Val Phe Ala Ser Ser Tyr Tyr Pro Phe Trp His Gly Thr Leu Gln Asn 
            260                 265                 270         


Leu Thr Ser Val Leu Lys Ala Val Ala Asn Thr Tyr Gly Lys Lys Val 
        275                 280                 285             


Met Val Ala Glu Thr Ser Tyr Thr Tyr Thr Ala Glu Asp Gly Asp Gly 
    290                 295                 300                 


His Gly Asn Thr Ala Pro Lys Ser Gly Gln Thr Leu Pro Tyr Pro Ile 
305                 310                 315                 320 


Ser Val Gln Gly Gln Ala Thr Ala Val Arg Asp Val Met Glu Ala Val 
                325                 330                 335     


Ala Asn Thr Gly Lys Ala Gly Leu Gly Val Phe Tyr Trp Glu Pro Ala 
            340                 345                 350         


Trp Ile Pro Val Gly Pro Lys Thr Gln Ile Glu Lys Asn Lys Val Leu 
        355                 360                 365             


Trp Glu Thr Tyr Gly Ser Gly Trp Ala Ser Ser Tyr Ala Ala Glu Tyr 
    370                 375                 380                 


Asp Pro Glu Asp Ala Gly Lys Trp Tyr Gly Gly Ser Ala Val Asp Asn 
385                 390                 395                 400 


Gln Ala Leu Phe Asp Phe Asn Gly His Pro Leu Pro Ser Leu Gln Val 
                405                 410                 415     


Phe Gln Tyr Ala Glu Ser Gly His Ile Pro Lys Lys Arg 
            420                 425                 


<210> 245
<211> 3069
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 245
atggggaaga tcagtaagta ttttgctatg tttttagcat ttctaatggt gtttagttca     60

ttattcgtaa attttcaacc aaggaatgtt caagctgcca caccatcttt ggtaaatggc    120

gggtttgaat cggacttttg ggctgatggt tcatggagtg ttgaagcggc ggtttgggat    180

catcttgatt tacagcattt ttcttattcc ggagattcat ggatgaagaa ggatgaggga    240

gagcatgctt ttaagtattg gattaaagac agtgctggcg aaaatcaggc atttacggtt    300

aaacagacta tagccacact tccagctgga agctacgaac tatcgatcca ttctatgggt    360

ggtgcaggta ctgaagcggg ttctgtacaa ctatttgccg gaaatgaaaa gacagaggca    420

tcagttacgt ccggctataa taactgggga acgatcacgt tgaattttga agtaagtgaa    480

gatgtttcca attttgaggt aggtgcgatt gtgagtggtg cgccaaaggc ttggggctat    540

ttagatagcg taagcttaaa gtcactgaat acgagtgttc ctgatccagt agaagcagat    600

atttttgtag aaagagtaga cggtatcagc gagcatttta ttaaaggtgt cgatgtttct    660

agtattattt cgctagaaaa cagtggtgtc acatttaaaa atgaagcagg tgctgaacag    720

gatattttta aaacattagc ggattcaggc gttaactatg tccgtgttcg tgtctggaac    780

gacccgtttg acgctgcagg aaatggttat ggcggaggga acaatgattt acaaacggcg    840

atagaaattg gaaaaagagc tacagcaaat ggaatgaagc tattagtaga cttccattat    900

tctgatttct gggcagaccc tgctaaacag caagcaccaa aggcttgggc aaccttaagc    960

ttcgaagata agaaaaaagc tttatatgac tatacaaaag atagcctgca agcaatgaag   1020

aatgcgggaa ttaatattgg tatggtacaa gttgggaatg aaaccaatgg tggtgttgca   1080

ggagagaaag attggacaaa gattagtgct ttatttaatg aaggaagtaa agcagttaga   1140

tcgatagatc cgaatatctt agtagccgtg cactttacga accctgaaac agcaggaaga   1200

tatacatcca tagctaaaac tcttcaggat aacggtgtag actatgatgt atttgcgagc   1260

tcgtattatc cattctggca tggtacatta agcaatttaa cgactgtttt aaagaatgtt   1320

gccgatacgt acggcaagaa agtaatggtt gctgaaactt cctatgcata tacagctgaa   1380

gatggagacg gccatggtaa cacagcaccg aaggattctg gtcagacttt aaattacccg   1440

attactgttc agggacaagc taattccgtt agagacgtaa ttgaagcggt tgtcaatgtt   1500

ggtgaagcgg gaatcggtgt gttctattgg gaaccagcat ggcttcctgt tggtccagct   1560

tctcagcttg agcaaaataa agcaatctgg gaacagtatg gttctggctg ggctagcagt   1620

tatgctgccg aatatgatcc gcatgacgca ggcgcgtggt atggcggaag tgcagtagac   1680

aaccaggcat tatttgattt caccggtaaa ccattagctt cattaaatgt atttaactat   1740

gttaatacag gtgctgttgc tccattaaga attgatgaaa ttaaagatgt cactgttaat   1800

gcgattttag gagaagaaat tacattaccg gaaactgtta ccgttacgta taataatgga   1860

ttaacgggaa aggcttctgt cacatgggat ggtgcagcac tggaacaagc catcagcaag   1920

ggtgttggaa gatacgtaat tgaaggtcaa gttgaaggtg gagggggtgt caaagcacgc   1980

cttaccatta atcctaaaaa ttatgttgta aaccctggct ttgaaaataa agatcgttcg   2040

atgtggaaga ttagttatgg aaatagttct accccatatg tttcatatca acaaaaggct   2100

tctgacgcga aatctgggga atatgctctc catttttact caggttcagg tgttaatttc   2160

aacgtagagc aaacgattac aggcttagaa ccagggtatt ataatctttc tatgtttctt   2220

caaggcggag atgcgcataa cccagaaatg tatttgtatg caaaaacggg tggagaagaa   2280

ctaaaagatg atacaggtgt aaatggctgg gttgtttgga gcaatccaca aattaatgaa   2340

attcttgttg tagatggaac cattaccatt ggtgccagca tcaaagcaga tgcaggtgca   2400

tggggaacgc tagacgattt ctatttatac cgtgtacgtg attacgacaa caaagcacct   2460

gaaactaatg tggtcctttc aggacaagac tataatggct ggtacaacca ggatgttaac   2520

gttactctga atgctgcgga tgatcagtct ggagtagcga aaactgagta ccgcgtaaat   2580

aatggtgatt ggcaaaccta tcagggacca tttgaagtaa gtacggaagg ggcgaatgtt   2640

gtccaataca gaagtacgga caaagatggt aatgttgaag aaactcaatc ggttacagcg   2700

aaaatggata aaactgttcc gacattggat gtttcgttta acaaggccat tataaccgac   2760

cgaaaccatg ctcttattcc aattaaagct tcggtggttg gtgcagatac cctttcaggg   2820

attagcagaa ttgagttaat atcagttgaa agcaatcaac cggataatgg gaaaggcgac   2880

gggaatacag accaagatat tcagggaaca tactttggaa catttgatac tgattatctg   2940

ctaagagcag aaagaagtgg aagcggagac agagtttata ctgtaaccta taaggcgtgg   3000

gatcaagccg gaaattccgt tatccagtct aaacagatta ttgttaagca tgataactcg   3060

aaaaaatag                                                           3069

<210> 246
<211> 1022
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(32)

<220> 
<221> DOMAIN
<222> (214)...(578)
<223> Glycosyl hydrolase family 53

<220> 
<221> DOMAIN
<222> (216)...(520)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (594)...(654)
<223> Bacterial Ig-like domain (group 4)

<220> 
<221> SITE
<222> (192)...(195)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (358)...(361)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (438)...(441)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (698)...(701)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (746)...(749)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (852)...(855)
<223> N-glycosylation site. Prosite id = PS00001

<400> 246
Met Gly Lys Ile Ser Lys Tyr Phe Ala Met Phe Leu Ala Phe Leu Met 
1               5                   10                  15      


Val Phe Ser Ser Leu Phe Val Asn Phe Gln Pro Arg Asn Val Gln Ala 
            20                  25                  30          


Ala Thr Pro Ser Leu Val Asn Gly Gly Phe Glu Ser Asp Phe Trp Ala 
        35                  40                  45              


Asp Gly Ser Trp Ser Val Glu Ala Ala Val Trp Asp His Leu Asp Leu 
    50                  55                  60                  


Gln His Phe Ser Tyr Ser Gly Asp Ser Trp Met Lys Lys Asp Glu Gly 
65                  70                  75                  80  


Glu His Ala Phe Lys Tyr Trp Ile Lys Asp Ser Ala Gly Glu Asn Gln 
                85                  90                  95      


Ala Phe Thr Val Lys Gln Thr Ile Ala Thr Leu Pro Ala Gly Ser Tyr 
            100                 105                 110         


Glu Leu Ser Ile His Ser Met Gly Gly Ala Gly Thr Glu Ala Gly Ser 
        115                 120                 125             


Val Gln Leu Phe Ala Gly Asn Glu Lys Thr Glu Ala Ser Val Thr Ser 
    130                 135                 140                 


Gly Tyr Asn Asn Trp Gly Thr Ile Thr Leu Asn Phe Glu Val Ser Glu 
145                 150                 155                 160 


Asp Val Ser Asn Phe Glu Val Gly Ala Ile Val Ser Gly Ala Pro Lys 
                165                 170                 175     


Ala Trp Gly Tyr Leu Asp Ser Val Ser Leu Lys Ser Leu Asn Thr Ser 
            180                 185                 190         


Val Pro Asp Pro Val Glu Ala Asp Ile Phe Val Glu Arg Val Asp Gly 
        195                 200                 205             


Ile Ser Glu His Phe Ile Lys Gly Val Asp Val Ser Ser Ile Ile Ser 
    210                 215                 220                 


Leu Glu Asn Ser Gly Val Thr Phe Lys Asn Glu Ala Gly Ala Glu Gln 
225                 230                 235                 240 


Asp Ile Phe Lys Thr Leu Ala Asp Ser Gly Val Asn Tyr Val Arg Val 
                245                 250                 255     


Arg Val Trp Asn Asp Pro Phe Asp Ala Ala Gly Asn Gly Tyr Gly Gly 
            260                 265                 270         


Gly Asn Asn Asp Leu Gln Thr Ala Ile Glu Ile Gly Lys Arg Ala Thr 
        275                 280                 285             


Ala Asn Gly Met Lys Leu Leu Val Asp Phe His Tyr Ser Asp Phe Trp 
    290                 295                 300                 


Ala Asp Pro Ala Lys Gln Gln Ala Pro Lys Ala Trp Ala Thr Leu Ser 
305                 310                 315                 320 


Phe Glu Asp Lys Lys Lys Ala Leu Tyr Asp Tyr Thr Lys Asp Ser Leu 
                325                 330                 335     


Gln Ala Met Lys Asn Ala Gly Ile Asn Ile Gly Met Val Gln Val Gly 
            340                 345                 350         


Asn Glu Thr Asn Gly Gly Val Ala Gly Glu Lys Asp Trp Thr Lys Ile 
        355                 360                 365             


Ser Ala Leu Phe Asn Glu Gly Ser Lys Ala Val Arg Ser Ile Asp Pro 
    370                 375                 380                 


Asn Ile Leu Val Ala Val His Phe Thr Asn Pro Glu Thr Ala Gly Arg 
385                 390                 395                 400 


Tyr Thr Ser Ile Ala Lys Thr Leu Gln Asp Asn Gly Val Asp Tyr Asp 
                405                 410                 415     


Val Phe Ala Ser Ser Tyr Tyr Pro Phe Trp His Gly Thr Leu Ser Asn 
            420                 425                 430         


Leu Thr Thr Val Leu Lys Asn Val Ala Asp Thr Tyr Gly Lys Lys Val 
        435                 440                 445             


Met Val Ala Glu Thr Ser Tyr Ala Tyr Thr Ala Glu Asp Gly Asp Gly 
    450                 455                 460                 


His Gly Asn Thr Ala Pro Lys Asp Ser Gly Gln Thr Leu Asn Tyr Pro 
465                 470                 475                 480 


Ile Thr Val Gln Gly Gln Ala Asn Ser Val Arg Asp Val Ile Glu Ala 
                485                 490                 495     


Val Val Asn Val Gly Glu Ala Gly Ile Gly Val Phe Tyr Trp Glu Pro 
            500                 505                 510         


Ala Trp Leu Pro Val Gly Pro Ala Ser Gln Leu Glu Gln Asn Lys Ala 
        515                 520                 525             


Ile Trp Glu Gln Tyr Gly Ser Gly Trp Ala Ser Ser Tyr Ala Ala Glu 
    530                 535                 540                 


Tyr Asp Pro His Asp Ala Gly Ala Trp Tyr Gly Gly Ser Ala Val Asp 
545                 550                 555                 560 


Asn Gln Ala Leu Phe Asp Phe Thr Gly Lys Pro Leu Ala Ser Leu Asn 
                565                 570                 575     


Val Phe Asn Tyr Val Asn Thr Gly Ala Val Ala Pro Leu Arg Ile Asp 
            580                 585                 590         


Glu Ile Lys Asp Val Thr Val Asn Ala Ile Leu Gly Glu Glu Ile Thr 
        595                 600                 605             


Leu Pro Glu Thr Val Thr Val Thr Tyr Asn Asn Gly Leu Thr Gly Lys 
    610                 615                 620                 


Ala Ser Val Thr Trp Asp Gly Ala Ala Leu Glu Gln Ala Ile Ser Lys 
625                 630                 635                 640 


Gly Val Gly Arg Tyr Val Ile Glu Gly Gln Val Glu Gly Gly Gly Gly 
                645                 650                 655     


Val Lys Ala Arg Leu Thr Ile Asn Pro Lys Asn Tyr Val Val Asn Pro 
            660                 665                 670         


Gly Phe Glu Asn Lys Asp Arg Ser Met Trp Lys Ile Ser Tyr Gly Asn 
        675                 680                 685             


Ser Ser Thr Pro Tyr Val Ser Tyr Gln Gln Lys Ala Ser Asp Ala Lys 
    690                 695                 700                 


Ser Gly Glu Tyr Ala Leu His Phe Tyr Ser Gly Ser Gly Val Asn Phe 
705                 710                 715                 720 


Asn Val Glu Gln Thr Ile Thr Gly Leu Glu Pro Gly Tyr Tyr Asn Leu 
                725                 730                 735     


Ser Met Phe Leu Gln Gly Gly Asp Ala His Asn Pro Glu Met Tyr Leu 
            740                 745                 750         


Tyr Ala Lys Thr Gly Gly Glu Glu Leu Lys Asp Asp Thr Gly Val Asn 
        755                 760                 765             


Gly Trp Val Val Trp Ser Asn Pro Gln Ile Asn Glu Ile Leu Val Val 
    770                 775                 780                 


Asp Gly Thr Ile Thr Ile Gly Ala Ser Ile Lys Ala Asp Ala Gly Ala 
785                 790                 795                 800 


Trp Gly Thr Leu Asp Asp Phe Tyr Leu Tyr Arg Val Arg Asp Tyr Asp 
                805                 810                 815     


Asn Lys Ala Pro Glu Thr Asn Val Val Leu Ser Gly Gln Asp Tyr Asn 
            820                 825                 830         


Gly Trp Tyr Asn Gln Asp Val Asn Val Thr Leu Asn Ala Ala Asp Asp 
        835                 840                 845             


Gln Ser Gly Val Ala Lys Thr Glu Tyr Arg Val Asn Asn Gly Asp Trp 
    850                 855                 860                 


Gln Thr Tyr Gln Gly Pro Phe Glu Val Ser Thr Glu Gly Ala Asn Val 
865                 870                 875                 880 


Val Gln Tyr Arg Ser Thr Asp Lys Asp Gly Asn Val Glu Glu Thr Gln 
                885                 890                 895     


Ser Val Thr Ala Lys Met Asp Lys Thr Val Pro Thr Leu Asp Val Ser 
            900                 905                 910         


Phe Asn Lys Ala Ile Ile Thr Asp Arg Asn His Ala Leu Ile Pro Ile 
        915                 920                 925             


Lys Ala Ser Val Val Gly Ala Asp Thr Leu Ser Gly Ile Ser Arg Ile 
    930                 935                 940                 


Glu Leu Ile Ser Val Glu Ser Asn Gln Pro Asp Asn Gly Lys Gly Asp 
945                 950                 955                 960 


Gly Asn Thr Asp Gln Asp Ile Gln Gly Thr Tyr Phe Gly Thr Phe Asp 
                965                 970                 975     


Thr Asp Tyr Leu Leu Arg Ala Glu Arg Ser Gly Ser Gly Asp Arg Val 
            980                 985                 990         


Tyr Thr Val Thr Tyr Lys Ala Trp Asp Gln Ala Gly Asn Ser Val Ile 
        995                 1000                1005            


Gln Ser Lys Gln Ile Ile Val Lys His Asp Asn Ser Lys Lys 
    1010                1015                1020        


<210> 247
<211> 2958
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 247
ttggtgaatg ggggatttga atcagatttt tgggatgatg aatcatggaa agttgaagca     60

cctgtttgga atcatcttga tttacaatac ttttcctatt ccggagatcc atatattaag    120

aaggatgaag gtgagcatgc atttaaatat tggattaaag aaacggcaag cgaaacccaa    180

tcatttactg ttaagcagac tctagcaaaa cttccagccg gaagctacga actaacaatc    240

cattctatgg gtggtgcagg tacggaagcg ggatctgtaa aactatttgc tggaaatcaa    300

atgacaaatg cagaagcaac tacaggctat aataactggg gaacaattac gctgaatttt    360

gaggttagtg aagaagtttc aaattttgag gtgggtgcca ttgtgagtgg tgcaccaaag    420

gcttggggct atttagatag cgtaacttta aagtcgctga atgctagcat ccctgatcca    480

gtggatgctg atatttttgt tgaaagagta gacggtatca gtaaggactt tattaaaggt    540

gtagacgttt cgagtatagt ttcattagaa aacagtggcg taacatttaa aaacgaagca    600

ggcgcagaac aagatatttt taagacgtta gcggattctg gcgttaatta tgtacgcgtt    660

cgcgtctgga acgatccatt tgatgctgct ggaaacggtt atggcggcgg aaacaacgat    720

ttagcaactg cgattgaaat tgggaaaaga gcaacagcaa atggaatgaa gttattagta    780

gacttccatt actctgattt ctgggcggac cctgctaaac agcaagcacc aaaggcttgg    840

gcaaacttaa gttttgaaga taagaaaacc gctttataca actatacaaa agaaagccta    900

gaagcgatga aggctgcagg aattaacatt ggtatggttc aggttgggaa tgaaacaaat    960

ggtggagttg cgggagagaa ggagtggaca aaggttagtg cactcttcaa tgagggtagt   1020

aaagcgatta gagccgtaga ttctactatt ttagtagcgg tacattttac aaatccggaa   1080

actgcaggaa gatatgcctc tttagcaaaa acacttcatg ataacggagt agactatgat   1140

gtctttgcca gctcttatta cccattctgg catggtacct taagtaattt aacggctgtt   1200

ctgaaaaatg ttgccgatac ctatggcaaa aaggtaatgg tcgccgaaac ctcttatgcg   1260

tatacagctg aagatggaga tggccacgga aacacagcac cgaaggattc cggtcaggta   1320

ttaaattacc cgattactgt tcaaggccaa gcgaattcgg ttagagacgt aatccaagca   1380

gttgctaatg ttggagaagc gggaattggt gtgttttatt gggaaccggc atggcttcca   1440

gttggtccag ctactcagtt tgagcaaaat aaggcaatct gggaaaagta tggttctggt   1500

tgggcaacta gcttcgctgg ggaatatgat ccacatgatg caggtgcgtg gtatggcgga   1560

agtgcagtag ataaccaagc tttgtttgat ttcactggta aaccattacc ttcattaaac   1620

gtgtttaact atgtcgatac gggagcagtt gcaccgttaa aaatcgatga aattaaagat   1680

gttaccgtaa gtgcaattct aggagaagat attactttac ctgaaacggt aacagttacc   1740

tataataatg gaacaaaagg ctcgacttct gttacttggg atggacctgc acttgaacaa   1800

gcgattaaca gcggtgctgg aaagtatgta attgaaggcg tagttgaagg tggatcgact   1860

gtaaaagcac atcttaccat caatcctaaa aattatgtgg taaaccctgg ctttgaaaat   1920

agtgatcgag caatgtggaa ggttagttat ggaaatggaa ctcagccaca tacttcattc   1980

caaaaaaagg cttcagacgc gaaatcagga gaatatgctt tacattttta ctcagataaa   2040

ggtgtggatt tcaaagtaga gcaaaccatt acaggtttag aaccaggtta ttacaacctt   2100

tcaatgttcc tgcaaggtgg ggacgcagcg aacccagaaa tgtacttgtt tgcgaaaact   2160

ggtgaaaaag aactaaaagc caatacctcc gtaaacggct gggtaaattg gagcaatcct   2220

caaatcaagg agatccttgt tctagatgga accattacca tcggggcaag gatcaaagca   2280

aacgcaggtg catggggtac attagatgat ttctatttat atcgcgtgga tgttgacacc   2340

aaggcacccg taacggaggc ggccctttct gggcaagacc gtaatggatg gtataacgag   2400

agtgtaaatg ttactctgaa tgcttcagat gataaatctg gtgttgcaaa aacagagtat   2460

aaacttaata caggtaattg gcaaacctat caggggtcac ttgatgtaag tgctgaagga   2520

gaaaatgtgg ttcagtacag aagtacagat ctcttaggta acgttgaaga agttcaatct   2580

gttacagtaa aaatcgataa aagtgctcca acattgaatg tgtcgtttaa tacgagcgtc   2640

ctaaccgatc gtaaccatgc gcttattcca attaaagcgt tggttgaagg cgcagatact   2700

ctttcagggc tagagagaat tgaactagta tccattacaa gtaatcaacc agacaatgga   2760

aaaggtgatg gggatacact ccatgatatt cagggagcag actttggaac ctttgatact   2820

gattttctac taagagcaga gagaagtgga agtggggata gagtttatac tgtaacctat   2880

aaggcatggg atcataccgg aaactctgtg atccaatcca atcagattat tgtaaagctt   2940

aataattcga aaaaataa                                                 2958

<210> 248
<211> 985
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (178)...(542)
<223> Glycosyl hydrolase family 53

<220> 
<221> DOMAIN
<222> (180)...(484)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (558)...(619)
<223> Bacterial Ig-like domain (group 4)

<220> 
<221> SITE
<222> (156)...(159)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (286)...(289)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (298)...(301)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (321)...(324)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (402)...(405)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (591)...(594)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (662)...(665)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (709)...(712)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (739)...(742)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (747)...(750)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (811)...(814)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (815)...(818)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (819)...(822)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (886)...(889)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (890)...(893)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (996)...(999)
<223> N-glycosylation site. Prosite id = PS00001

<400> 248
Met Val Asn Gly Gly Phe Glu Ser Asp Phe Trp Asp Asp Glu Ser Trp 
1               5                   10                  15      


Lys Val Glu Ala Pro Val Trp Asn His Leu Asp Leu Gln Tyr Phe Ser 
            20                  25                  30          


Tyr Ser Gly Asp Pro Tyr Ile Lys Lys Asp Glu Gly Glu His Ala Phe 
        35                  40                  45              


Lys Tyr Trp Ile Lys Glu Thr Ala Ser Glu Thr Gln Ser Phe Thr Val 
    50                  55                  60                  


Lys Gln Thr Leu Ala Lys Leu Pro Ala Gly Ser Tyr Glu Leu Thr Ile 
65                  70                  75                  80  


His Ser Met Gly Gly Ala Gly Thr Glu Ala Gly Ser Val Lys Leu Phe 
                85                  90                  95      


Ala Gly Asn Gln Met Thr Asn Ala Glu Ala Thr Thr Gly Tyr Asn Asn 
            100                 105                 110         


Trp Gly Thr Ile Thr Leu Asn Phe Glu Val Ser Glu Glu Val Ser Asn 
        115                 120                 125             


Phe Glu Val Gly Ala Ile Val Ser Gly Ala Pro Lys Ala Trp Gly Tyr 
    130                 135                 140                 


Leu Asp Ser Val Thr Leu Lys Ser Leu Asn Ala Ser Ile Pro Asp Pro 
145                 150                 155                 160 


Val Asp Ala Asp Ile Phe Val Glu Arg Val Asp Gly Ile Ser Lys Asp 
                165                 170                 175     


Phe Ile Lys Gly Val Asp Val Ser Ser Ile Val Ser Leu Glu Asn Ser 
            180                 185                 190         


Gly Val Thr Phe Lys Asn Glu Ala Gly Ala Glu Gln Asp Ile Phe Lys 
        195                 200                 205             


Thr Leu Ala Asp Ser Gly Val Asn Tyr Val Arg Val Arg Val Trp Asn 
    210                 215                 220                 


Asp Pro Phe Asp Ala Ala Gly Asn Gly Tyr Gly Gly Gly Asn Asn Asp 
225                 230                 235                 240 


Leu Ala Thr Ala Ile Glu Ile Gly Lys Arg Ala Thr Ala Asn Gly Met 
                245                 250                 255     


Lys Leu Leu Val Asp Phe His Tyr Ser Asp Phe Trp Ala Asp Pro Ala 
            260                 265                 270         


Lys Gln Gln Ala Pro Lys Ala Trp Ala Asn Leu Ser Phe Glu Asp Lys 
        275                 280                 285             


Lys Thr Ala Leu Tyr Asn Tyr Thr Lys Glu Ser Leu Glu Ala Met Lys 
    290                 295                 300                 


Ala Ala Gly Ile Asn Ile Gly Met Val Gln Val Gly Asn Glu Thr Asn 
305                 310                 315                 320 


Gly Gly Val Ala Gly Glu Lys Glu Trp Thr Lys Val Ser Ala Leu Phe 
                325                 330                 335     


Asn Glu Gly Ser Lys Ala Ile Arg Ala Val Asp Ser Thr Ile Leu Val 
            340                 345                 350         


Ala Val His Phe Thr Asn Pro Glu Thr Ala Gly Arg Tyr Ala Ser Leu 
        355                 360                 365             


Ala Lys Thr Leu His Asp Asn Gly Val Asp Tyr Asp Val Phe Ala Ser 
    370                 375                 380                 


Ser Tyr Tyr Pro Phe Trp His Gly Thr Leu Ser Asn Leu Thr Ala Val 
385                 390                 395                 400 


Leu Lys Asn Val Ala Asp Thr Tyr Gly Lys Lys Val Met Val Ala Glu 
                405                 410                 415     


Thr Ser Tyr Ala Tyr Thr Ala Glu Asp Gly Asp Gly His Gly Asn Thr 
            420                 425                 430         


Ala Pro Lys Asp Ser Gly Gln Val Leu Asn Tyr Pro Ile Thr Val Gln 
        435                 440                 445             


Gly Gln Ala Asn Ser Val Arg Asp Val Ile Gln Ala Val Ala Asn Val 
    450                 455                 460                 


Gly Glu Ala Gly Ile Gly Val Phe Tyr Trp Glu Pro Ala Trp Leu Pro 
465                 470                 475                 480 


Val Gly Pro Ala Thr Gln Phe Glu Gln Asn Lys Ala Ile Trp Glu Lys 
                485                 490                 495     


Tyr Gly Ser Gly Trp Ala Thr Ser Phe Ala Gly Glu Tyr Asp Pro His 
            500                 505                 510         


Asp Ala Gly Ala Trp Tyr Gly Gly Ser Ala Val Asp Asn Gln Ala Leu 
        515                 520                 525             


Phe Asp Phe Thr Gly Lys Pro Leu Pro Ser Leu Asn Val Phe Asn Tyr 
    530                 535                 540                 


Val Asp Thr Gly Ala Val Ala Pro Leu Lys Ile Asp Glu Ile Lys Asp 
545                 550                 555                 560 


Val Thr Val Ser Ala Ile Leu Gly Glu Asp Ile Thr Leu Pro Glu Thr 
                565                 570                 575     


Val Thr Val Thr Tyr Asn Asn Gly Thr Lys Gly Ser Thr Ser Val Thr 
            580                 585                 590         


Trp Asp Gly Pro Ala Leu Glu Gln Ala Ile Asn Ser Gly Ala Gly Lys 
        595                 600                 605             


Tyr Val Ile Glu Gly Val Val Glu Gly Gly Ser Thr Val Lys Ala His 
    610                 615                 620                 


Leu Thr Ile Asn Pro Lys Asn Tyr Val Val Asn Pro Gly Phe Glu Asn 
625                 630                 635                 640 


Ser Asp Arg Ala Met Trp Lys Val Ser Tyr Gly Asn Gly Thr Gln Pro 
                645                 650                 655     


His Thr Ser Phe Gln Lys Lys Ala Ser Asp Ala Lys Ser Gly Glu Tyr 
            660                 665                 670         


Ala Leu His Phe Tyr Ser Asp Lys Gly Val Asp Phe Lys Val Glu Gln 
        675                 680                 685             


Thr Ile Thr Gly Leu Glu Pro Gly Tyr Tyr Asn Leu Ser Met Phe Leu 
    690                 695                 700                 


Gln Gly Gly Asp Ala Ala Asn Pro Glu Met Tyr Leu Phe Ala Lys Thr 
705                 710                 715                 720 


Gly Glu Lys Glu Leu Lys Ala Asn Thr Ser Val Asn Gly Trp Val Asn 
                725                 730                 735     


Trp Ser Asn Pro Gln Ile Lys Glu Ile Leu Val Leu Asp Gly Thr Ile 
            740                 745                 750         


Thr Ile Gly Ala Arg Ile Lys Ala Asn Ala Gly Ala Trp Gly Thr Leu 
        755                 760                 765             


Asp Asp Phe Tyr Leu Tyr Arg Val Asp Val Asp Thr Lys Ala Pro Val 
    770                 775                 780                 


Thr Glu Ala Ala Leu Ser Gly Gln Asp Arg Asn Gly Trp Tyr Asn Glu 
785                 790                 795                 800 


Ser Val Asn Val Thr Leu Asn Ala Ser Asp Asp Lys Ser Gly Val Ala 
                805                 810                 815     


Lys Thr Glu Tyr Lys Leu Asn Thr Gly Asn Trp Gln Thr Tyr Gln Gly 
            820                 825                 830         


Ser Leu Asp Val Ser Ala Glu Gly Glu Asn Val Val Gln Tyr Arg Ser 
        835                 840                 845             


Thr Asp Leu Leu Gly Asn Val Glu Glu Val Gln Ser Val Thr Val Lys 
    850                 855                 860                 


Ile Asp Lys Ser Ala Pro Thr Leu Asn Val Ser Phe Asn Thr Ser Val 
865                 870                 875                 880 


Leu Thr Asp Arg Asn His Ala Leu Ile Pro Ile Lys Ala Leu Val Glu 
                885                 890                 895     


Gly Ala Asp Thr Leu Ser Gly Leu Glu Arg Ile Glu Leu Val Ser Ile 
            900                 905                 910         


Thr Ser Asn Gln Pro Asp Asn Gly Lys Gly Asp Gly Asp Thr Leu His 
        915                 920                 925             


Asp Ile Gln Gly Ala Asp Phe Gly Thr Phe Asp Thr Asp Phe Leu Leu 
    930                 935                 940                 


Arg Ala Glu Arg Ser Gly Ser Gly Asp Arg Val Tyr Thr Val Thr Tyr 
945                 950                 955                 960 


Lys Ala Trp Asp His Thr Gly Asn Ser Val Ile Gln Ser Asn Gln Ile 
                965                 970                 975     


Ile Val Lys Leu Asn Asn Ser Lys Lys 
            980                 985 


<210> 249
<211> 1452
<212> DNA
<213> Thermococcus AEPII1a

<400> 249
atgaagttcc catctaactt tctttttggc tactcctggt cgggcttcca gtttgaaatg     60

ggtttacctg ggagtgaagt tgagagcgac tggtgggcat gggtccacga taaggagaac    120

atcttctcgg gcctagttag cggtgaccta ccagagaacg ggcctgctta ctggcacctc    180

tacaagaaag accacgacat agctgaaagc cttggcatgg acgcgataag aggcggaatc    240

gagtgggcga ggatcttccc aaaacccacc tttgacgtga aggttgacgt ggaaaaggac    300

gaaaacggga acataatctc cattgacgtc ccggagagcg cgatagagga gctagaaaag    360

cttgccaaca tggatgccct caaccactac cgcgaaatct actcggactg gaaggagagg    420

ggcaagacct tcatattgaa cctctatcac tggccccttc ccctctggct ccacgacccg    480

ataggcgtta gaaagctcgg ccctgataga gctccctcgg gctggctgga cgagaggagc    540

gtggtggagt tcaccaagtt cgctgcattc atcgcctacc acttggatga cctcgttgac    600

atgtggagca cgatgaacga gccgaatgtg gtttacgagc agggttacac gaggcctcag    660

tcgggctttc caccgggtta tctcagccac gaggccgctg gaaaggcgaa gctcaacctc    720

atgcaggctc acgctagagc ttacgatgcg ataaaagagc actcggacaa gcccgtgggg    780

ttgatatact cctttgtctg gcacgatgcc ctaaacgagg aagcggagga gattgtgaag    840

gagataagga ggagacacta cgacttcgta accggccttc actccggctc atcggagttc    900

ggggagaggg aggacttcaa ggggaagatc gactggatag gcgtgaacta ctacactagg    960

gttgcttacg agatgaggaa cggccgcttt atggccctac ccgggtacgg ctacatgtgc   1020

gagaggagtg gttacgcaaa atccggaagg cccgcgagcg attttggctg ggagacctat   1080

cctgagggcc tcgaaaacgt cctgatggat ctgaaggagc tctacggcct gccaatgatg   1140

gtgacggaga acgggatggc ggatatggca gacaggcacc gctcttacta cctcgtgagc   1200

cacctcgcgg ctatccacag ggcgatggag aagggtgccg acgttagggg gtacctccac   1260

tggtctctga ccgacaacta cgagtgggcg cagggcttca gaatgcgctt tgggctggtg   1320

atggtggact tcgagactaa gaagcgctac ataaggccga gcgcactcgt cttcagggag   1380

atagccacgc agaaggaaat acccgaagag ctctcccacc tagcgaacct cgaactggta   1440

acgaagaagt ag                                                       1452

<210> 250
<211> 483
<212> PRT
<213> Thermococcus AEPII1a

<220> 
<221> DOMAIN
<222> (1)...(467)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (7)...(21)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<400> 250
Met Lys Phe Pro Ser Asn Phe Leu Phe Gly Tyr Ser Trp Ser Gly Phe 
1               5                   10                  15      


Gln Phe Glu Met Gly Leu Pro Gly Ser Glu Val Glu Ser Asp Trp Trp 
            20                  25                  30          


Ala Trp Val His Asp Lys Glu Asn Ile Phe Ser Gly Leu Val Ser Gly 
        35                  40                  45              


Asp Leu Pro Glu Asn Gly Pro Ala Tyr Trp His Leu Tyr Lys Lys Asp 
    50                  55                  60                  


His Asp Ile Ala Glu Ser Leu Gly Met Asp Ala Ile Arg Gly Gly Ile 
65                  70                  75                  80  


Glu Trp Ala Arg Ile Phe Pro Lys Pro Thr Phe Asp Val Lys Val Asp 
                85                  90                  95      


Val Glu Lys Asp Glu Asn Gly Asn Ile Ile Ser Ile Asp Val Pro Glu 
            100                 105                 110         


Ser Ala Ile Glu Glu Leu Glu Lys Leu Ala Asn Met Asp Ala Leu Asn 
        115                 120                 125             


His Tyr Arg Glu Ile Tyr Ser Asp Trp Lys Glu Arg Gly Lys Thr Phe 
    130                 135                 140                 


Ile Leu Asn Leu Tyr His Trp Pro Leu Pro Leu Trp Leu His Asp Pro 
145                 150                 155                 160 


Ile Gly Val Arg Lys Leu Gly Pro Asp Arg Ala Pro Ser Gly Trp Leu 
                165                 170                 175     


Asp Glu Arg Ser Val Val Glu Phe Thr Lys Phe Ala Ala Phe Ile Ala 
            180                 185                 190         


Tyr His Leu Asp Asp Leu Val Asp Met Trp Ser Thr Met Asn Glu Pro 
        195                 200                 205             


Asn Val Val Tyr Glu Gln Gly Tyr Thr Arg Pro Gln Ser Gly Phe Pro 
    210                 215                 220                 


Pro Gly Tyr Leu Ser His Glu Ala Ala Gly Lys Ala Lys Leu Asn Leu 
225                 230                 235                 240 


Met Gln Ala His Ala Arg Ala Tyr Asp Ala Ile Lys Glu His Ser Asp 
                245                 250                 255     


Lys Pro Val Gly Leu Ile Tyr Ser Phe Val Trp His Asp Ala Leu Asn 
            260                 265                 270         


Glu Glu Ala Glu Glu Ile Val Lys Glu Ile Arg Arg Arg His Tyr Asp 
        275                 280                 285             


Phe Val Thr Gly Leu His Ser Gly Ser Ser Glu Phe Gly Glu Arg Glu 
    290                 295                 300                 


Asp Phe Lys Gly Lys Ile Asp Trp Ile Gly Val Asn Tyr Tyr Thr Arg 
305                 310                 315                 320 


Val Ala Tyr Glu Met Arg Asn Gly Arg Phe Met Ala Leu Pro Gly Tyr 
                325                 330                 335     


Gly Tyr Met Cys Glu Arg Ser Gly Tyr Ala Lys Ser Gly Arg Pro Ala 
            340                 345                 350         


Ser Asp Phe Gly Trp Glu Thr Tyr Pro Glu Gly Leu Glu Asn Val Leu 
        355                 360                 365             


Met Asp Leu Lys Glu Leu Tyr Gly Leu Pro Met Met Val Thr Glu Asn 
    370                 375                 380                 


Gly Met Ala Asp Met Ala Asp Arg His Arg Ser Tyr Tyr Leu Val Ser 
385                 390                 395                 400 


His Leu Ala Ala Ile His Arg Ala Met Glu Lys Gly Ala Asp Val Arg 
                405                 410                 415     


Gly Tyr Leu His Trp Ser Leu Thr Asp Asn Tyr Glu Trp Ala Gln Gly 
            420                 425                 430         


Phe Arg Met Arg Phe Gly Leu Val Met Val Asp Phe Glu Thr Lys Lys 
        435                 440                 445             


Arg Tyr Ile Arg Pro Ser Ala Leu Val Phe Arg Glu Ile Ala Thr Gln 
    450                 455                 460                 


Lys Glu Ile Pro Glu Glu Leu Ser His Leu Ala Asn Leu Glu Leu Val 
465                 470                 475                 480 


Thr Lys Lys 
            


<210> 251
<211> 1455
<212> DNA
<213> Thermococcus AEPII1a

<400> 251
atgaagttcc catctaactt tctttttggc tactcctggt cgggcttcca gtttgaaatg     60

ggtttacctg ggagtgaagt tgagagcgac tggtgggcat gggtccacga taaggagaac    120

atcttctcgg gcctagttag cggtgaccta ccagagaacg ggcctgctta ctggcacctc    180

tacaagaaag accacgacat agctgaaagc cttggcatgg acgcgataag aggcggaatc    240

gagtgggcga ggatcttccc aaaacccacc tttgacgtga aggttgacgt ggaaaaggac    300

gaaaacggga acataatctc cattgacgtc ccggagagcg cgatagagga gctagaaaag    360

cttgccaaca tggatgccct caaccactac cgcgaaatct actcggactg gaaggagagg    420

ggcaagacct tcatattgaa cctctatcac tggccccttc ccctctggct ccacgacccg    480

ataggcgtta gaaagctcgg ccctgataga gctccctcgg gctggctgga cgagaggagc    540

gtggtggagt tcaccaagtt cgctgcattc atcgcctacc acttggatga cctcgttgac    600

atgtggagca cgatgaacga gccgaatgtg gtttacgagc agggttacac gaggcctcag    660

tcgggctttc caccgggtta tctcagccac gaggccgctg gaaaggcgaa gctcaacctc    720

atgcaggctc acgctagagc ttacgatgcg ataaaagagc actcggacaa gccagttgga    780

gttatctacg catataagtg gattgatgcg gaggatgaag ctgcagagga atccgttctg    840

gaactccgca ggagggatta cgacttcgtt gatggtctct actcaggcaa gtccctgact    900

gcaggtgaga gggaggactt caaaggcagg gtcgactggg ttggcgtcaa ctactactcc    960

cgcctgctct ttggaaaggc cggagattca gtgagattac ttgagggcta cggttttgtc   1020

tccccgaggg gtggctacgc caaatcggga aggcctgcga gcgattttgg ctgggagatt   1080

tatcctgagg gcctcgaaaa gctcctggtt gagctgagtg gcaggtacga gcttccgctc   1140

ttcataacgg agaatggtat ggctgatgct gtcgataggt acaggcctta ctacctcgtg   1200

agccacctcg cggctatcca cagggcgatg gagaagggtg ccgacattag ggggtacctc   1260

cactggtctc tgaccgacaa ctacgagtgg gcgcagggct tcagaatgcg ctttgggctg   1320

gtgatggtgg acttcgagac taagaagcgc tacttgaggc cgagcgcact cgtcttcagg   1380

gaaatagcca cgcggaagga aatacccgaa gagcttgaac accttgccga tgtggatgca   1440

atcattgctc ggtga                                                    1455

<210> 252
<211> 484
<212> PRT
<213> Thermococcus AEPII1a

<220> 
<221> DOMAIN
<222> (1)...(468)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (7)...(21)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (385)...(393)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 252
Met Lys Phe Pro Ser Asn Phe Leu Phe Gly Tyr Ser Trp Ser Gly Phe 
1               5                   10                  15      


Gln Phe Glu Met Gly Leu Pro Gly Ser Glu Val Glu Ser Asp Trp Trp 
            20                  25                  30          


Ala Trp Val His Asp Lys Glu Asn Ile Phe Ser Gly Leu Val Ser Gly 
        35                  40                  45              


Asp Leu Pro Glu Asn Gly Pro Ala Tyr Trp His Leu Tyr Lys Lys Asp 
    50                  55                  60                  


His Asp Ile Ala Glu Ser Leu Gly Met Asp Ala Ile Arg Gly Gly Ile 
65                  70                  75                  80  


Glu Trp Ala Arg Ile Phe Pro Lys Pro Thr Phe Asp Val Lys Val Asp 
                85                  90                  95      


Val Glu Lys Asp Glu Asn Gly Asn Ile Ile Ser Ile Asp Val Pro Glu 
            100                 105                 110         


Ser Ala Ile Glu Glu Leu Glu Lys Leu Ala Asn Met Asp Ala Leu Asn 
        115                 120                 125             


His Tyr Arg Glu Ile Tyr Ser Asp Trp Lys Glu Arg Gly Lys Thr Phe 
    130                 135                 140                 


Ile Leu Asn Leu Tyr His Trp Pro Leu Pro Leu Trp Leu His Asp Pro 
145                 150                 155                 160 


Ile Gly Val Arg Lys Leu Gly Pro Asp Arg Ala Pro Ser Gly Trp Leu 
                165                 170                 175     


Asp Glu Arg Ser Val Val Glu Phe Thr Lys Phe Ala Ala Phe Ile Ala 
            180                 185                 190         


Tyr His Leu Asp Asp Leu Val Asp Met Trp Ser Thr Met Asn Glu Pro 
        195                 200                 205             


Asn Val Val Tyr Glu Gln Gly Tyr Thr Arg Pro Gln Ser Gly Phe Pro 
    210                 215                 220                 


Pro Gly Tyr Leu Ser His Glu Ala Ala Gly Lys Ala Lys Leu Asn Leu 
225                 230                 235                 240 


Met Gln Ala His Ala Arg Ala Tyr Asp Ala Ile Lys Glu His Ser Asp 
                245                 250                 255     


Lys Pro Val Gly Val Ile Tyr Ala Tyr Lys Trp Ile Asp Ala Glu Asp 
            260                 265                 270         


Glu Ala Ala Glu Glu Ser Val Leu Glu Leu Arg Arg Arg Asp Tyr Asp 
        275                 280                 285             


Phe Val Asp Gly Leu Tyr Ser Gly Lys Ser Leu Thr Ala Gly Glu Arg 
    290                 295                 300                 


Glu Asp Phe Lys Gly Arg Val Asp Trp Val Gly Val Asn Tyr Tyr Ser 
305                 310                 315                 320 


Arg Leu Leu Phe Gly Lys Ala Gly Asp Ser Val Arg Leu Leu Glu Gly 
                325                 330                 335     


Tyr Gly Phe Val Ser Pro Arg Gly Gly Tyr Ala Lys Ser Gly Arg Pro 
            340                 345                 350         


Ala Ser Asp Phe Gly Trp Glu Ile Tyr Pro Glu Gly Leu Glu Lys Leu 
        355                 360                 365             


Leu Val Glu Leu Ser Gly Arg Tyr Glu Leu Pro Leu Phe Ile Thr Glu 
    370                 375                 380                 


Asn Gly Met Ala Asp Ala Val Asp Arg Tyr Arg Pro Tyr Tyr Leu Val 
385                 390                 395                 400 


Ser His Leu Ala Ala Ile His Arg Ala Met Glu Lys Gly Ala Asp Ile 
                405                 410                 415     


Arg Gly Tyr Leu His Trp Ser Leu Thr Asp Asn Tyr Glu Trp Ala Gln 
            420                 425                 430         


Gly Phe Arg Met Arg Phe Gly Leu Val Met Val Asp Phe Glu Thr Lys 
        435                 440                 445             


Lys Arg Tyr Leu Arg Pro Ser Ala Leu Val Phe Arg Glu Ile Ala Thr 
    450                 455                 460                 


Arg Lys Glu Ile Pro Glu Glu Leu Glu His Leu Ala Asp Val Asp Ala 
465                 470                 475                 480 


Ile Ile Ala Arg 
                


<210> 253
<211> 2166
<212> DNA
<213> Thermotoga maritima MSB8

<400> 253
atggaaagga tcgatgaaat tctctctcag ttaactacag aggaaaaggt gaagctcgtt     60

gtgggggttg gtcttccagg actttttggg aacccacatt ccagagtggc gggtgcggct    120

ggagaaacac atcccgttcc aagacttgga attcctgcgt ttgtcctggc agatggtccc    180

gcaggactca gaataaatcc cacaagggaa aacgatgaaa acacttacta cacgacggca    240

tttcccgttg aaatcatgct cgcttctacc tggaacagag accttctgga agaagtggga    300

aaagccatgg gagaagaagt tagggaatac ggtgtcgatg tgcttcttgc acctgcgatg    360

aacattcaca gaaaccctct ttgtggaagg aatttcgagt actactcaga agatcctgtc    420

ctttccggtg aaatggcttc agcctttgtc aagggagttc aatctcaagg ggtgggagcc    480

tgcataaaac actttgtcgc gaacaaccag gaaacgaaca ggatggtagt ggacacgatc    540

gtgtccgagc gagccctcag agaaatatat ctgaaaggtt ttgaaattgc tgtcaagaaa    600

gcaagaccct ggaccgtgat gagcgcttac aacaaactga atggaaaata ctgttcacag    660

aacgaatggc ttttgaagaa ggttctcagg gaagaatggg gatttggcgg tttcgtgatg    720

agcgactggt acgcgggaga caaccctgta gaacagctca aggccggaaa cgatatgatc    780

atgcctggga aagcgtatca ggtgaacaca gaaagaagag atgaaataga agaaatcatg    840

gaggcgttga aggagggaaa attgagtgag gaggttctcg atgagtgtgt gagaaacatt    900

ctcaaagttc ttgtgaacgc gccttccttc aaagggtaca ggtactcaaa caagccggat    960

ctcgaatctc acgcggaagt cgcctacgaa gcaggtgcgg agggtgttgt ccttcttgag   1020

aacaacggtg ttcttccgtt cgatgaaaat acccatgtcg ccgtctttgg caccggtcaa   1080

atcgaaacaa taaagggagg aacgggaagt ggagacaccc atccgagata cacgatctct   1140

atccttgaag gcataaaaga aagaaacatg aagttcgacg aagaactcgc ttccacttat   1200

gaggagtaca taaaaaagat gagagaaaca gaggaatata aacccagaac cgactcttgg   1260

ggaacggtca taaaaccgaa actcccagag aatttcctct cagaaaaaga gataaagaaa   1320

cctccaaaga aaaacgatgt tgcagttgtt gtgatcagta ggatctccgg tgagggatac   1380

gacagaaagc cggtgaaagg tgacttctac ctctccgatg acgagctgga actcataaaa   1440

accgtctcga aagaattcca cgatcagggt aagaaagttg tggttcttct gaacatcgga   1500

agtcccatcg aagtcgcaag ctggagagac cttgtggatg gaattcttct cgtctggcag   1560

gcgggacagg agatgggaag aatagtggcc gatgttcttg tgggaaagat taatccctcc   1620

ggaaaacttc caacgacctt cccgaaggat tactcggacg ttccatcctg gacgttccca   1680

ggagagccaa aggacaatcc gcaaagagtg gtgtacgagg aagacatcta cgtgggatac   1740

aggtactacg acaccttcgg tgtggaacct gcctacgaat tcggctacgg cctctcttac   1800

acaaagtttg aatacaaaga tttaaaaatc gctatcgacg gtgagacgct cagagtgtcg   1860

tacacgatca caaacactgg ggacagagct ggaaaggaag tctcacaggt ctacatcaaa   1920

gctccaaaag gaaaaataga caaacccttc caggagctga aagcgtttca caaaacaaaa   1980

cttttgaacc cgggtgaatc agaagaaatc tccttggaaa ttcctctcag agatcttgcg   2040

agtttcgatg ggaaagaatg ggttgtcgag tcaggagaat acgaggtcag ggtcggtgca   2100

tcttcgaggg atataaggtt gagagatatt tttctggttg agggagagaa gagattcaaa   2160

ccatga                                                              2166

<210> 254
<211> 721
<212> PRT
<213> Thermotoga maritima MSB8

<220> 
<221> DOMAIN
<222> (43)...(262)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (336)...(602)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (231)...(248)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<400> 254
Met Glu Arg Ile Asp Glu Ile Leu Ser Gln Leu Thr Thr Glu Glu Lys 
1               5                   10                  15      


Val Lys Leu Val Val Gly Val Gly Leu Pro Gly Leu Phe Gly Asn Pro 
            20                  25                  30          


His Ser Arg Val Ala Gly Ala Ala Gly Glu Thr His Pro Val Pro Arg 
        35                  40                  45              


Leu Gly Ile Pro Ala Phe Val Leu Ala Asp Gly Pro Ala Gly Leu Arg 
    50                  55                  60                  


Ile Asn Pro Thr Arg Glu Asn Asp Glu Asn Thr Tyr Tyr Thr Thr Ala 
65                  70                  75                  80  


Phe Pro Val Glu Ile Met Leu Ala Ser Thr Trp Asn Arg Asp Leu Leu 
                85                  90                  95      


Glu Glu Val Gly Lys Ala Met Gly Glu Glu Val Arg Glu Tyr Gly Val 
            100                 105                 110         


Asp Val Leu Leu Ala Pro Ala Met Asn Ile His Arg Asn Pro Leu Cys 
        115                 120                 125             


Gly Arg Asn Phe Glu Tyr Tyr Ser Glu Asp Pro Val Leu Ser Gly Glu 
    130                 135                 140                 


Met Ala Ser Ala Phe Val Lys Gly Val Gln Ser Gln Gly Val Gly Ala 
145                 150                 155                 160 


Cys Ile Lys His Phe Val Ala Asn Asn Gln Glu Thr Asn Arg Met Val 
                165                 170                 175     


Val Asp Thr Ile Val Ser Glu Arg Ala Leu Arg Glu Ile Tyr Leu Lys 
            180                 185                 190         


Gly Phe Glu Ile Ala Val Lys Lys Ala Arg Pro Trp Thr Val Met Ser 
        195                 200                 205             


Ala Tyr Asn Lys Leu Asn Gly Lys Tyr Cys Ser Gln Asn Glu Trp Leu 
    210                 215                 220                 


Leu Lys Lys Val Leu Arg Glu Glu Trp Gly Phe Gly Gly Phe Val Met 
225                 230                 235                 240 


Ser Asp Trp Tyr Ala Gly Asp Asn Pro Val Glu Gln Leu Lys Ala Gly 
                245                 250                 255     


Asn Asp Met Ile Met Pro Gly Lys Ala Tyr Gln Val Asn Thr Glu Arg 
            260                 265                 270         


Arg Asp Glu Ile Glu Glu Ile Met Glu Ala Leu Lys Glu Gly Lys Leu 
        275                 280                 285             


Ser Glu Glu Val Leu Asp Glu Cys Val Arg Asn Ile Leu Lys Val Leu 
    290                 295                 300                 


Val Asn Ala Pro Ser Phe Lys Gly Tyr Arg Tyr Ser Asn Lys Pro Asp 
305                 310                 315                 320 


Leu Glu Ser His Ala Glu Val Ala Tyr Glu Ala Gly Ala Glu Gly Val 
                325                 330                 335     


Val Leu Leu Glu Asn Asn Gly Val Leu Pro Phe Asp Glu Asn Thr His 
            340                 345                 350         


Val Ala Val Phe Gly Thr Gly Gln Ile Glu Thr Ile Lys Gly Gly Thr 
        355                 360                 365             


Gly Ser Gly Asp Thr His Pro Arg Tyr Thr Ile Ser Ile Leu Glu Gly 
    370                 375                 380                 


Ile Lys Glu Arg Asn Met Lys Phe Asp Glu Glu Leu Ala Ser Thr Tyr 
385                 390                 395                 400 


Glu Glu Tyr Ile Lys Lys Met Arg Glu Thr Glu Glu Tyr Lys Pro Arg 
                405                 410                 415     


Thr Asp Ser Trp Gly Thr Val Ile Lys Pro Lys Leu Pro Glu Asn Phe 
            420                 425                 430         


Leu Ser Glu Lys Glu Ile Lys Lys Pro Pro Lys Lys Asn Asp Val Ala 
        435                 440                 445             


Val Val Val Ile Ser Arg Ile Ser Gly Glu Gly Tyr Asp Arg Lys Pro 
    450                 455                 460                 


Val Lys Gly Asp Phe Tyr Leu Ser Asp Asp Glu Leu Glu Leu Ile Lys 
465                 470                 475                 480 


Thr Val Ser Lys Glu Phe His Asp Gln Gly Lys Lys Val Val Val Leu 
                485                 490                 495     


Leu Asn Ile Gly Ser Pro Ile Glu Val Ala Ser Trp Arg Asp Leu Val 
            500                 505                 510         


Asp Gly Ile Leu Leu Val Trp Gln Ala Gly Gln Glu Met Gly Arg Ile 
        515                 520                 525             


Val Ala Asp Val Leu Val Gly Lys Ile Asn Pro Ser Gly Lys Leu Pro 
    530                 535                 540                 


Thr Thr Phe Pro Lys Asp Tyr Ser Asp Val Pro Ser Trp Thr Phe Pro 
545                 550                 555                 560 


Gly Glu Pro Lys Asp Asn Pro Gln Arg Val Val Tyr Glu Glu Asp Ile 
                565                 570                 575     


Tyr Val Gly Tyr Arg Tyr Tyr Asp Thr Phe Gly Val Glu Pro Ala Tyr 
            580                 585                 590         


Glu Phe Gly Tyr Gly Leu Ser Tyr Thr Lys Phe Glu Tyr Lys Asp Leu 
        595                 600                 605             


Lys Ile Ala Ile Asp Gly Glu Thr Leu Arg Val Ser Tyr Thr Ile Thr 
    610                 615                 620                 


Asn Thr Gly Asp Arg Ala Gly Lys Glu Val Ser Gln Val Tyr Ile Lys 
625                 630                 635                 640 


Ala Pro Lys Gly Lys Ile Asp Lys Pro Phe Gln Glu Leu Lys Ala Phe 
                645                 650                 655     


His Lys Thr Lys Leu Leu Asn Pro Gly Glu Ser Glu Glu Ile Ser Leu 
            660                 665                 670         


Glu Ile Pro Leu Arg Asp Leu Ala Ser Phe Asp Gly Lys Glu Trp Val 
        675                 680                 685             


Val Glu Ser Gly Glu Tyr Glu Val Arg Val Gly Ala Ser Ser Arg Asp 
    690                 695                 700                 


Ile Arg Leu Arg Asp Ile Phe Leu Val Glu Gly Glu Lys Arg Phe Lys 
705                 710                 715                 720 


Pro 
    


<210> 255
<211> 1224
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 255
atgaaagtat tccgaaattc gatcatccgt aagtcggtgg tattgttctg tgctgttcta     60

tggatcttgc cagccggatt gtctctggcg gccaacaagc cgtttcccca gcatacgtct    120

tatacgagcg gttcgattaa accaaacaat gtaacgcagt cagcgatgga caatgcggtc    180

aagagcaaat ggaacagctg gaaagggtct ttcttgaagc cggctgcgac aggacaatat    240

tacgtaaaat acaattcggc gggcgagacg gtgtctgagg ctcatggcta cggaatgatt    300

ttcaccgttc tgatggcggg ttacgacagt aacgcacagt cgtatttcga cggactttat    360

cgctactata aggcgcatcc gagtaataac aatccgtatc tgatggcttg gaaacagaac    420

agcagctttc agaatataga gggagccaat tcggcaacgg atggcgatat ggacattgct    480

tacgcactcc tgcttgcgga caagcagtgg ggaagcagtg gatcgattaa ttatctccag    540

gcagctaagg atatgatcaa tgcgatcatg agtaatgacg ttaatcagtc gcagtggacg    600

ctgcgcttag gcgattgggc aaccagtggc atcttcgata ccgccacgcg gccatcggat    660

ttcatgctga accatatgaa ggcattccgt acggctaccg gcgatgcccg ttgggataac    720

gtcatcaaca aaacctatac gatcatcaac tccatctata acggttacag ctccaatacc    780

ggtttgcttc cggatttcgt tgtcatgtcg ggcggcaatt atcagcctgc ggcagcggaa    840

ttcctggagg gggcgaacga cggaaaatac tattacaact cggccaggac tccttggcgg    900

attacgaccg actatctgat gaccggcgat acgcgcgcgc tgaatcaatt gaacaaaatg    960

aacacgttca ttaagtcggc tgcgaacagc aatcctgcca atatcaaggc agggtataat   1020

ctgaacggaa ctgcgctggt gacttataac agcggagcgt tctatgcacc gttcggcgta   1080

agcgcgatga cgtcgtccag ccaccagagc tggctgaatt cggtatggaa ttatacggcg   1140

aacgcatctg cagagggtta ttatgaggag agcatcaagc tgttctcgat gatcgtcatg   1200

tcgggaaatt ggtggacata ttaa                                          1224

<210> 256
<211> 407
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(30)

<220> 
<221> DOMAIN
<222> (33)...(402)
<223> Glycosyl hydrolases family 8

<220> 
<221> SITE
<222> (50)...(53)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (65)...(68)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (142)...(145)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (154)...(172)
<223> Glycosyl hydrolases family 8 signature. Prosite id = PS00812

<220> 
<221> SITE
<222> (246)...(249)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (347)...(350)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (382)...(385)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (386)...(389)
<223> N-glycosylation site. Prosite id = PS00001

<400> 256
Met Lys Val Phe Arg Asn Ser Ile Ile Arg Lys Ser Val Val Leu Phe 
1               5                   10                  15      


Cys Ala Val Leu Trp Ile Leu Pro Ala Gly Leu Ser Leu Ala Ala Asn 
            20                  25                  30          


Lys Pro Phe Pro Gln His Thr Ser Tyr Thr Ser Gly Ser Ile Lys Pro 
        35                  40                  45              


Asn Asn Val Thr Gln Ser Ala Met Asp Asn Ala Val Lys Ser Lys Trp 
    50                  55                  60                  


Asn Ser Trp Lys Gly Ser Phe Leu Lys Pro Ala Ala Thr Gly Gln Tyr 
65                  70                  75                  80  


Tyr Val Lys Tyr Asn Ser Ala Gly Glu Thr Val Ser Glu Ala His Gly 
                85                  90                  95      


Tyr Gly Met Ile Phe Thr Val Leu Met Ala Gly Tyr Asp Ser Asn Ala 
            100                 105                 110         


Gln Ser Tyr Phe Asp Gly Leu Tyr Arg Tyr Tyr Lys Ala His Pro Ser 
        115                 120                 125             


Asn Asn Asn Pro Tyr Leu Met Ala Trp Lys Gln Asn Ser Ser Phe Gln 
    130                 135                 140                 


Asn Ile Glu Gly Ala Asn Ser Ala Thr Asp Gly Asp Met Asp Ile Ala 
145                 150                 155                 160 


Tyr Ala Leu Leu Leu Ala Asp Lys Gln Trp Gly Ser Ser Gly Ser Ile 
                165                 170                 175     


Asn Tyr Leu Gln Ala Ala Lys Asp Met Ile Asn Ala Ile Met Ser Asn 
            180                 185                 190         


Asp Val Asn Gln Ser Gln Trp Thr Leu Arg Leu Gly Asp Trp Ala Thr 
        195                 200                 205             


Ser Gly Ile Phe Asp Thr Ala Thr Arg Pro Ser Asp Phe Met Leu Asn 
    210                 215                 220                 


His Met Lys Ala Phe Arg Thr Ala Thr Gly Asp Ala Arg Trp Asp Asn 
225                 230                 235                 240 


Val Ile Asn Lys Thr Tyr Thr Ile Ile Asn Ser Ile Tyr Asn Gly Tyr 
                245                 250                 255     


Ser Ser Asn Thr Gly Leu Leu Pro Asp Phe Val Val Met Ser Gly Gly 
            260                 265                 270         


Asn Tyr Gln Pro Ala Ala Ala Glu Phe Leu Glu Gly Ala Asn Asp Gly 
        275                 280                 285             


Lys Tyr Tyr Tyr Asn Ser Ala Arg Thr Pro Trp Arg Ile Thr Thr Asp 
    290                 295                 300                 


Tyr Leu Met Thr Gly Asp Thr Arg Ala Leu Asn Gln Leu Asn Lys Met 
305                 310                 315                 320 


Asn Thr Phe Ile Lys Ser Ala Ala Asn Ser Asn Pro Ala Asn Ile Lys 
                325                 330                 335     


Ala Gly Tyr Asn Leu Asn Gly Thr Ala Leu Val Thr Tyr Asn Ser Gly 
            340                 345                 350         


Ala Phe Tyr Ala Pro Phe Gly Val Ser Ala Met Thr Ser Ser Ser His 
        355                 360                 365             


Gln Ser Trp Leu Asn Ser Val Trp Asn Tyr Thr Ala Asn Ala Ser Ala 
    370                 375                 380                 


Glu Gly Tyr Tyr Glu Glu Ser Ile Lys Leu Phe Ser Met Ile Val Met 
385                 390                 395                 400 


Ser Gly Asn Trp Trp Thr Tyr 
                405         


<210> 257
<211> 1500
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 257
atgaaacgtt cagtctctat ctttatcgca tgtttagtaa tgacagtatt aacaattagc     60

ggtgtcgcgg caccagaagc atctgcagca ggggcgaaaa cgcctgtagc ccttaatggc    120

cagcttagca ttaaaggtac tcagctagtc aatcaaaacg gaaaatcggt gcagctgaag    180

gggatcagct cacacggttt gcagtggttc ggcgattatg tcaataaaga ctctttaaaa    240

tggctaagag acgattgggg aattaccgtc ttccgagcgg caatgtacac ggctgaaggc    300

ggttatatag agaatccgtc tgtgaaaaat aaagtcaaag aagctgttga agcggcaaaa    360

gagctcggga tatatgtcat cattgactgg catattttaa atgacggcaa tccaaatcaa    420

aataaagaga aggcgaagga attctttaag gaaatgtcga gcctttacgg aagcacacca    480

aacgttattt atgaaattgc taatgaaccg aacggtgatg taaattggaa gcgcgatatc    540

aaaccgtatg cggaggaagt gatttccgtt atccgtaaaa atgacccgga taacatcatt    600

attaccggaa ctggcacttg gagtcaggat gtcaatgatg ctgctgatga tcagcttaag    660

gatgcaaacg tcatgtacgc gcttcatttt tatgcaggta cacacggcca gtatttaagg    720

gataaagccg attatgcgct cagcaaagga gcgccgattt ttgtaacgga atgggggacg    780

agtgacgctt ccggaaatgg cggggtcttc cttgaccagt cgagggaatg gctgaattat    840

ctcgacaaca agaaaatcag ctgggtaaac tggaaccttt ctgataagca ggaatcttcc    900

tcagctttaa agccgggggc atctaaaaca ggcggctggc cgttatcaga tttatccgct    960

tcagggacat ttgtaaggga aaagatccgt ggctcccaac attcgactga agacagatct   1020

gagacaccaa agcaagataa acccgtacag gaaaacagcc tatctgtgca atacagaaca   1080

ggggatggaa gtgtgaacag caaccaaatc cgtcctcaga tccatgtgaa aaacaacagc   1140

aagaccaccg ttaatttaaa aaatgtaact gtccgctact ggtataacac gaaaaacaaa   1200

ggccaaaact tcgactgtga ctacgcgaag atcggatgca gcaatgtgac gcacaagttt   1260

gtgacattac aaaaacctgt aaaaggtgca gatgcctatc tggaacttgg gtttaaaaac   1320

gggacactgt caccgggagc aaacactgga gaaatccaaa ttcgtcttca caatgaggat   1380

tggggcaatt attcacaaat cggggattat tctttttctc agtcaaatac gtttaaagat   1440

acaaaaaaaa tcacattata taataacgga aaactaattt ggggaactga acccaaatag   1500

<210> 258
<211> 499
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (47)...(301)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (356)...(437)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (164)...(173)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (296)...(299)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (383)...(386)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (393)...(396)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (421)...(424)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (446)...(449)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (470)...(473)
<223> N-glycosylation site. Prosite id = PS00001

<400> 258
Met Lys Arg Ser Val Ser Ile Phe Ile Ala Cys Leu Val Met Thr Val 
1               5                   10                  15      


Leu Thr Ile Ser Gly Val Ala Ala Pro Glu Ala Ser Ala Ala Gly Ala 
            20                  25                  30          


Lys Thr Pro Val Ala Leu Asn Gly Gln Leu Ser Ile Lys Gly Thr Gln 
        35                  40                  45              


Leu Val Asn Gln Asn Gly Lys Ser Val Gln Leu Lys Gly Ile Ser Ser 
    50                  55                  60                  


His Gly Leu Gln Trp Phe Gly Asp Tyr Val Asn Lys Asp Ser Leu Lys 
65                  70                  75                  80  


Trp Leu Arg Asp Asp Trp Gly Ile Thr Val Phe Arg Ala Ala Met Tyr 
                85                  90                  95      


Thr Ala Glu Gly Gly Tyr Ile Glu Asn Pro Ser Val Lys Asn Lys Val 
            100                 105                 110         


Lys Glu Ala Val Glu Ala Ala Lys Glu Leu Gly Ile Tyr Val Ile Ile 
        115                 120                 125             


Asp Trp His Ile Leu Asn Asp Gly Asn Pro Asn Gln Asn Lys Glu Lys 
    130                 135                 140                 


Ala Lys Glu Phe Phe Lys Glu Met Ser Ser Leu Tyr Gly Ser Thr Pro 
145                 150                 155                 160 


Asn Val Ile Tyr Glu Ile Ala Asn Glu Pro Asn Gly Asp Val Asn Trp 
                165                 170                 175     


Lys Arg Asp Ile Lys Pro Tyr Ala Glu Glu Val Ile Ser Val Ile Arg 
            180                 185                 190         


Lys Asn Asp Pro Asp Asn Ile Ile Ile Thr Gly Thr Gly Thr Trp Ser 
        195                 200                 205             


Gln Asp Val Asn Asp Ala Ala Asp Asp Gln Leu Lys Asp Ala Asn Val 
    210                 215                 220                 


Met Tyr Ala Leu His Phe Tyr Ala Gly Thr His Gly Gln Tyr Leu Arg 
225                 230                 235                 240 


Asp Lys Ala Asp Tyr Ala Leu Ser Lys Gly Ala Pro Ile Phe Val Thr 
                245                 250                 255     


Glu Trp Gly Thr Ser Asp Ala Ser Gly Asn Gly Gly Val Phe Leu Asp 
            260                 265                 270         


Gln Ser Arg Glu Trp Leu Asn Tyr Leu Asp Asn Lys Lys Ile Ser Trp 
        275                 280                 285             


Val Asn Trp Asn Leu Ser Asp Lys Gln Glu Ser Ser Ser Ala Leu Lys 
    290                 295                 300                 


Pro Gly Ala Ser Lys Thr Gly Gly Trp Pro Leu Ser Asp Leu Ser Ala 
305                 310                 315                 320 


Ser Gly Thr Phe Val Arg Glu Lys Ile Arg Gly Ser Gln His Ser Thr 
                325                 330                 335     


Glu Asp Arg Ser Glu Thr Pro Lys Gln Asp Lys Pro Val Gln Glu Asn 
            340                 345                 350         


Ser Leu Ser Val Gln Tyr Arg Thr Gly Asp Gly Ser Val Asn Ser Asn 
        355                 360                 365             


Gln Ile Arg Pro Gln Ile His Val Lys Asn Asn Ser Lys Thr Thr Val 
    370                 375                 380                 


Asn Leu Lys Asn Val Thr Val Arg Tyr Trp Tyr Asn Thr Lys Asn Lys 
385                 390                 395                 400 


Gly Gln Asn Phe Asp Cys Asp Tyr Ala Lys Ile Gly Cys Ser Asn Val 
                405                 410                 415     


Thr His Lys Phe Val Thr Leu Gln Lys Pro Val Lys Gly Ala Asp Ala 
            420                 425                 430         


Tyr Leu Glu Leu Gly Phe Lys Asn Gly Thr Leu Ser Pro Gly Ala Asn 
        435                 440                 445             


Thr Gly Glu Ile Gln Ile Arg Leu His Asn Glu Asp Trp Gly Asn Tyr 
    450                 455                 460                 


Ser Gln Ile Gly Asp Tyr Ser Phe Ser Gln Ser Asn Thr Phe Lys Asp 
465                 470                 475                 480 


Thr Lys Lys Ile Thr Leu Tyr Asn Asn Gly Lys Leu Ile Trp Gly Thr 
                485                 490                 495     


Glu Pro Lys 
            


<210> 259
<211> 942
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 259
atgctttctt ttgcaaccgc ttgtgctgaa gctcaatcgg ccgaaaatga agtagataca     60

tttgttggta aacatggatg gttatcagtg aaagggactt ccttgataga tgagaatggt    120

gaaactttag taatgaaagg tgttagttta gggtggcaca actggtggtc tcaattttat    180

aatgagtcta cagtaacatg gttgcagaac gattgggact gcaaccttgt tcgtgctgca    240

attggagtag agccgaatgg agcatatata gacaattcgg tcttagctaa caactgtctt    300

gacaacgtcg ttgatgctgc tatcaaaaat ggaatgtatg taatcatcga ttggcatagc    360

cataatgtga gggcggagga agcaaaagca ttttttacac gtgtggcgaa taaatataag    420

gattatccga atattatcta tgaaatattc aatgaacccg aacgcatatc ttgggaagag    480

gtaaaatcat attccgaaga gttaattaaa acgatcaggg ctattgataa gaaaaatgtt    540

atccttgttg gaactccgca ttgggatcag gacatacatt tagctgcgga taaccctatt    600

aaaggatatg ataacatcat gtatacttta catttttatg cagcaaccca taaaaaagaa    660

cttcgagatc gtgccgatta tgctttgaaa aaaggaatcc caatattcgt atccgaatgt    720

gctggtatgg aagctaccgg agacggacct attgaccaca atgaatggaa tacttgggta    780

gagtggatgg ctaaaaacaa tatcagctgg gtagcatggt ctatatctag taagaatgaa    840

acatgttcga tgataaaaga cgattcatct cccattagta attgggccac ggacgatttg    900

aaagaatggg gtgtacttgt taaaagtcta ttgaagaaat aa                       942

<210> 260
<211> 313
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (33)...(283)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (61)...(64)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (271)...(274)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (283)...(286)
<223> N-glycosylation site. Prosite id = PS00001

<400> 260
Met Leu Ser Phe Ala Thr Ala Cys Ala Glu Ala Gln Ser Ala Glu Asn 
1               5                   10                  15      


Glu Val Asp Thr Phe Val Gly Lys His Gly Trp Leu Ser Val Lys Gly 
            20                  25                  30          


Thr Ser Leu Ile Asp Glu Asn Gly Glu Thr Leu Val Met Lys Gly Val 
        35                  40                  45              


Ser Leu Gly Trp His Asn Trp Trp Ser Gln Phe Tyr Asn Glu Ser Thr 
    50                  55                  60                  


Val Thr Trp Leu Gln Asn Asp Trp Asp Cys Asn Leu Val Arg Ala Ala 
65                  70                  75                  80  


Ile Gly Val Glu Pro Asn Gly Ala Tyr Ile Asp Asn Ser Val Leu Ala 
                85                  90                  95      


Asn Asn Cys Leu Asp Asn Val Val Asp Ala Ala Ile Lys Asn Gly Met 
            100                 105                 110         


Tyr Val Ile Ile Asp Trp His Ser His Asn Val Arg Ala Glu Glu Ala 
        115                 120                 125             


Lys Ala Phe Phe Thr Arg Val Ala Asn Lys Tyr Lys Asp Tyr Pro Asn 
    130                 135                 140                 


Ile Ile Tyr Glu Ile Phe Asn Glu Pro Glu Arg Ile Ser Trp Glu Glu 
145                 150                 155                 160 


Val Lys Ser Tyr Ser Glu Glu Leu Ile Lys Thr Ile Arg Ala Ile Asp 
                165                 170                 175     


Lys Lys Asn Val Ile Leu Val Gly Thr Pro His Trp Asp Gln Asp Ile 
            180                 185                 190         


His Leu Ala Ala Asp Asn Pro Ile Lys Gly Tyr Asp Asn Ile Met Tyr 
        195                 200                 205             


Thr Leu His Phe Tyr Ala Ala Thr His Lys Lys Glu Leu Arg Asp Arg 
    210                 215                 220                 


Ala Asp Tyr Ala Leu Lys Lys Gly Ile Pro Ile Phe Val Ser Glu Cys 
225                 230                 235                 240 


Ala Gly Met Glu Ala Thr Gly Asp Gly Pro Ile Asp His Asn Glu Trp 
                245                 250                 255     


Asn Thr Trp Val Glu Trp Met Ala Lys Asn Asn Ile Ser Trp Val Ala 
            260                 265                 270         


Trp Ser Ile Ser Ser Lys Asn Glu Thr Cys Ser Met Ile Lys Asp Asp 
        275                 280                 285             


Ser Ser Pro Ile Ser Asn Trp Ala Thr Asp Asp Leu Lys Glu Trp Gly 
    290                 295                 300                 


Val Leu Val Lys Ser Leu Leu Lys Lys 
305                 310             


<210> 261
<211> 960
<212> DNA
<213> Pyrococcus furiosus VC1

<400> 261
atgagcaaga aaaagttcgt catcgtatct atcttaacaa tccttttagt acaggcaata     60

tattttgtag aaaagtatca tacctctgag gacaagtcaa cttcaaatac ctcatctaca    120

ccaccccaaa caacactttc cactaccaag gttctcaaga ttagataccc tgatgacggt    180

gagtggccag gagctcctat tgataaggat ggtgatggga acccagaatt ctacattgaa    240

ataaacctat ggaacattct taatgctact ggatttgctg agatgacgta caatttaacc    300

agcggcgtcc ttcactacgt ccaacaactt gacaacattg tcttgaggga tagaagtaat    360

tgggtgcatg gataccccga aatattctat ggaaacaagc catggaatgc aaactacgca    420

actgatggcc caataccatt acccagtaaa gtttcaaacc taacagactt ctatctaaca    480

atctcctata aacttgagcc caagaacggc ctgccaatta acttcgcaat agaatcctgg    540

ttaacgagag aagcttggag aacaacagga attaacagcg atgagcaaga agtaatgata    600

tggatttact atgacggatt acaaccggct ggctccaaag ttaaggagat tgtagtccca    660

ataatagtta acggaacacc agtaaatgct acatttgaag tatggaaggc aaacattggt    720

tgggagtatg ttgcatttag aataaagacc ccaatcaaag agggaacagt gacaattcca    780

tacggagcat ttataagtgt tgcagccaac atttcaagct taccaaatta cacagaactt    840

tacttagagg acgtggagat tggaactgag tttggaacgc caagcactac ctccgcccac    900

ctagagtggt ggatcacaaa cataacacta actcctctag atagacctct tatttcctaa    960

<210> 262
<211> 319
<212> PRT
<213> Pyrococcus furiosus VC1

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (143)...(311)
<223> Glycosyl hydrolase family 12

<220> 
<221> SITE
<222> (36)...(39)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (89)...(92)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (99)...(102)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (155)...(158)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (232)...(235)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (274)...(277)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (280)...(283)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (311)...(314)
<223> N-glycosylation site. Prosite id = PS00001

<400> 262
Met Ser Lys Lys Lys Phe Val Ile Val Ser Ile Leu Thr Ile Leu Leu 
1               5                   10                  15      


Val Gln Ala Ile Tyr Phe Val Glu Lys Tyr His Thr Ser Glu Asp Lys 
            20                  25                  30          


Ser Thr Ser Asn Thr Ser Ser Thr Pro Pro Gln Thr Thr Leu Ser Thr 
        35                  40                  45              


Thr Lys Val Leu Lys Ile Arg Tyr Pro Asp Asp Gly Glu Trp Pro Gly 
    50                  55                  60                  


Ala Pro Ile Asp Lys Asp Gly Asp Gly Asn Pro Glu Phe Tyr Ile Glu 
65                  70                  75                  80  


Ile Asn Leu Trp Asn Ile Leu Asn Ala Thr Gly Phe Ala Glu Met Thr 
                85                  90                  95      


Tyr Asn Leu Thr Ser Gly Val Leu His Tyr Val Gln Gln Leu Asp Asn 
            100                 105                 110         


Ile Val Leu Arg Asp Arg Ser Asn Trp Val His Gly Tyr Pro Glu Ile 
        115                 120                 125             


Phe Tyr Gly Asn Lys Pro Trp Asn Ala Asn Tyr Ala Thr Asp Gly Pro 
    130                 135                 140                 


Ile Pro Leu Pro Ser Lys Val Ser Asn Leu Thr Asp Phe Tyr Leu Thr 
145                 150                 155                 160 


Ile Ser Tyr Lys Leu Glu Pro Lys Asn Gly Leu Pro Ile Asn Phe Ala 
                165                 170                 175     


Ile Glu Ser Trp Leu Thr Arg Glu Ala Trp Arg Thr Thr Gly Ile Asn 
            180                 185                 190         


Ser Asp Glu Gln Glu Val Met Ile Trp Ile Tyr Tyr Asp Gly Leu Gln 
        195                 200                 205             


Pro Ala Gly Ser Lys Val Lys Glu Ile Val Val Pro Ile Ile Val Asn 
    210                 215                 220                 


Gly Thr Pro Val Asn Ala Thr Phe Glu Val Trp Lys Ala Asn Ile Gly 
225                 230                 235                 240 


Trp Glu Tyr Val Ala Phe Arg Ile Lys Thr Pro Ile Lys Glu Gly Thr 
                245                 250                 255     


Val Thr Ile Pro Tyr Gly Ala Phe Ile Ser Val Ala Ala Asn Ile Ser 
            260                 265                 270         


Ser Leu Pro Asn Tyr Thr Glu Leu Tyr Leu Glu Asp Val Glu Ile Gly 
        275                 280                 285             


Thr Glu Phe Gly Thr Pro Ser Thr Thr Ser Ala His Leu Glu Trp Trp 
    290                 295                 300                 


Ile Thr Asn Ile Thr Leu Thr Pro Leu Asp Arg Pro Leu Ile Ser 
305                 310                 315                 


<210> 263
<211> 1419
<212> DNA
<213> Pyrococcus furiosus VC1

<400> 263
atgaagttcc caaaaaactt catgtttgga tattcttggt ctggtttcca gtttgagatg     60

ggactgccag gaagtgaagt ggaaagcgac tggtgggtgt gggttcacga caaggagaac    120

atagcatcag gtctagtaag tggagatcta ccagagaacg gcccagcata ttggcacctc    180

tataagcaag atcatgacat tgcagaaaag ctaggaatgg attgtattag aggtggcatt    240

gagtgggcaa gaatttttcc aaagccaaca tttgacgtta aagttgatgt ggaaaaggat    300

gaagaaggca acataatttc cgtagacgtt ccagagagta caataaaaga gctagagaaa    360

attgccaaca tggaggccct tgaacattat cgcaagattt actcagactg gaaggagagg    420

ggcaaaacct tcatattaaa cctctaccac tggcctcttc cattatggat tcatgaccca    480

attgcagtaa ggaaacttgg cccggatagg gctcctgcag gatggttaga tgagaagaca    540

gtggtagagt ttgtgaagtt tgccgccttc gttgcttatc accttgatga cctcgttgac    600

atgtggagca caatgaacga accaaacgta gtctacaatc aaggttacat taatctacgt    660

tcaggatttc caccaggata tctaagcttt gaagcagcag aaaaggcaaa attcaactta    720

attcaggctc acatcggagc atatgatgcc ataaaagagt attcagaaaa atccgtggga    780

gtgatatacg cctttgcttg gcacgatcct ctagcggagg agtataagga tgaagtagag    840

gaaatcagaa agaaagacta tgagtttgta acaattctac actcaaaagg aaagctagac    900

tggatcggcg taaactacta ctccaggctg gtatatggag ccaaagatgg acacctagtt    960

cctttacctg gatatggatt tatgagtgag agaggaggat ttgcaaagtc aggaagacct   1020

gctagtgact ttggatggga aatgtaccca gagggccttg agaaccttct taagtattta   1080

aacaatgcct acgagctacc aatgataatt acagagaacg gtatggccga tgcagcagat   1140

agatacaggc cacactatct cgtaagccat ctaaaggcag tttacaatgc tatgaaagaa   1200

ggtgctgatg ttagagggta tctccactgg tctctaacag acaactacga atgggcccaa   1260

gggttcagga tgagatttgg attggtttac gtggatttcg agacaaagaa gagatattta   1320

aggccaagcg ccctggtatt cagagaaata gccactcaaa aagaaattcc agaagaatta   1380

gctcacctcg cagacctcaa atttgttaca agaaagtag                          1419

<210> 264
<211> 472
<212> PRT
<213> Pyrococcus furiosus VC1

<220> 
<221> DOMAIN
<222> (1)...(456)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (7)...(21)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (373)...(381)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 264
Met Lys Phe Pro Lys Asn Phe Met Phe Gly Tyr Ser Trp Ser Gly Phe 
1               5                   10                  15      


Gln Phe Glu Met Gly Leu Pro Gly Ser Glu Val Glu Ser Asp Trp Trp 
            20                  25                  30          


Val Trp Val His Asp Lys Glu Asn Ile Ala Ser Gly Leu Val Ser Gly 
        35                  40                  45              


Asp Leu Pro Glu Asn Gly Pro Ala Tyr Trp His Leu Tyr Lys Gln Asp 
    50                  55                  60                  


His Asp Ile Ala Glu Lys Leu Gly Met Asp Cys Ile Arg Gly Gly Ile 
65                  70                  75                  80  


Glu Trp Ala Arg Ile Phe Pro Lys Pro Thr Phe Asp Val Lys Val Asp 
                85                  90                  95      


Val Glu Lys Asp Glu Glu Gly Asn Ile Ile Ser Val Asp Val Pro Glu 
            100                 105                 110         


Ser Thr Ile Lys Glu Leu Glu Lys Ile Ala Asn Met Glu Ala Leu Glu 
        115                 120                 125             


His Tyr Arg Lys Ile Tyr Ser Asp Trp Lys Glu Arg Gly Lys Thr Phe 
    130                 135                 140                 


Ile Leu Asn Leu Tyr His Trp Pro Leu Pro Leu Trp Ile His Asp Pro 
145                 150                 155                 160 


Ile Ala Val Arg Lys Leu Gly Pro Asp Arg Ala Pro Ala Gly Trp Leu 
                165                 170                 175     


Asp Glu Lys Thr Val Val Glu Phe Val Lys Phe Ala Ala Phe Val Ala 
            180                 185                 190         


Tyr His Leu Asp Asp Leu Val Asp Met Trp Ser Thr Met Asn Glu Pro 
        195                 200                 205             


Asn Val Val Tyr Asn Gln Gly Tyr Ile Asn Leu Arg Ser Gly Phe Pro 
    210                 215                 220                 


Pro Gly Tyr Leu Ser Phe Glu Ala Ala Glu Lys Ala Lys Phe Asn Leu 
225                 230                 235                 240 


Ile Gln Ala His Ile Gly Ala Tyr Asp Ala Ile Lys Glu Tyr Ser Glu 
                245                 250                 255     


Lys Ser Val Gly Val Ile Tyr Ala Phe Ala Trp His Asp Pro Leu Ala 
            260                 265                 270         


Glu Glu Tyr Lys Asp Glu Val Glu Glu Ile Arg Lys Lys Asp Tyr Glu 
        275                 280                 285             


Phe Val Thr Ile Leu His Ser Lys Gly Lys Leu Asp Trp Ile Gly Val 
    290                 295                 300                 


Asn Tyr Tyr Ser Arg Leu Val Tyr Gly Ala Lys Asp Gly His Leu Val 
305                 310                 315                 320 


Pro Leu Pro Gly Tyr Gly Phe Met Ser Glu Arg Gly Gly Phe Ala Lys 
                325                 330                 335     


Ser Gly Arg Pro Ala Ser Asp Phe Gly Trp Glu Met Tyr Pro Glu Gly 
            340                 345                 350         


Leu Glu Asn Leu Leu Lys Tyr Leu Asn Asn Ala Tyr Glu Leu Pro Met 
        355                 360                 365             


Ile Ile Thr Glu Asn Gly Met Ala Asp Ala Ala Asp Arg Tyr Arg Pro 
    370                 375                 380                 


His Tyr Leu Val Ser His Leu Lys Ala Val Tyr Asn Ala Met Lys Glu 
385                 390                 395                 400 


Gly Ala Asp Val Arg Gly Tyr Leu His Trp Ser Leu Thr Asp Asn Tyr 
                405                 410                 415     


Glu Trp Ala Gln Gly Phe Arg Met Arg Phe Gly Leu Val Tyr Val Asp 
            420                 425                 430         


Phe Glu Thr Lys Lys Arg Tyr Leu Arg Pro Ser Ala Leu Val Phe Arg 
        435                 440                 445             


Glu Ile Ala Thr Gln Lys Glu Ile Pro Glu Glu Leu Ala His Leu Ala 
    450                 455                 460                 


Asp Leu Lys Phe Val Thr Arg Lys 
465                 470         


<210> 265
<211> 1434
<212> DNA
<213> Bacteria

<400> 265
atgaccgcgt ccgaaacccg gccgctcacg gccacccgct cgttcccggc cgacttcctc     60

tggggcgcgg ccaccgccgc gtaccagatc gagggagcgg cggccgagga cggccgtacc    120

ccgtccatct gggatacctt ctcgcacacc cccggcaagg tcttcgaggg ccacaccggt    180

gacgtggcgg tggaccacta ccaccggttc cgcgaggacg tcgcgatcat gtcggaactc    240

ggcctgaacg cctaccggtt ctccgtctcc tggtcccggg tgcagcccac cgggcggggt    300

ccggccgtcc agaaggggct ggacttctac cgggcgctcg tcgacgagct gctcgccgcc    360

gggatcgagc cggcgctcac cctctaccac tgggacctgc cgcaggagct ggaggacgcc    420

gggggctggc cggagcgggc gaccgcggag cgcttcgccg agtacgcggg gatcgtcgcg    480

ggcgccctcg gggaccgggt gacccgctgg accaccctca acgagccctg gtgcagcgcc    540

ttcctggggt acggctccgg ggtgcacgcc ccgggccgca cggacccggt cgcctcgctg    600

cgcgccgccc accacctcaa cctcgggcac ggtctcgcgg tgcaggcgct gcgggccgcg    660

ctgccggccg accggcagct cgcggtctcg ctcaacctgc acgaggtgcg gccgctgacc    720

cggtccgccg aggacctcga cgcggcccgc cgcatcgacg ccgtcggcaa ccggatctgg    780

ctcgggccga tgctggaggg cgcctacccc gaggacctga tcctcgacac ggcgcagctc    840

acggactggt ccttcgtcaa ggacggcgac ccggcggcga tcgcgcagcc gctggacctg    900

ctcgcgatca actactacac cccgacggtc gtctcgcacg ttccggaggg tgcggagaag    960

ccgcaggacg acggtcacgg caacagcgac cactcgccgt ggcccggggc ggacatggtc   1020

gccttccacc gggcgccggg cgagcggacg gcgatgggct ggccggtcga cgcgagcgcc   1080

ctgtacgacc tgctgacccg ggtgtcggac gcgtacccgc aactcccgct ggtcatcagc   1140

gagaacgggg cggcgtacga ggacgtggtg gcgccggacg gttcggtgca cgacccggag   1200

cgggcggcgt acgtgcacgc gcatctggag gccgtgcacc gcgcgctcgc ggacggcgtg   1260

gacgtgcgcg gctatttcct gtggtcgctg ctcgacaact tcgagtgggc gtacggctat   1320

gcgaagcgct tcggcgcggt ccgggtggac tacgacaccc tggagcggac gccgaagtcc   1380

agcgcccgct ggtacgcgcg ggtggcgcgg tcgggcgaac tgctggcgcc ctga         1434

<210> 266
<211> 477
<212> PRT
<213> Bacteria

<220> 
<221> DOMAIN
<222> (11)...(474)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (19)...(33)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (382)...(390)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 266
Met Thr Ala Ser Glu Thr Arg Pro Leu Thr Ala Thr Arg Ser Phe Pro 
1               5                   10                  15      


Ala Asp Phe Leu Trp Gly Ala Ala Thr Ala Ala Tyr Gln Ile Glu Gly 
            20                  25                  30          


Ala Ala Ala Glu Asp Gly Arg Thr Pro Ser Ile Trp Asp Thr Phe Ser 
        35                  40                  45              


His Thr Pro Gly Lys Val Phe Glu Gly His Thr Gly Asp Val Ala Val 
    50                  55                  60                  


Asp His Tyr His Arg Phe Arg Glu Asp Val Ala Ile Met Ser Glu Leu 
65                  70                  75                  80  


Gly Leu Asn Ala Tyr Arg Phe Ser Val Ser Trp Ser Arg Val Gln Pro 
                85                  90                  95      


Thr Gly Arg Gly Pro Ala Val Gln Lys Gly Leu Asp Phe Tyr Arg Ala 
            100                 105                 110         


Leu Val Asp Glu Leu Leu Ala Ala Gly Ile Glu Pro Ala Leu Thr Leu 
        115                 120                 125             


Tyr His Trp Asp Leu Pro Gln Glu Leu Glu Asp Ala Gly Gly Trp Pro 
    130                 135                 140                 


Glu Arg Ala Thr Ala Glu Arg Phe Ala Glu Tyr Ala Gly Ile Val Ala 
145                 150                 155                 160 


Gly Ala Leu Gly Asp Arg Val Thr Arg Trp Thr Thr Leu Asn Glu Pro 
                165                 170                 175     


Trp Cys Ser Ala Phe Leu Gly Tyr Gly Ser Gly Val His Ala Pro Gly 
            180                 185                 190         


Arg Thr Asp Pro Val Ala Ser Leu Arg Ala Ala His His Leu Asn Leu 
        195                 200                 205             


Gly His Gly Leu Ala Val Gln Ala Leu Arg Ala Ala Leu Pro Ala Asp 
    210                 215                 220                 


Arg Gln Leu Ala Val Ser Leu Asn Leu His Glu Val Arg Pro Leu Thr 
225                 230                 235                 240 


Arg Ser Ala Glu Asp Leu Asp Ala Ala Arg Arg Ile Asp Ala Val Gly 
                245                 250                 255     


Asn Arg Ile Trp Leu Gly Pro Met Leu Glu Gly Ala Tyr Pro Glu Asp 
            260                 265                 270         


Leu Ile Leu Asp Thr Ala Gln Leu Thr Asp Trp Ser Phe Val Lys Asp 
        275                 280                 285             


Gly Asp Pro Ala Ala Ile Ala Gln Pro Leu Asp Leu Leu Ala Ile Asn 
    290                 295                 300                 


Tyr Tyr Thr Pro Thr Val Val Ser His Val Pro Glu Gly Ala Glu Lys 
305                 310                 315                 320 


Pro Gln Asp Asp Gly His Gly Asn Ser Asp His Ser Pro Trp Pro Gly 
                325                 330                 335     


Ala Asp Met Val Ala Phe His Arg Ala Pro Gly Glu Arg Thr Ala Met 
            340                 345                 350         


Gly Trp Pro Val Asp Ala Ser Ala Leu Tyr Asp Leu Leu Thr Arg Val 
        355                 360                 365             


Ser Asp Ala Tyr Pro Gln Leu Pro Leu Val Ile Ser Glu Asn Gly Ala 
    370                 375                 380                 


Ala Tyr Glu Asp Val Val Ala Pro Asp Gly Ser Val His Asp Pro Glu 
385                 390                 395                 400 


Arg Ala Ala Tyr Val His Ala His Leu Glu Ala Val His Arg Ala Leu 
                405                 410                 415     


Ala Asp Gly Val Asp Val Arg Gly Tyr Phe Leu Trp Ser Leu Leu Asp 
            420                 425                 430         


Asn Phe Glu Trp Ala Tyr Gly Tyr Ala Lys Arg Phe Gly Ala Val Arg 
        435                 440                 445             


Val Asp Tyr Asp Thr Leu Glu Arg Thr Pro Lys Ser Ser Ala Arg Trp 
    450                 455                 460                 


Tyr Ala Arg Val Ala Arg Ser Gly Glu Leu Leu Ala Pro 
465                 470                 475         


<210> 267
<211> 1287
<212> DNA
<213> Bacteria

<400> 267
atgaacccac gcagcctccg gcgccgtacg accgccgccc tcgcggcgct cgcggcctgt     60

gccgccctcc tcgcgacgca ggcgcaggcg caggcgcagg ccgcacccgt cgcggacgcg    120

ctcacccccg ccacgaggtt ctacgtggac ccgaacagca aggccgccaa gcaggcgctc    180

accgacctcc ggaacggaga cttcgcgaac gccgtgaaca tggccaggct cgcgagctgg    240

cctcaggccg agtggttcac cgagggcaca cccgacgagg tccgggccaa ggtgaggagc    300

ctcgtccgcg aggcccggct ggtgaaccgg acccccgtcc tggtcgccta caacgtcccg    360

ggccgcgact gttcgcagta ctccagcggc ggcgccgcgt cctccgccgc ctaccggaag    420

tggatcgacg ccttcgccgc cggcatcggc gacagcaaag ccgtcgtcgt ggtcgagccc    480

gacggcctgg ccctcctccc gagcgactgc ggaccggggg tggacccgac cggcgaactc    540

acggcgaccc gcgtcgccga cctcacgtac gccgtcagga ccctcaaggc caaggcccgc    600

accacggtct acctcgacgc cggcaacgtc cagtggcggc ccgtcggcga gatggcgcgg    660

cgactgctcg acgccggcgt ccagcacagc gacggcttcg ccctcaacgt ctccaacacc    720

caccccaccg accacaacac gcggtacggc acctggatct cccggtgcat gtggtacgcc    780

accgaggggc ccgagtacgc ccgcggccac accgactggt gcgccaccca gtactactcg    840

cccgccgcgc ccaacgacgg cgcccccggc aacgcggtcg accccgccga ccccgccacc    900

tggcgctgga ccgacgcctg gttcgaccag aacgcgggca ccccgccgaa cgacgccctg    960

caccacttcg tcatcgacac cagccgcaac ggtctgggcg cctggacccc ggagcccggc   1020

aagtactccg gcgacccgga gatctggtgc aacgcacccg gccgcggcct cggcccccgc   1080

ccgaccgccg acaccggtgt cccgctcgtc gacgcctacc tgtgggtcaa gatccccggc   1140

gagtccgacg gcagctgcac gcgcaacacc ggcggcacga tcgaccccga gtacggcatc   1200

gtcgacccac ccgccggcgc ctggtggccc gcccaggccc acaccctggc ccgcaacgcg   1260

gcaccacgac tgaccttcaa ccactga                                       1287

<210> 268
<211> 428
<212> PRT
<213> Bacteria

<220> 
<221> SIGNAL
<222> (1)...(30)

<220> 
<221> DOMAIN
<222> (49)...(396)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (239)...(242)
<223> N-glycosylation site. Prosite id = PS00001

<400> 268
Met Asn Pro Arg Ser Leu Arg Arg Arg Thr Thr Ala Ala Leu Ala Ala 
1               5                   10                  15      


Leu Ala Ala Cys Ala Ala Leu Leu Ala Thr Gln Ala Gln Ala Gln Ala 
            20                  25                  30          


Gln Ala Ala Pro Val Ala Asp Ala Leu Thr Pro Ala Thr Arg Phe Tyr 
        35                  40                  45              


Val Asp Pro Asn Ser Lys Ala Ala Lys Gln Ala Leu Thr Asp Leu Arg 
    50                  55                  60                  


Asn Gly Asp Phe Ala Asn Ala Val Asn Met Ala Arg Leu Ala Ser Trp 
65                  70                  75                  80  


Pro Gln Ala Glu Trp Phe Thr Glu Gly Thr Pro Asp Glu Val Arg Ala 
                85                  90                  95      


Lys Val Arg Ser Leu Val Arg Glu Ala Arg Leu Val Asn Arg Thr Pro 
            100                 105                 110         


Val Leu Val Ala Tyr Asn Val Pro Gly Arg Asp Cys Ser Gln Tyr Ser 
        115                 120                 125             


Ser Gly Gly Ala Ala Ser Ser Ala Ala Tyr Arg Lys Trp Ile Asp Ala 
    130                 135                 140                 


Phe Ala Ala Gly Ile Gly Asp Ser Lys Ala Val Val Val Val Glu Pro 
145                 150                 155                 160 


Asp Gly Leu Ala Leu Leu Pro Ser Asp Cys Gly Pro Gly Val Asp Pro 
                165                 170                 175     


Thr Gly Glu Leu Thr Ala Thr Arg Val Ala Asp Leu Thr Tyr Ala Val 
            180                 185                 190         


Arg Thr Leu Lys Ala Lys Ala Arg Thr Thr Val Tyr Leu Asp Ala Gly 
        195                 200                 205             


Asn Val Gln Trp Arg Pro Val Gly Glu Met Ala Arg Arg Leu Leu Asp 
    210                 215                 220                 


Ala Gly Val Gln His Ser Asp Gly Phe Ala Leu Asn Val Ser Asn Thr 
225                 230                 235                 240 


His Pro Thr Asp His Asn Thr Arg Tyr Gly Thr Trp Ile Ser Arg Cys 
                245                 250                 255     


Met Trp Tyr Ala Thr Glu Gly Pro Glu Tyr Ala Arg Gly His Thr Asp 
            260                 265                 270         


Trp Cys Ala Thr Gln Tyr Tyr Ser Pro Ala Ala Pro Asn Asp Gly Ala 
        275                 280                 285             


Pro Gly Asn Ala Val Asp Pro Ala Asp Pro Ala Thr Trp Arg Trp Thr 
    290                 295                 300                 


Asp Ala Trp Phe Asp Gln Asn Ala Gly Thr Pro Pro Asn Asp Ala Leu 
305                 310                 315                 320 


His His Phe Val Ile Asp Thr Ser Arg Asn Gly Leu Gly Ala Trp Thr 
                325                 330                 335     


Pro Glu Pro Gly Lys Tyr Ser Gly Asp Pro Glu Ile Trp Cys Asn Ala 
            340                 345                 350         


Pro Gly Arg Gly Leu Gly Pro Arg Pro Thr Ala Asp Thr Gly Val Pro 
        355                 360                 365             


Leu Val Asp Ala Tyr Leu Trp Val Lys Ile Pro Gly Glu Ser Asp Gly 
    370                 375                 380                 


Ser Cys Thr Arg Asn Thr Gly Gly Thr Ile Asp Pro Glu Tyr Gly Ile 
385                 390                 395                 400 


Val Asp Pro Pro Ala Gly Ala Trp Trp Pro Ala Gln Ala His Thr Leu 
                405                 410                 415     


Ala Arg Asn Ala Ala Pro Arg Leu Thr Phe Asn His 
            420                 425             


<210> 269
<211> 966
<212> DNA
<213> Bacteria

<400> 269
atgcgccgca gaatccgcgc cctcgtcgca gccctctccg cactgccgtt ggcgctcgtc     60

gtcgccccgt ccgcccacgc ggcggatccc accaccatga ccagcgggtt ctacacggac    120

cccgactcca gcgcgaagaa gtgggtcgcc gccaaccccg gcgacggccg ggcccccgcg    180

atcagcacct ccctcgccaa cacccccatg gcccgctggt tcggcgcctg gagcggcacg    240

atcggcaccg cggcgggcgc gtacgtcggc gcggcggacc agcaggacaa gctgcccatc    300

ctggtcgcgt acaacatcta cctccgcgac tcctgcggcg ggcactccgg cggcggggcc    360

ccgtcggcct ccgcgtacgc gacctggatc gcgcagttcg ccggcgggat cgcgggccgc    420

ccggccgtcg tcctcctcga acccgactcc ctcgcggact acggctgcct gaaccagacc    480

cagatccgcg aacgccaggg catgatcagc ggcgcgctcg ccgagttcaa ccggcagtcc    540

cccaacacgt gggtctacct ggacgccggc aaccccggct gggtgagcgc ggcgacgatg    600

gcccagcgcc tccacgaagc cggcctccgc caggcacacg gcttctccct caacatctcg    660

aactactaca ccaccgacca gaacaccgcc tacggcaacg cagtcaacag cgaactggcc    720

gcccgctacg gctacaccaa gccgttcgtc gtcgacacca gccgcaacgg caagggctcc    780

aacggcgagt ggtgcaacgc ggccggccgc cgcatcggca cgcccacccg gctgggcggg    840

ggcgccgaga tgctgctgtg gatcaaggcc ccgggcgagt ccgacggcaa ctgcggcgtc    900

gggaccggtt ccacggccgg ccagttcctc ccggaggccg cctacaagat gatctacggc    960

tactga                                                               966

<210> 270
<211> 321
<212> PRT
<213> Bacteria

<220> 
<221> SIGNAL
<222> (1)...(27)

<220> 
<221> DOMAIN
<222> (39)...(311)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (145)...(154)
<223> Glycosyl hydrolases family 6 signature 2. Prosite id = PS00656

<220> 
<221> SITE
<222> (160)...(163)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (221)...(224)
<223> N-glycosylation site. Prosite id = PS00001

<400> 270
Met Arg Arg Arg Ile Arg Ala Leu Val Ala Ala Leu Ser Ala Leu Pro 
1               5                   10                  15      


Leu Ala Leu Val Val Ala Pro Ser Ala His Ala Ala Asp Pro Thr Thr 
            20                  25                  30          


Met Thr Ser Gly Phe Tyr Thr Asp Pro Asp Ser Ser Ala Lys Lys Trp 
        35                  40                  45              


Val Ala Ala Asn Pro Gly Asp Gly Arg Ala Pro Ala Ile Ser Thr Ser 
    50                  55                  60                  


Leu Ala Asn Thr Pro Met Ala Arg Trp Phe Gly Ala Trp Ser Gly Thr 
65                  70                  75                  80  


Ile Gly Thr Ala Ala Gly Ala Tyr Val Gly Ala Ala Asp Gln Gln Asp 
                85                  90                  95      


Lys Leu Pro Ile Leu Val Ala Tyr Asn Ile Tyr Leu Arg Asp Ser Cys 
            100                 105                 110         


Gly Gly His Ser Gly Gly Gly Ala Pro Ser Ala Ser Ala Tyr Ala Thr 
        115                 120                 125             


Trp Ile Ala Gln Phe Ala Gly Gly Ile Ala Gly Arg Pro Ala Val Val 
    130                 135                 140                 


Leu Leu Glu Pro Asp Ser Leu Ala Asp Tyr Gly Cys Leu Asn Gln Thr 
145                 150                 155                 160 


Gln Ile Arg Glu Arg Gln Gly Met Ile Ser Gly Ala Leu Ala Glu Phe 
                165                 170                 175     


Asn Arg Gln Ser Pro Asn Thr Trp Val Tyr Leu Asp Ala Gly Asn Pro 
            180                 185                 190         


Gly Trp Val Ser Ala Ala Thr Met Ala Gln Arg Leu His Glu Ala Gly 
        195                 200                 205             


Leu Arg Gln Ala His Gly Phe Ser Leu Asn Ile Ser Asn Tyr Tyr Thr 
    210                 215                 220                 


Thr Asp Gln Asn Thr Ala Tyr Gly Asn Ala Val Asn Ser Glu Leu Ala 
225                 230                 235                 240 


Ala Arg Tyr Gly Tyr Thr Lys Pro Phe Val Val Asp Thr Ser Arg Asn 
                245                 250                 255     


Gly Lys Gly Ser Asn Gly Glu Trp Cys Asn Ala Ala Gly Arg Arg Ile 
            260                 265                 270         


Gly Thr Pro Thr Arg Leu Gly Gly Gly Ala Glu Met Leu Leu Trp Ile 
        275                 280                 285             


Lys Ala Pro Gly Glu Ser Asp Gly Asn Cys Gly Val Gly Thr Gly Ser 
    290                 295                 300                 


Thr Ala Gly Gln Phe Leu Pro Glu Ala Ala Tyr Lys Met Ile Tyr Gly 
305                 310                 315                 320 


Tyr 
    


<210> 271
<211> 1404
<212> DNA
<213> Bacteria

<400> 271
atgaccgcgc tcgacgcccg caccgacacc acgaccgtcc tccgtttccc cgcgggcttc     60

cgctggggca ccgccacggc cgcctaccag atcgaggggg cggcgacgga ggacggccgc    120

accccgtcca tctgggacac cttcagccgc acgcccggca aggtgcgcaa cggcgacacc    180

ggtgacatcg ccgccgacca ctaccaccgg gtcgacgagg acgtcgccct gatgcggcgg    240

ctcggcgtga ccgactaccg cttctcgatc gcctggcccc gggtgcagcc caccgggcgc    300

ggcccggccg tgcgcaaggg cctggacttc taccggcggc tggtcgaccg cctcctcgac    360

gccggcatcc gtcccgtggc gacgctctac cactgggacc tgccgcagga gttggaggac    420

gccgggggct ggccgcagcg ggagaccgcg taccgcttcg cggagtacgc gggcatcatg    480

gcggacgccc tcggcgaccg ggtggcgacc tggacgacgc tcaacgagcc ctggtgcgcg    540

gccttcctcg gctacggcaa cggcgtgcac gcgccgggcc gcaccagcgc cgtcgcctcg    600

ctgcgggcgg cccaccacct caacctcgcg cacggcctcg cggcgcgcac cctgcgcggg    660

cggctgcccg gcgctgcgga ggtgtcgctg accctcaacc tccatgcggt gcggccctgt    720

tcgcaggcgc cggaggatct ggacgcggcc cgccggatcg acgcggtcgg caaccggatc    780

ttcctcgacc ccgtcttcca cggccggctc ccggaagacc tcgtgcggga caccgccccg    840

gtcacggact ggtccttcgt ggccgacgga gacctggcgg cggcggccgc gccgatcgac    900

tcgctcggca tcaactacta ctccccgtcc gtcgtcggcg ccggcacgtc ggagtcgccc    960

tcgccgtggg cgggcgcgga gcggcacgtc cggttcgagc ccgcgccggg gccgcggacg   1020

gcgatggact ggccggtgga cgcggacggt ctgtacgagc tgctgacccg gctgcgggac   1080

gagctccctg acgtaccgct ggtgatcacc gagaacgggg cggcgtacga cgactacgcc   1140

gacccctccg ggaacgtgaa ggacccggag cgggtggcgt acctccacgc ccacctggcg   1200

gcggtgcacc gggcgctggc ggacggcgcc gacgtccgcg ggtacttcct ctggtcgctc   1260

ctggacaact tcgagtgggc gtacggctac agcaagcgct tcgggatcgt gcacgtggac   1320

ttcgcgacgc agcgccggac gctgaaggac agcgcccggt ggtacgcgga ggtcatcgcg   1380

cgcggcggtc tggaaggggc ctga                                          1404

<210> 272
<211> 467
<212> PRT
<213> Bacteria

<220> 
<221> DOMAIN
<222> (12)...(464)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (20)...(34)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (372)...(380)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 272
Met Thr Ala Leu Asp Ala Arg Thr Asp Thr Thr Thr Val Leu Arg Phe 
1               5                   10                  15      


Pro Ala Gly Phe Arg Trp Gly Thr Ala Thr Ala Ala Tyr Gln Ile Glu 
            20                  25                  30          


Gly Ala Ala Thr Glu Asp Gly Arg Thr Pro Ser Ile Trp Asp Thr Phe 
        35                  40                  45              


Ser Arg Thr Pro Gly Lys Val Arg Asn Gly Asp Thr Gly Asp Ile Ala 
    50                  55                  60                  


Ala Asp His Tyr His Arg Val Asp Glu Asp Val Ala Leu Met Arg Arg 
65                  70                  75                  80  


Leu Gly Val Thr Asp Tyr Arg Phe Ser Ile Ala Trp Pro Arg Val Gln 
                85                  90                  95      


Pro Thr Gly Arg Gly Pro Ala Val Arg Lys Gly Leu Asp Phe Tyr Arg 
            100                 105                 110         


Arg Leu Val Asp Arg Leu Leu Asp Ala Gly Ile Arg Pro Val Ala Thr 
        115                 120                 125             


Leu Tyr His Trp Asp Leu Pro Gln Glu Leu Glu Asp Ala Gly Gly Trp 
    130                 135                 140                 


Pro Gln Arg Glu Thr Ala Tyr Arg Phe Ala Glu Tyr Ala Gly Ile Met 
145                 150                 155                 160 


Ala Asp Ala Leu Gly Asp Arg Val Ala Thr Trp Thr Thr Leu Asn Glu 
                165                 170                 175     


Pro Trp Cys Ala Ala Phe Leu Gly Tyr Gly Asn Gly Val His Ala Pro 
            180                 185                 190         


Gly Arg Thr Ser Ala Val Ala Ser Leu Arg Ala Ala His His Leu Asn 
        195                 200                 205             


Leu Ala His Gly Leu Ala Ala Arg Thr Leu Arg Gly Arg Leu Pro Gly 
    210                 215                 220                 


Ala Ala Glu Val Ser Leu Thr Leu Asn Leu His Ala Val Arg Pro Cys 
225                 230                 235                 240 


Ser Gln Ala Pro Glu Asp Leu Asp Ala Ala Arg Arg Ile Asp Ala Val 
                245                 250                 255     


Gly Asn Arg Ile Phe Leu Asp Pro Val Phe His Gly Arg Leu Pro Glu 
            260                 265                 270         


Asp Leu Val Arg Asp Thr Ala Pro Val Thr Asp Trp Ser Phe Val Ala 
        275                 280                 285             


Asp Gly Asp Leu Ala Ala Ala Ala Ala Pro Ile Asp Ser Leu Gly Ile 
    290                 295                 300                 


Asn Tyr Tyr Ser Pro Ser Val Val Gly Ala Gly Thr Ser Glu Ser Pro 
305                 310                 315                 320 


Ser Pro Trp Ala Gly Ala Glu Arg His Val Arg Phe Glu Pro Ala Pro 
                325                 330                 335     


Gly Pro Arg Thr Ala Met Asp Trp Pro Val Asp Ala Asp Gly Leu Tyr 
            340                 345                 350         


Glu Leu Leu Thr Arg Leu Arg Asp Glu Leu Pro Asp Val Pro Leu Val 
        355                 360                 365             


Ile Thr Glu Asn Gly Ala Ala Tyr Asp Asp Tyr Ala Asp Pro Ser Gly 
    370                 375                 380                 


Asn Val Lys Asp Pro Glu Arg Val Ala Tyr Leu His Ala His Leu Ala 
385                 390                 395                 400 


Ala Val His Arg Ala Leu Ala Asp Gly Ala Asp Val Arg Gly Tyr Phe 
                405                 410                 415     


Leu Trp Ser Leu Leu Asp Asn Phe Glu Trp Ala Tyr Gly Tyr Ser Lys 
            420                 425                 430         


Arg Phe Gly Ile Val His Val Asp Phe Ala Thr Gln Arg Arg Thr Leu 
        435                 440                 445             


Lys Asp Ser Ala Arg Trp Tyr Ala Glu Val Ile Ala Arg Gly Gly Leu 
    450                 455                 460                 


Glu Gly Ala 
465         


<210> 273
<211> 1410
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 273
atggacgcgg gggatatgaa catgaagttt gaaaatcgaa tcggacgatt cacccgatgg     60

tgctcgctcg tggcgatcgt cggcgtcgcg ccggcgttcg ccgacgtggc gcccttgtcg    120

gtgagcggca accagatccg cgcgggcggg caacccgcga gcttcgcggg caacagtctc    180

ttctggagca acacgggctg gggcggcgag aagtactaca acgcgaacgt ggtccggtgg    240

ctcaagacgg actggaagtc gacgatcgtg cgcgcggcga tgggcgtcga cgaccagggc    300

gggttcctgc aggacccgac cggcaatcgc aatcgcgtga aggccgtggt cgacgcggcg    360

atcgcgaacg acctgtacgt gatcatcgac tggcactcgc accacgcgga gaactaccgc    420

agccagtcga tcgcgttctt ccaggagatg gcgcgcacgt acggtcatcg caatcacgtg    480

atctacgaga tctacaacga gccgctgcag gtgtcgtgga gcgggacgat caagccgtat    540

gcgcaggccg tgatcagcgc gatccgcgcg atcgatcccg acaacctgat cgtggtcggc    600

acgccgacgt ggtcgcagga cgtggacgtc gcggcggcgg atccgatcgc cggcacgaac    660

atcgcgtaca cgctgcactt ctacgccggc acgcacgggc agtacctgcg cgacaaggcg    720

cagactgcgc tgaatcgcgg cgtggcgctg ttcgtgacgg agtggggctc ggtgaacgcg    780

aacggcgatg gcgccgtcgc gaccgcggag accaacaact ggatgacgtt cctcaagtcg    840

cgcggcatca gtcacgcgaa ctgggcgacg aacgacaagg cggaaggcgc ttccgcgctc    900

gtgccgggcg cgagcacctc gggcaactgg acggcaaacc agctgacggc gtcgggcgcg    960

ctcgcgaagc agatcatctc ggggtggggc ggcacgacgc cgaatccgcc gggtaacgtc   1020

atcgcgacga tccaggccga ggcgttcagc cagatgagcg gcatccagac cgagaacacc   1080

acggactcgg gcggcggcac gaacgtcggg tggatagacg ccggcgactg gctctcgtac   1140

cagaactcgc ccgtgaccat cccggcgacg ggcacctacc gcatcgagta ccgggtcgcg   1200

agcctgaacg gcggcggcgg gctcaggctc gaggctgcgg gcggcagtcc ggtgtatggc   1260

caactcgcgg tcccgggcac ggccggctgg cagaactgga cgaccatctc gcacacggtc   1320

acgttgaacg ccggcacgct gcgcttcggc atcaacgcca tctccggggg ctggaacctc   1380

aactggttcc gcatcgtccg cgtgagttga                                    1410

<210> 274
<211> 469
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(34)

<220> 
<221> DOMAIN
<222> (43)...(298)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (345)...(467)
<223> Carbohydrate binding module (family 6)

<220> 
<221> SITE
<222> (162)...(171)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (313)...(316)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (364)...(367)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (438)...(441)
<223> N-glycosylation site. Prosite id = PS00001

<400> 274
Met Asp Ala Gly Asp Met Asn Met Lys Phe Glu Asn Arg Ile Gly Arg 
1               5                   10                  15      


Phe Thr Arg Trp Cys Ser Leu Val Ala Ile Val Gly Val Ala Pro Ala 
            20                  25                  30          


Phe Ala Asp Val Ala Pro Leu Ser Val Ser Gly Asn Gln Ile Arg Ala 
        35                  40                  45              


Gly Gly Gln Pro Ala Ser Phe Ala Gly Asn Ser Leu Phe Trp Ser Asn 
    50                  55                  60                  


Thr Gly Trp Gly Gly Glu Lys Tyr Tyr Asn Ala Asn Val Val Arg Trp 
65                  70                  75                  80  


Leu Lys Thr Asp Trp Lys Ser Thr Ile Val Arg Ala Ala Met Gly Val 
                85                  90                  95      


Asp Asp Gln Gly Gly Phe Leu Gln Asp Pro Thr Gly Asn Arg Asn Arg 
            100                 105                 110         


Val Lys Ala Val Val Asp Ala Ala Ile Ala Asn Asp Leu Tyr Val Ile 
        115                 120                 125             


Ile Asp Trp His Ser His His Ala Glu Asn Tyr Arg Ser Gln Ser Ile 
    130                 135                 140                 


Ala Phe Phe Gln Glu Met Ala Arg Thr Tyr Gly His Arg Asn His Val 
145                 150                 155                 160 


Ile Tyr Glu Ile Tyr Asn Glu Pro Leu Gln Val Ser Trp Ser Gly Thr 
                165                 170                 175     


Ile Lys Pro Tyr Ala Gln Ala Val Ile Ser Ala Ile Arg Ala Ile Asp 
            180                 185                 190         


Pro Asp Asn Leu Ile Val Val Gly Thr Pro Thr Trp Ser Gln Asp Val 
        195                 200                 205             


Asp Val Ala Ala Ala Asp Pro Ile Ala Gly Thr Asn Ile Ala Tyr Thr 
    210                 215                 220                 


Leu His Phe Tyr Ala Gly Thr His Gly Gln Tyr Leu Arg Asp Lys Ala 
225                 230                 235                 240 


Gln Thr Ala Leu Asn Arg Gly Val Ala Leu Phe Val Thr Glu Trp Gly 
                245                 250                 255     


Ser Val Asn Ala Asn Gly Asp Gly Ala Val Ala Thr Ala Glu Thr Asn 
            260                 265                 270         


Asn Trp Met Thr Phe Leu Lys Ser Arg Gly Ile Ser His Ala Asn Trp 
        275                 280                 285             


Ala Thr Asn Asp Lys Ala Glu Gly Ala Ser Ala Leu Val Pro Gly Ala 
    290                 295                 300                 


Ser Thr Ser Gly Asn Trp Thr Ala Asn Gln Leu Thr Ala Ser Gly Ala 
305                 310                 315                 320 


Leu Ala Lys Gln Ile Ile Ser Gly Trp Gly Gly Thr Thr Pro Asn Pro 
                325                 330                 335     


Pro Gly Asn Val Ile Ala Thr Ile Gln Ala Glu Ala Phe Ser Gln Met 
            340                 345                 350         


Ser Gly Ile Gln Thr Glu Asn Thr Thr Asp Ser Gly Gly Gly Thr Asn 
        355                 360                 365             


Val Gly Trp Ile Asp Ala Gly Asp Trp Leu Ser Tyr Gln Asn Ser Pro 
    370                 375                 380                 


Val Thr Ile Pro Ala Thr Gly Thr Tyr Arg Ile Glu Tyr Arg Val Ala 
385                 390                 395                 400 


Ser Leu Asn Gly Gly Gly Gly Leu Arg Leu Glu Ala Ala Gly Gly Ser 
                405                 410                 415     


Pro Val Tyr Gly Gln Leu Ala Val Pro Gly Thr Ala Gly Trp Gln Asn 
            420                 425                 430         


Trp Thr Thr Ile Ser His Thr Val Thr Leu Asn Ala Gly Thr Leu Arg 
        435                 440                 445             


Phe Gly Ile Asn Ala Ile Ser Gly Gly Trp Asn Leu Asn Trp Phe Arg 
    450                 455                 460                 


Ile Val Arg Val Ser 
465                 


<210> 275
<211> 876
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 275
atgttcggta acaataaaac tgtccggttg accgttgttt cagggctgac catgttggcc     60

gccggttgtg cgaccgcgcc atgcgagcag cctgtcgccg cggcaccggc gccgtccgcc    120

tcgaccaccg gtggcttacc cgcgctgctg tcggagggca ccgtgataaa acagtcggcg    180

gcggactggg ccaaatttac tatcgagaac tacgaagtca tcaacaacgt ctggaataag    240

aacgccgctt cgggtccgta caaccaggaa atattcataa aggaagacaa ggacgggaat    300

caggtattcg gctggagatg gcgcggcaag ggcggcaatg tgctcgcgta tccggaagtg    360

aacatcggcg ccaagccctg ggatccgccg ccgtcgctga agtccgattt cccgttcgcc    420

gtcggtgcca aggacatcgt cgtcgatttc gacgtcacgc tcaaggcgag cggccgctac    480

aacatggcgt tcgagctctg ggtcgtcaaa gcgctgccgc cgacgcaggc gacgatctcg    540

aaagaaatca tgatctggaa ccacaacagc gggatgacgc cacagggatc gtattccggc    600

acgatcgagg ttggcggcgt taaatacgac gcctacatcc gtgccgtcca tggcgacgaa    660

tcgggcgcga atgccaataa atggtcgtat atggcctttg tcgcgcagac gtccgtcttc    720

aagggcagct tgccgctcaa acccttcatc gacttcttgg tgcagaaggg cgccctgagc    780

agcaaggact acatcgccaa tctcgagtgg ggcaatgaag tcatcgaagg cgaaggcgtg    840

gcggaaatcc gccgcttcag ggtgaaggcg caatag                              876

<210> 276
<211> 291
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(34)

<220> 
<221> DOMAIN
<222> (133)...(291)
<223> Glycosyl hydrolase family 12

<220> 
<221> SITE
<222> (5)...(8)
<223> N-glycosylation site. Prosite id = PS00001

<400> 276
Met Phe Gly Asn Asn Lys Thr Val Arg Leu Thr Val Val Ser Gly Leu 
1               5                   10                  15      


Thr Met Leu Ala Ala Gly Cys Ala Thr Ala Pro Cys Glu Gln Pro Val 
            20                  25                  30          


Ala Ala Ala Pro Ala Pro Ser Ala Ser Thr Thr Gly Gly Leu Pro Ala 
        35                  40                  45              


Leu Leu Ser Glu Gly Thr Val Ile Lys Gln Ser Ala Ala Asp Trp Ala 
    50                  55                  60                  


Lys Phe Thr Ile Glu Asn Tyr Glu Val Ile Asn Asn Val Trp Asn Lys 
65                  70                  75                  80  


Asn Ala Ala Ser Gly Pro Tyr Asn Gln Glu Ile Phe Ile Lys Glu Asp 
                85                  90                  95      


Lys Asp Gly Asn Gln Val Phe Gly Trp Arg Trp Arg Gly Lys Gly Gly 
            100                 105                 110         


Asn Val Leu Ala Tyr Pro Glu Val Asn Ile Gly Ala Lys Pro Trp Asp 
        115                 120                 125             


Pro Pro Pro Ser Leu Lys Ser Asp Phe Pro Phe Ala Val Gly Ala Lys 
    130                 135                 140                 


Asp Ile Val Val Asp Phe Asp Val Thr Leu Lys Ala Ser Gly Arg Tyr 
145                 150                 155                 160 


Asn Met Ala Phe Glu Leu Trp Val Val Lys Ala Leu Pro Pro Thr Gln 
                165                 170                 175     


Ala Thr Ile Ser Lys Glu Ile Met Ile Trp Asn His Asn Ser Gly Met 
            180                 185                 190         


Thr Pro Gln Gly Ser Tyr Ser Gly Thr Ile Glu Val Gly Gly Val Lys 
        195                 200                 205             


Tyr Asp Ala Tyr Ile Arg Ala Val His Gly Asp Glu Ser Gly Ala Asn 
    210                 215                 220                 


Ala Asn Lys Trp Ser Tyr Met Ala Phe Val Ala Gln Thr Ser Val Phe 
225                 230                 235                 240 


Lys Gly Ser Leu Pro Leu Lys Pro Phe Ile Asp Phe Leu Val Gln Lys 
                245                 250                 255     


Gly Ala Leu Ser Ser Lys Asp Tyr Ile Ala Asn Leu Glu Trp Gly Asn 
            260                 265                 270         


Glu Val Ile Glu Gly Glu Gly Val Ala Glu Ile Arg Arg Phe Arg Val 
        275                 280                 285             


Lys Ala Gln 
    290     


<210> 277
<211> 1011
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 277
atgaaaacca aatcaattta ttctatcgca atcctgtcaa tcgcgctgtt cttttttaca     60

actgcgcaaa ccttttcaca aaccccggta gagttgcacg gacgactcag ggtatcggga    120

aaccagattg tgggtgaaca tggcaacccg gtacagctga tgggcatgag cctgttctgg    180

tctgtctggg gtgccgagaa gtattacaat gcagaagtgg taaactggct tgtaaaagac    240

tggaagattg acctgatccg tgctgccatt gccgtggaag ttaaccagga aggcgatgga    300

aacaaaggat ggctattcaa caaggaggga caatacaaac tagccgaaac cattatccag    360

gcagccattg acaatgggat ttatgtattg gtagattggc atacccatcg cacccatacc    420

gatgctggca tcgagttttt cggttacctg gcccaaaagt atggccggca ccccaacctg    480

atttgggaaa ccttcaacga gccggtaaac caaagctggg aagagatcgc tgagtttacc    540

aatgctgtga ccggtgccat tcgcccttac agcgataacc tgatcattgc cggtacacgc    600

cgatggagcc agctggtgaa cgaacctgcc gacaatccgc ttcccgacaa aaacactgct    660

tattccctgc acttctatgc cggaacccat ggccaggaat tgcgcgatat tggtgattat    720

gccctttcaa aaggtatcgc cttattcatt accgaatggg ggacctccca tgccgatggc    780

gggcgcgata tgattgtaca cgaagagaaa gcacaggaat ggatcgattg ggcaatggag    840

cgcaacctta gcatggccaa ctggtcattg tttgacaagg aagaggcttc agccgctctt    900

aatcccgatg ccccggtcaa cggaaactgg gatcccgaaa aacatctttc caaatcggga    960

aggtttgtaa gggatcaaat catacgcatt aatagtaaaa aacacaacta a            1011

<210> 278
<211> 336
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(26)

<220> 
<221> DOMAIN
<222> (41)...(298)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (162)...(171)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (172)...(175)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (286)...(289)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 278
Met Lys Thr Lys Ser Ile Tyr Ser Ile Ala Ile Leu Ser Ile Ala Leu 
1               5                   10                  15      


Phe Phe Phe Thr Thr Ala Gln Thr Phe Ser Gln Thr Pro Val Glu Leu 
            20                  25                  30          


His Gly Arg Leu Arg Val Ser Gly Asn Gln Ile Val Gly Glu His Gly 
        35                  40                  45              


Asn Pro Val Gln Leu Met Gly Met Ser Leu Phe Trp Ser Val Trp Gly 
    50                  55                  60                  


Ala Glu Lys Tyr Tyr Asn Ala Glu Val Val Asn Trp Leu Val Lys Asp 
65                  70                  75                  80  


Trp Lys Ile Asp Leu Ile Arg Ala Ala Ile Ala Val Glu Val Asn Gln 
                85                  90                  95      


Glu Gly Asp Gly Asn Lys Gly Trp Leu Phe Asn Lys Glu Gly Gln Tyr 
            100                 105                 110         


Lys Leu Ala Glu Thr Ile Ile Gln Ala Ala Ile Asp Asn Gly Ile Tyr 
        115                 120                 125             


Val Leu Val Asp Trp His Thr His Arg Thr His Thr Asp Ala Gly Ile 
    130                 135                 140                 


Glu Phe Phe Gly Tyr Leu Ala Gln Lys Tyr Gly Arg His Pro Asn Leu 
145                 150                 155                 160 


Ile Trp Glu Thr Phe Asn Glu Pro Val Asn Gln Ser Trp Glu Glu Ile 
                165                 170                 175     


Ala Glu Phe Thr Asn Ala Val Thr Gly Ala Ile Arg Pro Tyr Ser Asp 
            180                 185                 190         


Asn Leu Ile Ile Ala Gly Thr Arg Arg Trp Ser Gln Leu Val Asn Glu 
        195                 200                 205             


Pro Ala Asp Asn Pro Leu Pro Asp Lys Asn Thr Ala Tyr Ser Leu His 
    210                 215                 220                 


Phe Tyr Ala Gly Thr His Gly Gln Glu Leu Arg Asp Ile Gly Asp Tyr 
225                 230                 235                 240 


Ala Leu Ser Lys Gly Ile Ala Leu Phe Ile Thr Glu Trp Gly Thr Ser 
                245                 250                 255     


His Ala Asp Gly Gly Arg Asp Met Ile Val His Glu Glu Lys Ala Gln 
            260                 265                 270         


Glu Trp Ile Asp Trp Ala Met Glu Arg Asn Leu Ser Met Ala Asn Trp 
        275                 280                 285             


Ser Leu Phe Asp Lys Glu Glu Ala Ser Ala Ala Leu Asn Pro Asp Ala 
    290                 295                 300                 


Pro Val Asn Gly Asn Trp Asp Pro Glu Lys His Leu Ser Lys Ser Gly 
305                 310                 315                 320 


Arg Phe Val Arg Asp Gln Ile Ile Arg Ile Asn Ser Lys Lys His Asn 
                325                 330                 335     


<210> 279
<211> 1992
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 279
atgaaaaaac tgattctaac actctttagc ttatgggcta tatccgccta tgcacaagac     60

gatatacggt taaaccaggt gggattctat cctgctgctg aaaaggtggc tgtaattctg    120

tccgatgagc aaatctcttt tcaattgctg gatgatgaga ccggtgatgt ggtttttacc    180

ggtgagtcat caaccccaaa cacatggccc tattcggaag agaccgtggt actggctgac    240

tttagtgagt ttacggaagc gggaatgtac agggtggctg tagagggaaa gtccgactca    300

tatcctttca ctattgctga taacgcactt ggtgaggtgg ccagagcatc ggtgcgatac    360

ttctacttta accgagcctc aacagcgctg gaggagcagt ttgcgtggcg atgggcacgg    420

ccgatgggcc atccggacga agagatcatt gtccacgcct ctgctgcaac cgatgagcgc    480

ccggaagggt acacctttgc agcgccaaaa gggtggtacg atgcgggcga ttttaacaag    540

tatgtagtga actcgggcat cagcacctac accctgatgg cggcctatga gcactatccg    600

gaattttata ctgcgctgga tttaaacata cctgagagtg agagcggaaa cccggatatt    660

ttggctgaaa tacgatggaa cctcgactgg atgatgcaga tgcaggaccc gaacgatggc    720

ggggtatacc ataaacttac gactctgaac ttttccgggc aggtgatgcc ccaccaggca    780

agggcaaacc gttatgttgt gatgaaatct accgcggccg cgctgaactt tgctgcggtg    840

atggcggtag cttcgagggt atacgaacca tttgatccag atttttctga gcaggcgatt    900

gaggctgcaa aatatgcgtg ggagtgggca aatgaaaacc ctgaggtcta ttatcagcag    960

ccatccggtg tgtttacagg ggagtatggc gacagtgatc tctcagatga gttcgactgg   1020

gctgcggccg aactctacat cacaacaggg aatgacagct attgggatgc attcaatgaa   1080

tcggcacaag tcggaattcc atcatggcag ttcgtgcggc cccttgcctg gatctcactg   1140

gcacaccatc tcgagaacct gaccgatgcc gctaaccaag agcttatcag tgcacgaatc   1200

atcaaccagg caaatacgct cagaaacgag tatgagtcat cagcttacgg tatttcgatg   1260

ggccaggagc cgtggcagtt tttgtggggc agcaacgcga tggcgctgaa ccactcggtt   1320

ctgctgatcc aggcttaccg gctcacttgg gatgaaacct atctcgatgc agcacaatcc   1380

aatctcgatt acattctcgg cagaaatgca accgggtact cttttgtaac cggccacggc   1440

agcagaacgc ctatgaatcc ccatcaccgt caatcggcag cggataataa caccgatccc   1500

gttcccggaa tggtcgtggg cggcccgcac gacggccagc aggataactg caactaccca   1560

tcggatctgc cggcaaaatc gtatctcgat gcgtggtgca gctactccac caacgaagtg   1620

gcaatcaact ggaacgcggc gcttgcatat gtatcgggag cggtagacta ttcccgttcc   1680

ggtatattgg aaacaagtag tgaaacggaa ccgtcagcgg aacagccggt taccgtagag   1740

ctgagtcaaa actaccccaa tccttttaat cctgtgacgg ttattggata tcagttaccg   1800

gtaagcagtg acgtacggct tgaggtattc gatatgctcg ggcgccaggt tgccacactt   1860

gtggatagcc gccagcaagc cggcacccat caggccgaat tcgacgcctc caacctgtcc   1920

agcggagtat atctctaccg tttacaggcc ggaaatgtag tacaaataag acagatggtc   1980

ttggtgaagt ag                                                       1992

<210> 280
<211> 663
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (14)...(101)
<223> N-terminal ig-like domain of cellulase

<220> 
<221> DOMAIN
<222> (109)...(555)
<223> Glycosyl hydrolase family 9

<220> 
<221> SITE
<222> (253)...(256)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (356)...(359)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (364)...(367)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (391)...(394)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (443)...(446)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (476)...(479)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (481)...(497)
<223> Glycosyl hydrolases family 9 active sites signature 1. Prosite id = PS00592

<220> 
<221> SITE
<222> (503)...(506)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (536)...(554)
<223> Glycosyl hydrolases family 9 active sites signature 2. Prosite id = PS00698

<220> 
<221> SITE
<222> (647)...(650)
<223> N-glycosylation site. Prosite id = PS00001

<400> 280
Met Lys Lys Leu Ile Leu Thr Leu Phe Ser Leu Trp Ala Ile Ser Ala 
1               5                   10                  15      


Tyr Ala Gln Asp Asp Ile Arg Leu Asn Gln Val Gly Phe Tyr Pro Ala 
            20                  25                  30          


Ala Glu Lys Val Ala Val Ile Leu Ser Asp Glu Gln Ile Ser Phe Gln 
        35                  40                  45              


Leu Leu Asp Asp Glu Thr Gly Asp Val Val Phe Thr Gly Glu Ser Ser 
    50                  55                  60                  


Thr Pro Asn Thr Trp Pro Tyr Ser Glu Glu Thr Val Val Leu Ala Asp 
65                  70                  75                  80  


Phe Ser Glu Phe Thr Glu Ala Gly Met Tyr Arg Val Ala Val Glu Gly 
                85                  90                  95      


Lys Ser Asp Ser Tyr Pro Phe Thr Ile Ala Asp Asn Ala Leu Gly Glu 
            100                 105                 110         


Val Ala Arg Ala Ser Val Arg Tyr Phe Tyr Phe Asn Arg Ala Ser Thr 
        115                 120                 125             


Ala Leu Glu Glu Gln Phe Ala Trp Arg Trp Ala Arg Pro Met Gly His 
    130                 135                 140                 


Pro Asp Glu Glu Ile Ile Val His Ala Ser Ala Ala Thr Asp Glu Arg 
145                 150                 155                 160 


Pro Glu Gly Tyr Thr Phe Ala Ala Pro Lys Gly Trp Tyr Asp Ala Gly 
                165                 170                 175     


Asp Phe Asn Lys Tyr Val Val Asn Ser Gly Ile Ser Thr Tyr Thr Leu 
            180                 185                 190         


Met Ala Ala Tyr Glu His Tyr Pro Glu Phe Tyr Thr Ala Leu Asp Leu 
        195                 200                 205             


Asn Ile Pro Glu Ser Glu Ser Gly Asn Pro Asp Ile Leu Ala Glu Ile 
    210                 215                 220                 


Arg Trp Asn Leu Asp Trp Met Met Gln Met Gln Asp Pro Asn Asp Gly 
225                 230                 235                 240 


Gly Val Tyr His Lys Leu Thr Thr Leu Asn Phe Ser Gly Gln Val Met 
                245                 250                 255     


Pro His Gln Ala Arg Ala Asn Arg Tyr Val Val Met Lys Ser Thr Ala 
            260                 265                 270         


Ala Ala Leu Asn Phe Ala Ala Val Met Ala Val Ala Ser Arg Val Tyr 
        275                 280                 285             


Glu Pro Phe Asp Pro Asp Phe Ser Glu Gln Ala Ile Glu Ala Ala Lys 
    290                 295                 300                 


Tyr Ala Trp Glu Trp Ala Asn Glu Asn Pro Glu Val Tyr Tyr Gln Gln 
305                 310                 315                 320 


Pro Ser Gly Val Phe Thr Gly Glu Tyr Gly Asp Ser Asp Leu Ser Asp 
                325                 330                 335     


Glu Phe Asp Trp Ala Ala Ala Glu Leu Tyr Ile Thr Thr Gly Asn Asp 
            340                 345                 350         


Ser Tyr Trp Asp Ala Phe Asn Glu Ser Ala Gln Val Gly Ile Pro Ser 
        355                 360                 365             


Trp Gln Phe Val Arg Pro Leu Ala Trp Ile Ser Leu Ala His His Leu 
    370                 375                 380                 


Glu Asn Leu Thr Asp Ala Ala Asn Gln Glu Leu Ile Ser Ala Arg Ile 
385                 390                 395                 400 


Ile Asn Gln Ala Asn Thr Leu Arg Asn Glu Tyr Glu Ser Ser Ala Tyr 
                405                 410                 415     


Gly Ile Ser Met Gly Gln Glu Pro Trp Gln Phe Leu Trp Gly Ser Asn 
            420                 425                 430         


Ala Met Ala Leu Asn His Ser Val Leu Leu Ile Gln Ala Tyr Arg Leu 
        435                 440                 445             


Thr Trp Asp Glu Thr Tyr Leu Asp Ala Ala Gln Ser Asn Leu Asp Tyr 
    450                 455                 460                 


Ile Leu Gly Arg Asn Ala Thr Gly Tyr Ser Phe Val Thr Gly His Gly 
465                 470                 475                 480 


Ser Arg Thr Pro Met Asn Pro His His Arg Gln Ser Ala Ala Asp Asn 
                485                 490                 495     


Asn Thr Asp Pro Val Pro Gly Met Val Val Gly Gly Pro His Asp Gly 
            500                 505                 510         


Gln Gln Asp Asn Cys Asn Tyr Pro Ser Asp Leu Pro Ala Lys Ser Tyr 
        515                 520                 525             


Leu Asp Ala Trp Cys Ser Tyr Ser Thr Asn Glu Val Ala Ile Asn Trp 
    530                 535                 540                 


Asn Ala Ala Leu Ala Tyr Val Ser Gly Ala Val Asp Tyr Ser Arg Ser 
545                 550                 555                 560 


Gly Ile Leu Glu Thr Ser Ser Glu Thr Glu Pro Ser Ala Glu Gln Pro 
                565                 570                 575     


Val Thr Val Glu Leu Ser Gln Asn Tyr Pro Asn Pro Phe Asn Pro Val 
            580                 585                 590         


Thr Val Ile Gly Tyr Gln Leu Pro Val Ser Ser Asp Val Arg Leu Glu 
        595                 600                 605             


Val Phe Asp Met Leu Gly Arg Gln Val Ala Thr Leu Val Asp Ser Arg 
    610                 615                 620                 


Gln Gln Ala Gly Thr His Gln Ala Glu Phe Asp Ala Ser Asn Leu Ser 
625                 630                 635                 640 


Ser Gly Val Tyr Leu Tyr Arg Leu Gln Ala Gly Asn Val Val Gln Ile 
                645                 650                 655     


Arg Gln Met Val Leu Val Lys 
            660             


<210> 281
<211> 1383
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 281
atgtttactc gaaattgttt gcacgctatt ttaattgttg gcctgctttc tgcctgtgga     60

ggccaggaca aaagtgatcg tcgtcaggga accagccgca tttcatcgct aactattggc    120

aagtttaacc caaatggtga caacccacag tggtcgactt tgaaactggc aatggaaaat    180

atcgatcagg gagctttgtt tcttgaaaga acctatagca tcagtgattt ttccagtgat    240

ttggcagagg atctgcttgt caaagttcct tttggtcgct atcgactgaa cttgttatat    300

tatagtcaac agggaaattt gctctatcag tcatgtccag aagagattca gcgggaacac    360

gtaatcgata ctgctcgcta tagggcggat atagccatct gccgcgaatc tagtgatgaa    420

ccggtaggtc aggtcccaat tgaaccggta agtgaggtta tcatcagtcc gcagcccgat    480

cgaacgggtg aaaacggcgg aaatgcacca ggccaggagg gtggaaattt ctggattgat    540

cccgattccc aggcgatgct tgatttcagg caaatgcaac agtccggaca cccagatgct    600

cgctatattg agtatattgc caagcagcca gcggctgtgt ggtatggcga gtggagcgga    660

aatatttctc aagccgttcg tcagcatatc gccggtgcga atgcgcataa tgcctacgca    720

ctcatgattg cttacaaaat tcctgagcgg gattgtggac agcattcatc aggtgggctg    780

agagcagacg cttatcgaaa ttggattcgg gattttgcaa acgctattgg ctcggcaaag    840

gcaattgttg tgctagaacc tgatgctctt actttgatgg agtgtctgga tagtgacggc    900

gtagcgctga gatacgagct actcaatttt gccctcgccc agtttaagtc aaagcccaat    960

acccgggtct acattgatgc aggacactcc gcatggctgt ccgcgcaaga actggcagat   1020

cggctaagac tggctggcat tagccgtgcc gatggttttt ctttgaacac atcgaattat   1080

caaacgaccg aatcgaatat tcgctacggc caggaagtcc gcaacttgct gggtggtggt   1140

atcaacttca tcgtcgatac cagtcgtaat ggcaacggac caactcctga cgccgaatgg   1200

tgtaatccgc gagggcgggc cttgggtcaa acaccaactt tcagtacagg cgttaccggt   1260

gttgacgcat atttgtggct aaaacgtccg ggcgagtctg atggctattg taatggtggt   1320

cctgcagccg gtgcctggtg gcgggaattg gctatcgaat atgcaaaaaa cgccggtatc   1380

taa                                                                 1383

<210> 282
<211> 460
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (179)...(449)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (224)...(227)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (361)...(364)
<223> N-glycosylation site. Prosite id = PS00001

<400> 282
Met Phe Thr Arg Asn Cys Leu His Ala Ile Leu Ile Val Gly Leu Leu 
1               5                   10                  15      


Ser Ala Cys Gly Gly Gln Asp Lys Ser Asp Arg Arg Gln Gly Thr Ser 
            20                  25                  30          


Arg Ile Ser Ser Leu Thr Ile Gly Lys Phe Asn Pro Asn Gly Asp Asn 
        35                  40                  45              


Pro Gln Trp Ser Thr Leu Lys Leu Ala Met Glu Asn Ile Asp Gln Gly 
    50                  55                  60                  


Ala Leu Phe Leu Glu Arg Thr Tyr Ser Ile Ser Asp Phe Ser Ser Asp 
65                  70                  75                  80  


Leu Ala Glu Asp Leu Leu Val Lys Val Pro Phe Gly Arg Tyr Arg Leu 
                85                  90                  95      


Asn Leu Leu Tyr Tyr Ser Gln Gln Gly Asn Leu Leu Tyr Gln Ser Cys 
            100                 105                 110         


Pro Glu Glu Ile Gln Arg Glu His Val Ile Asp Thr Ala Arg Tyr Arg 
        115                 120                 125             


Ala Asp Ile Ala Ile Cys Arg Glu Ser Ser Asp Glu Pro Val Gly Gln 
    130                 135                 140                 


Val Pro Ile Glu Pro Val Ser Glu Val Ile Ile Ser Pro Gln Pro Asp 
145                 150                 155                 160 


Arg Thr Gly Glu Asn Gly Gly Asn Ala Pro Gly Gln Glu Gly Gly Asn 
                165                 170                 175     


Phe Trp Ile Asp Pro Asp Ser Gln Ala Met Leu Asp Phe Arg Gln Met 
            180                 185                 190         


Gln Gln Ser Gly His Pro Asp Ala Arg Tyr Ile Glu Tyr Ile Ala Lys 
        195                 200                 205             


Gln Pro Ala Ala Val Trp Tyr Gly Glu Trp Ser Gly Asn Ile Ser Gln 
    210                 215                 220                 


Ala Val Arg Gln His Ile Ala Gly Ala Asn Ala His Asn Ala Tyr Ala 
225                 230                 235                 240 


Leu Met Ile Ala Tyr Lys Ile Pro Glu Arg Asp Cys Gly Gln His Ser 
                245                 250                 255     


Ser Gly Gly Leu Arg Ala Asp Ala Tyr Arg Asn Trp Ile Arg Asp Phe 
            260                 265                 270         


Ala Asn Ala Ile Gly Ser Ala Lys Ala Ile Val Val Leu Glu Pro Asp 
        275                 280                 285             


Ala Leu Thr Leu Met Glu Cys Leu Asp Ser Asp Gly Val Ala Leu Arg 
    290                 295                 300                 


Tyr Glu Leu Leu Asn Phe Ala Leu Ala Gln Phe Lys Ser Lys Pro Asn 
305                 310                 315                 320 


Thr Arg Val Tyr Ile Asp Ala Gly His Ser Ala Trp Leu Ser Ala Gln 
                325                 330                 335     


Glu Leu Ala Asp Arg Leu Arg Leu Ala Gly Ile Ser Arg Ala Asp Gly 
            340                 345                 350         


Phe Ser Leu Asn Thr Ser Asn Tyr Gln Thr Thr Glu Ser Asn Ile Arg 
        355                 360                 365             


Tyr Gly Gln Glu Val Arg Asn Leu Leu Gly Gly Gly Ile Asn Phe Ile 
    370                 375                 380                 


Val Asp Thr Ser Arg Asn Gly Asn Gly Pro Thr Pro Asp Ala Glu Trp 
385                 390                 395                 400 


Cys Asn Pro Arg Gly Arg Ala Leu Gly Gln Thr Pro Thr Phe Ser Thr 
                405                 410                 415     


Gly Val Thr Gly Val Asp Ala Tyr Leu Trp Leu Lys Arg Pro Gly Glu 
            420                 425                 430         


Ser Asp Gly Tyr Cys Asn Gly Gly Pro Ala Ala Gly Ala Trp Trp Arg 
        435                 440                 445             


Glu Leu Ala Ile Glu Tyr Ala Lys Asn Ala Gly Ile 
    450                 455                 460 


<210> 283
<211> 1188
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 283
atgcgtaaga tcgtaaaaca aataaattac cttaccccta gtgtactggg tcttctggtg     60

ctgagtctct tctttcaggt tcccactcaa gcttcacacc ccctgctgag cacgccaaat    120

tatctgcata cctccggcag tcaaatcctg gatgcatcaa ataaagttgt cgggttgagc    180

ggaatcaact ggtttggatt tgagacagcc aataacgttc cacatggttt atgggcacgt    240

ggttgggaag atgtgctcga tcagatcaag aaggaaggat acaacgtcat tcgacttcca    300

ttctcgaatg ccatgctcaa gaaagatgta atgccttcgg gtattgatta ccagaagaat    360

ccagacctcg aggggctgac gtccctgcag gtgatggaca agatcattcg aggagccaat    420

gatcggggac tcaagatcat cctggataat caccgttcca catcgggtgg cggaccggag    480

tcaaacggct tatggtacac aagcgaatat tcggaaaacg attggatcct cgattggaag    540

aaacttgtcc gccggtacaa atacatcccg gctgtaatag ccgttgatct tcgtaatgaa    600

ccgtataatg cctgctgggg ctgtggggat ccatccaagg attggagact ggcgtcggag    660

aaagccggca atgccgtgct ttcggtaaat cccaatctgt tagtgattgt cgagggtgtt    720

gctgtccata atggtcaaaa cacatggtgg ggcggtaatt tgctaggtgc gaaagaattt    780

ccagtccgcc tgaatgtgcc gcaccggttg gtgtattctg cacatgaata ccctgagacg    840

atctaccctc aaccgtggtt taccgattcg aattatccca ataaccttgc ggctgtttgg    900

gataaatact ggggttacct ggtcaaggaa aatattgctc ctgtcttgat cggcgaattc    960

ggcacacgac ttgaaacaga aaaagataag cagtggctat cccaattcca ggaatatgtt   1020

cggcaacaca aaatcagttg gacgtactgg tcgctcaatc caaactctgg tgacacaggc   1080

ggattgctgc aggacgattg ggtcaccatc caccagacga agcaaagcat cctcaggcaa   1140

attcaatatc ccttcattcc tcagttcgcc aacctccagg aaaaataa                1188

<210> 284
<211> 395
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(31)

<220> 
<221> DOMAIN
<222> (47)...(360)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 284
Met Arg Lys Ile Val Lys Gln Ile Asn Tyr Leu Thr Pro Ser Val Leu 
1               5                   10                  15      


Gly Leu Leu Val Leu Ser Leu Phe Phe Gln Val Pro Thr Gln Ala Ser 
            20                  25                  30          


His Pro Leu Leu Ser Thr Pro Asn Tyr Leu His Thr Ser Gly Ser Gln 
        35                  40                  45              


Ile Leu Asp Ala Ser Asn Lys Val Val Gly Leu Ser Gly Ile Asn Trp 
    50                  55                  60                  


Phe Gly Phe Glu Thr Ala Asn Asn Val Pro His Gly Leu Trp Ala Arg 
65                  70                  75                  80  


Gly Trp Glu Asp Val Leu Asp Gln Ile Lys Lys Glu Gly Tyr Asn Val 
                85                  90                  95      


Ile Arg Leu Pro Phe Ser Asn Ala Met Leu Lys Lys Asp Val Met Pro 
            100                 105                 110         


Ser Gly Ile Asp Tyr Gln Lys Asn Pro Asp Leu Glu Gly Leu Thr Ser 
        115                 120                 125             


Leu Gln Val Met Asp Lys Ile Ile Arg Gly Ala Asn Asp Arg Gly Leu 
    130                 135                 140                 


Lys Ile Ile Leu Asp Asn His Arg Ser Thr Ser Gly Gly Gly Pro Glu 
145                 150                 155                 160 


Ser Asn Gly Leu Trp Tyr Thr Ser Glu Tyr Ser Glu Asn Asp Trp Ile 
                165                 170                 175     


Leu Asp Trp Lys Lys Leu Val Arg Arg Tyr Lys Tyr Ile Pro Ala Val 
            180                 185                 190         


Ile Ala Val Asp Leu Arg Asn Glu Pro Tyr Asn Ala Cys Trp Gly Cys 
        195                 200                 205             


Gly Asp Pro Ser Lys Asp Trp Arg Leu Ala Ser Glu Lys Ala Gly Asn 
    210                 215                 220                 


Ala Val Leu Ser Val Asn Pro Asn Leu Leu Val Ile Val Glu Gly Val 
225                 230                 235                 240 


Ala Val His Asn Gly Gln Asn Thr Trp Trp Gly Gly Asn Leu Leu Gly 
                245                 250                 255     


Ala Lys Glu Phe Pro Val Arg Leu Asn Val Pro His Arg Leu Val Tyr 
            260                 265                 270         


Ser Ala His Glu Tyr Pro Glu Thr Ile Tyr Pro Gln Pro Trp Phe Thr 
        275                 280                 285             


Asp Ser Asn Tyr Pro Asn Asn Leu Ala Ala Val Trp Asp Lys Tyr Trp 
    290                 295                 300                 


Gly Tyr Leu Val Lys Glu Asn Ile Ala Pro Val Leu Ile Gly Glu Phe 
305                 310                 315                 320 


Gly Thr Arg Leu Glu Thr Glu Lys Asp Lys Gln Trp Leu Ser Gln Phe 
                325                 330                 335     


Gln Glu Tyr Val Arg Gln His Lys Ile Ser Trp Thr Tyr Trp Ser Leu 
            340                 345                 350         


Asn Pro Asn Ser Gly Asp Thr Gly Gly Leu Leu Gln Asp Asp Trp Val 
        355                 360                 365             


Thr Ile His Gln Thr Lys Gln Ser Ile Leu Arg Gln Ile Gln Tyr Pro 
    370                 375                 380                 


Phe Ile Pro Gln Phe Ala Asn Leu Gln Glu Lys 
385                 390                 395 


<210> 285
<211> 2550
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 285
atgaaaaagg taagtaatgc acgtgtattg agtttcctat taatacttgt tctcattttt     60

gggaatctag cttctgtttt tgcaatagaa aattcggatg atagtagttt actaggtaat    120

gataaggtac ttaatccttc ccaagctgga gccctacaaa ttatagatat agatggtcaa    180

aggaccttag gaactattga tggagatcct attcaattaa gaggaatgag tacccatggt    240

cttcaatggt ttggagagat tataaatgat aatgcctttg ctgcattagc taatgattgg    300

gaatctaaca tgatacgact agcgatgtat attggcgaaa atggttatgc taccaatccg    360

agtgtaaaag atttagtgta tgaggggata gaactcgcct ttgagcacga catgtatgtg    420

attgtagact ggcatgttca cgcaccaggg gatccgaggg cagatgtata ctccggagcg    480

tatgagttct ttgaagagtt ggcagatcat tataaggatc atccaaaaaa tcattatatt    540

atctgggagc ttgcaaacga accaagttct aataataacg gaggtccagg aatacctaat    600

aacgaagaag gctggcaagc cgtaaaagaa tatgctgaac ctattgtaga tatgcttcgt    660

gaaaagggag ataatatcat tcttatcggt agcccaaact ggagtcagcg accagattta    720

gctgcggata atccaatcga tgcggaaaat atcatgtact ctgttcattt ttatactgga    780

acgcataccc catcagagga tagttaccca ccaggcaccc ctaatacaga gcgatctaat    840

gtgatgagta acgcaagata tgcattagaa aatggtgttg ctttgttcgc aacagaatgg    900

ggtactagtg aagctagtgg ggataatgga ccgtttttag atgaagctga tgtttggctt    960

agttttctaa atgaaaacaa catcagttgg gcgaactggt ccttaacgaa caaaaatgaa   1020

acttcaggag cttttacacc atttatacta aatcaatccg atgcaacgaa gcttgatcca   1080

ggtgatgatc aagtttggtc tatggaagag ttaagtatct caggggagta tgttcgtgca   1140

agaattaaag gaattgaata tgatccaatt aatcgaacac caagtgaaga ttatacaaaa   1200

gtcatttggg attttgacga tgggacaacg caaggattcc gtgtgaacgg ggacagtcct   1260

attcaagata ttcaattaga taatgttgat aattcattag aaataagtgg tcttgatgct   1320

agtaacgaca catcagaagg aaattactgg gccaatgtac gtttgtcttc agatggatat   1380

aatcctgggg tagacatttt aggggcagag gagcttataa tggaagtaat cgtggaggaa   1440

ccaactacag tttctattgc agctattcca caaagttcga atcacggttg ggcaaatcct   1500

actcgggcag taaaagtgac acctgaggat tttgagttac aaggagatgg aaagttcgtt   1560

gcaccattat ccataacaac agatgatgct ccaaacttaa ataatatcgg aaacgataca   1620

gataatagta tgcttacaaa cttgatttta tttataggta cggaaaatgc ggatgttatt   1680

tcattggata acatctctgt atctggaaat agagaagtaa ttgttgatcc tatagaacat   1740

ggcccacttg gcgtggcaac tctaccatct gattttgagg atggcacaag acaaggatgg   1800

tcttggaata gtgaatccgg agtaagagaa gctctaacta ttgaagaagc aaatggttcc   1860

aacgcactct cctgggaata tgcttaccca gaagtaaagc ctgaggatgg gtgggcatct   1920

gcacctagat tagatctctg gatagaagaa ctagtcagag gagagaatga ttttgttgcg   1980

tttgattttt acctagaccc cgttcgtgca acggagggag caatctctat tcatacggta   2040

tttcaaccac cagcttcaag ttactgggca caagctcctt cgacctttaa catcgaactt   2100

gaagaattaa atgaagctaa agtgacaact gatggattat accactatga agttgcgata   2160

aatattcggg atataaataa tattgaggac gatacagagc tacggaacat gatgttcatt   2220

ttcgcagata tggacagcga ctttgccggt cgagttttta ttgataatat tcggtttgaa   2280

ttgacaacaa tcacggtaga cgacattttc gaaaaaattg atagttttgt tgagaatggg   2340

gatataagac accccgcagg agttcaatta acaaatacac tgagaacggc agaacgacac   2400

tataataatg gtaaaagtaa gcaatctcaa acacatctaa ataaatttca ctctattatt   2460

gagggtaata tgatgaagaa cgtttctaat gaagtgaaaa aagtattgcg ggatgatata   2520

gatcgtcttt ctagtctgtg gtttgattaa                                    2550

<210> 286
<211> 849
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (64)...(343)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (365)...(568)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> DOMAIN
<222> (570)...(760)
<223> Carbohydrate binding domain (family 17/28)

<220> 
<221> SITE
<222> (182)...(191)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (236)...(239)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (332)...(335)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (337)...(340)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (344)...(347)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (356)...(359)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (439)...(446)
<223> Aldehyde dehydrogenases glutamic acid active site. Prosite id = PS00687

<220> 
<221> SITE
<222> (448)...(451)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (546)...(549)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (572)...(575)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (627)...(630)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (839)...(842)
<223> N-glycosylation site. Prosite id = PS00001

<400> 286
Met Lys Lys Val Ser Asn Ala Arg Val Leu Ser Phe Leu Leu Ile Leu 
1               5                   10                  15      


Val Leu Ile Phe Gly Asn Leu Ala Ser Val Phe Ala Ile Glu Asn Ser 
            20                  25                  30          


Asp Asp Ser Ser Leu Leu Gly Asn Asp Lys Val Leu Asn Pro Ser Gln 
        35                  40                  45              


Ala Gly Ala Leu Gln Ile Ile Asp Ile Asp Gly Gln Arg Thr Leu Gly 
    50                  55                  60                  


Thr Ile Asp Gly Asp Pro Ile Gln Leu Arg Gly Met Ser Thr His Gly 
65                  70                  75                  80  


Leu Gln Trp Phe Gly Glu Ile Ile Asn Asp Asn Ala Phe Ala Ala Leu 
                85                  90                  95      


Ala Asn Asp Trp Glu Ser Asn Met Ile Arg Leu Ala Met Tyr Ile Gly 
            100                 105                 110         


Glu Asn Gly Tyr Ala Thr Asn Pro Ser Val Lys Asp Leu Val Tyr Glu 
        115                 120                 125             


Gly Ile Glu Leu Ala Phe Glu His Asp Met Tyr Val Ile Val Asp Trp 
    130                 135                 140                 


His Val His Ala Pro Gly Asp Pro Arg Ala Asp Val Tyr Ser Gly Ala 
145                 150                 155                 160 


Tyr Glu Phe Phe Glu Glu Leu Ala Asp His Tyr Lys Asp His Pro Lys 
                165                 170                 175     


Asn His Tyr Ile Ile Trp Glu Leu Ala Asn Glu Pro Ser Ser Asn Asn 
            180                 185                 190         


Asn Gly Gly Pro Gly Ile Pro Asn Asn Glu Glu Gly Trp Gln Ala Val 
        195                 200                 205             


Lys Glu Tyr Ala Glu Pro Ile Val Asp Met Leu Arg Glu Lys Gly Asp 
    210                 215                 220                 


Asn Ile Ile Leu Ile Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Leu 
225                 230                 235                 240 


Ala Ala Asp Asn Pro Ile Asp Ala Glu Asn Ile Met Tyr Ser Val His 
                245                 250                 255     


Phe Tyr Thr Gly Thr His Thr Pro Ser Glu Asp Ser Tyr Pro Pro Gly 
            260                 265                 270         


Thr Pro Asn Thr Glu Arg Ser Asn Val Met Ser Asn Ala Arg Tyr Ala 
        275                 280                 285             


Leu Glu Asn Gly Val Ala Leu Phe Ala Thr Glu Trp Gly Thr Ser Glu 
    290                 295                 300                 


Ala Ser Gly Asp Asn Gly Pro Phe Leu Asp Glu Ala Asp Val Trp Leu 
305                 310                 315                 320 


Ser Phe Leu Asn Glu Asn Asn Ile Ser Trp Ala Asn Trp Ser Leu Thr 
                325                 330                 335     


Asn Lys Asn Glu Thr Ser Gly Ala Phe Thr Pro Phe Ile Leu Asn Gln 
            340                 345                 350         


Ser Asp Ala Thr Lys Leu Asp Pro Gly Asp Asp Gln Val Trp Ser Met 
        355                 360                 365             


Glu Glu Leu Ser Ile Ser Gly Glu Tyr Val Arg Ala Arg Ile Lys Gly 
    370                 375                 380                 


Ile Glu Tyr Asp Pro Ile Asn Arg Thr Pro Ser Glu Asp Tyr Thr Lys 
385                 390                 395                 400 


Val Ile Trp Asp Phe Asp Asp Gly Thr Thr Gln Gly Phe Arg Val Asn 
                405                 410                 415     


Gly Asp Ser Pro Ile Gln Asp Ile Gln Leu Asp Asn Val Asp Asn Ser 
            420                 425                 430         


Leu Glu Ile Ser Gly Leu Asp Ala Ser Asn Asp Thr Ser Glu Gly Asn 
        435                 440                 445             


Tyr Trp Ala Asn Val Arg Leu Ser Ser Asp Gly Tyr Asn Pro Gly Val 
    450                 455                 460                 


Asp Ile Leu Gly Ala Glu Glu Leu Ile Met Glu Val Ile Val Glu Glu 
465                 470                 475                 480 


Pro Thr Thr Val Ser Ile Ala Ala Ile Pro Gln Ser Ser Asn His Gly 
                485                 490                 495     


Trp Ala Asn Pro Thr Arg Ala Val Lys Val Thr Pro Glu Asp Phe Glu 
            500                 505                 510         


Leu Gln Gly Asp Gly Lys Phe Val Ala Pro Leu Ser Ile Thr Thr Asp 
        515                 520                 525             


Asp Ala Pro Asn Leu Asn Asn Ile Gly Asn Asp Thr Asp Asn Ser Met 
    530                 535                 540                 


Leu Thr Asn Leu Ile Leu Phe Ile Gly Thr Glu Asn Ala Asp Val Ile 
545                 550                 555                 560 


Ser Leu Asp Asn Ile Ser Val Ser Gly Asn Arg Glu Val Ile Val Asp 
                565                 570                 575     


Pro Ile Glu His Gly Pro Leu Gly Val Ala Thr Leu Pro Ser Asp Phe 
            580                 585                 590         


Glu Asp Gly Thr Arg Gln Gly Trp Ser Trp Asn Ser Glu Ser Gly Val 
        595                 600                 605             


Arg Glu Ala Leu Thr Ile Glu Glu Ala Asn Gly Ser Asn Ala Leu Ser 
    610                 615                 620                 


Trp Glu Tyr Ala Tyr Pro Glu Val Lys Pro Glu Asp Gly Trp Ala Ser 
625                 630                 635                 640 


Ala Pro Arg Leu Asp Leu Trp Ile Glu Glu Leu Val Arg Gly Glu Asn 
                645                 650                 655     


Asp Phe Val Ala Phe Asp Phe Tyr Leu Asp Pro Val Arg Ala Thr Glu 
            660                 665                 670         


Gly Ala Ile Ser Ile His Thr Val Phe Gln Pro Pro Ala Ser Ser Tyr 
        675                 680                 685             


Trp Ala Gln Ala Pro Ser Thr Phe Asn Ile Glu Leu Glu Glu Leu Asn 
    690                 695                 700                 


Glu Ala Lys Val Thr Thr Asp Gly Leu Tyr His Tyr Glu Val Ala Ile 
705                 710                 715                 720 


Asn Ile Arg Asp Ile Asn Asn Ile Glu Asp Asp Thr Glu Leu Arg Asn 
                725                 730                 735     


Met Met Phe Ile Phe Ala Asp Met Asp Ser Asp Phe Ala Gly Arg Val 
            740                 745                 750         


Phe Ile Asp Asn Ile Arg Phe Glu Leu Thr Thr Ile Thr Val Asp Asp 
        755                 760                 765             


Ile Phe Glu Lys Ile Asp Ser Phe Val Glu Asn Gly Asp Ile Arg His 
    770                 775                 780                 


Pro Ala Gly Val Gln Leu Thr Asn Thr Leu Arg Thr Ala Glu Arg His 
785                 790                 795                 800 


Tyr Asn Asn Gly Lys Ser Lys Gln Ser Gln Thr His Leu Asn Lys Phe 
                805                 810                 815     


His Ser Ile Ile Glu Gly Asn Met Met Lys Asn Val Ser Asn Glu Val 
            820                 825                 830         


Lys Lys Val Leu Arg Asp Asp Ile Asp Arg Leu Ser Ser Leu Trp Phe 
        835                 840                 845             


Asp 
    


<210> 287
<211> 1023
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 287
atgactcgaa actggcttgg caagatattg gcggcattgc ttttggcggg atgcgccata     60

cccgccccgg cgcaatcgcc gtcctccgat gagcccgcct tcgcggcgcc ggttccgccc    120

gctggcagcc cggtcgctcg ccacggcgcg ttgtcggtca gcggaaaccg gatcgtcgat    180

gcgcatggcc agcccgtcac cttgcggggc atgagcctgt tctggtcgca atgggcgccg    240

caatattaca ccggcgaaac ggtcgactgg ctggtcaagg actggaagat caccgccatc    300

cgcgcggcca tcgccgccga gaccaatgac agcgcgcgcc agcatttcga gcgtgaattc    360

gccaaggccg gccgggtgat cgaggcggcg gcgcgcaacg gcatctatgt gattgtcgat    420

tggcacgcgc accggcagta ccccgtcgag gcggagcagt ttcttaccgc catcgcccgg    480

cgatacggcc atctgcccaa tctgatctac gagccgttca acgagccgct gcgcgaaggc    540

gtcgattggt cgcgtgacgt gaagccctat caccagcggg tcgtcggggc gatccgggcg    600

atcgatcccg acaatctggt gatcgtcggc agtcccagct ggagccagga tgtcgatatc    660

gcggcgctgg acccgctcga tttccccaat gtcggctaca cgctgcatta ttacgccggc    720

acccaccgtc aggagctgcg cgacaagggc gatgctgcgc tggccgccgg gctggcgctg    780

atggtgaccg aattcgggat cgtcgatgcc accggcgacg gtccgatcga tctgccgtcg    840

agcgaactgt ggtgggactg ggccgaagcc aacggcgtct cgtggctcgc ctggtcgacc    900

ggcgaccgcg acgagaccag cgcgacgctg aaaccgggca ccgcgccatc gggctggagc    960

gaagacgatc tgacccaatc cgggaaaatc cttcgcgcgc ggctgcgggc ggcggcggaa   1020

tga                                                                 1023

<210> 288
<211> 340
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(24)

<220> 
<221> DOMAIN
<222> (56)...(308)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (110)...(113)
<223> N-glycosylation site. Prosite id = PS00001

<400> 288
Met Thr Arg Asn Trp Leu Gly Lys Ile Leu Ala Ala Leu Leu Leu Ala 
1               5                   10                  15      


Gly Cys Ala Ile Pro Ala Pro Ala Gln Ser Pro Ser Ser Asp Glu Pro 
            20                  25                  30          


Ala Phe Ala Ala Pro Val Pro Pro Ala Gly Ser Pro Val Ala Arg His 
        35                  40                  45              


Gly Ala Leu Ser Val Ser Gly Asn Arg Ile Val Asp Ala His Gly Gln 
    50                  55                  60                  


Pro Val Thr Leu Arg Gly Met Ser Leu Phe Trp Ser Gln Trp Ala Pro 
65                  70                  75                  80  


Gln Tyr Tyr Thr Gly Glu Thr Val Asp Trp Leu Val Lys Asp Trp Lys 
                85                  90                  95      


Ile Thr Ala Ile Arg Ala Ala Ile Ala Ala Glu Thr Asn Asp Ser Ala 
            100                 105                 110         


Arg Gln His Phe Glu Arg Glu Phe Ala Lys Ala Gly Arg Val Ile Glu 
        115                 120                 125             


Ala Ala Ala Arg Asn Gly Ile Tyr Val Ile Val Asp Trp His Ala His 
    130                 135                 140                 


Arg Gln Tyr Pro Val Glu Ala Glu Gln Phe Leu Thr Ala Ile Ala Arg 
145                 150                 155                 160 


Arg Tyr Gly His Leu Pro Asn Leu Ile Tyr Glu Pro Phe Asn Glu Pro 
                165                 170                 175     


Leu Arg Glu Gly Val Asp Trp Ser Arg Asp Val Lys Pro Tyr His Gln 
            180                 185                 190         


Arg Val Val Gly Ala Ile Arg Ala Ile Asp Pro Asp Asn Leu Val Ile 
        195                 200                 205             


Val Gly Ser Pro Ser Trp Ser Gln Asp Val Asp Ile Ala Ala Leu Asp 
    210                 215                 220                 


Pro Leu Asp Phe Pro Asn Val Gly Tyr Thr Leu His Tyr Tyr Ala Gly 
225                 230                 235                 240 


Thr His Arg Gln Glu Leu Arg Asp Lys Gly Asp Ala Ala Leu Ala Ala 
                245                 250                 255     


Gly Leu Ala Leu Met Val Thr Glu Phe Gly Ile Val Asp Ala Thr Gly 
            260                 265                 270         


Asp Gly Pro Ile Asp Leu Pro Ser Ser Glu Leu Trp Trp Asp Trp Ala 
        275                 280                 285             


Glu Ala Asn Gly Val Ser Trp Leu Ala Trp Ser Thr Gly Asp Arg Asp 
    290                 295                 300                 


Glu Thr Ser Ala Thr Leu Lys Pro Gly Thr Ala Pro Ser Gly Trp Ser 
305                 310                 315                 320 


Glu Asp Asp Leu Thr Gln Ser Gly Lys Ile Leu Arg Ala Arg Leu Arg 
                325                 330                 335     


Ala Ala Ala Glu 
            340 


<210> 289
<211> 1971
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 289
atgattggcg ttaatctcgc tggggcagaa ttcggcagtg tgggtcagcc gtttggagtc     60

ggctatattt acccgagcaa agcgaacatc gattactatg cgtcgcacgg catggagctg    120

atccgcctgc cgttccgctg ggagcggatc cagccgcagc aggacgggcc gctcgaccag    180

gccgaactcg cgcgcatccg tgaagtcgtg gactacgcgg cctcgaaggg catgacggtc    240

gccctcgacg tgcacaatta cgggatgtcc tacggcggca agctgatcgg cagcgccgag    300

cttcccaact cttccttcgc gaacctctgg tccaagctcg ccacggagtt ccgcgacgac    360

ggcaacgtgt ggttcaacat catgaacgaa ccgcacaagc agtcggcgac gcagtggatc    420

gaatcggcca acgccgcgat cgccgcgatc cgcgacgcgg gtgcggacca gaagattctc    480

gtccccggca cctactggtc gggcgcgcat tcatggatga cgagcgacaa ccacagcgtc    540

gtcggcaacg gcgtcaagga cccgctgaac aacttcgcgt tcgacgtgca ccagtacttc    600

gacaacgaca gcagcggcac cagcacgcag gtcgtctccg agaccatcgg cgtcgagcgc    660

ctgagggcga tcaccgagtg ggcgaaggcg aacgggcacg agctcttcct cggcgagttc    720

ggcgtgggga ccgacgccag gagcctcgcc gccctcgaca acatgatgaa gtacatgtcg    780

gagcatcccg acgtgtggat cggcgccacc tactgggccg gcggcccctg gtggggcgac    840

tactacttct ccctcgagcc gaagaacggc gtcgacaagc cccagctcgc gatcctcatg    900

aagtacgaga tcggcccgaa cggggaagag gtggccacgc cggtggtcgt cgagcccgag    960

acggcggaaa gcgacgagcc ggtgcaggag gaggccgagc aggccgtccc gccggtgacc   1020

gccgaggacg gagacgcgcc gtccgaggac gaggccgccg ctccccccgc cgaggaggac   1080

gcgcctacgg atgaggagga cgacgcggag gacgacggcg gcgacgagca gcccgtctcg   1140

cgccttccgg aagcctccga ggatccggtc gacgaaacga cgcccgacga agagcccgtc   1200

gccgcggctc ccgaggtcga ggacgaagtc gaggacgaag ccgaaatcga gccggagacg   1260

ccggtcgccg tcgaggaacc ggttgtcgac gagccggagg tgacccaagg tcccgtcgcc   1320

gctcagccct ccacggtgcg catcgaccac gccgacggcg agggcgacca ttatcgcgaa   1380

ggcgagcgca tgggctacaa gatcaccgtc gaaaacgccc aggccggcca gaccttccat   1440

atctggatgg ccaatacggc gggcgccgac gatttctccg ggggcttcct cgaggatctg   1500

caggcggctg cgcccgaggg cgtgggcgtg cggatgatca gccccaccaa ggccgagatc   1560

acgctcaagg aaggcagcta cgacttcacc ttcttccgcg acaccatgat caatgcgggg   1620

gcggagaagg cggaccaggc tccgtggaac gccgagagct tccagcagat cgacttctgg   1680

atcagcgact tcaccggggg cctgaccccg accgcccggg tttcgtcgag ctggattgaa   1740

gacacgcggc ccgatgaagc gccgatcgtg tcggctccgg aggcggtgaa cacggaaact   1800

cccgaggtcg aggccgctcc cgcagaagcg gcgcagcccg ctcccgtcgc ggaggcgccc   1860

gctgccacgg tcgattggca gaagcttctg gccgatctcc tcgccaatgc cggcgagggc   1920

gccccggagg gcggcttcag cgagacgaag atcttgttca gtccgctttg a            1971

<210> 290
<211> 656
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(282)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (104)...(107)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (179)...(182)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (205)...(208)
<223> N-glycosylation site. Prosite id = PS00001

<400> 290
Met Ile Gly Val Asn Leu Ala Gly Ala Glu Phe Gly Ser Val Gly Gln 
1               5                   10                  15      


Pro Phe Gly Val Gly Tyr Ile Tyr Pro Ser Lys Ala Asn Ile Asp Tyr 
            20                  25                  30          


Tyr Ala Ser His Gly Met Glu Leu Ile Arg Leu Pro Phe Arg Trp Glu 
        35                  40                  45              


Arg Ile Gln Pro Gln Gln Asp Gly Pro Leu Asp Gln Ala Glu Leu Ala 
    50                  55                  60                  


Arg Ile Arg Glu Val Val Asp Tyr Ala Ala Ser Lys Gly Met Thr Val 
65                  70                  75                  80  


Ala Leu Asp Val His Asn Tyr Gly Met Ser Tyr Gly Gly Lys Leu Ile 
                85                  90                  95      


Gly Ser Ala Glu Leu Pro Asn Ser Ser Phe Ala Asn Leu Trp Ser Lys 
            100                 105                 110         


Leu Ala Thr Glu Phe Arg Asp Asp Gly Asn Val Trp Phe Asn Ile Met 
        115                 120                 125             


Asn Glu Pro His Lys Gln Ser Ala Thr Gln Trp Ile Glu Ser Ala Asn 
    130                 135                 140                 


Ala Ala Ile Ala Ala Ile Arg Asp Ala Gly Ala Asp Gln Lys Ile Leu 
145                 150                 155                 160 


Val Pro Gly Thr Tyr Trp Ser Gly Ala His Ser Trp Met Thr Ser Asp 
                165                 170                 175     


Asn His Ser Val Val Gly Asn Gly Val Lys Asp Pro Leu Asn Asn Phe 
            180                 185                 190         


Ala Phe Asp Val His Gln Tyr Phe Asp Asn Asp Ser Ser Gly Thr Ser 
        195                 200                 205             


Thr Gln Val Val Ser Glu Thr Ile Gly Val Glu Arg Leu Arg Ala Ile 
    210                 215                 220                 


Thr Glu Trp Ala Lys Ala Asn Gly His Glu Leu Phe Leu Gly Glu Phe 
225                 230                 235                 240 


Gly Val Gly Thr Asp Ala Arg Ser Leu Ala Ala Leu Asp Asn Met Met 
                245                 250                 255     


Lys Tyr Met Ser Glu His Pro Asp Val Trp Ile Gly Ala Thr Tyr Trp 
            260                 265                 270         


Ala Gly Gly Pro Trp Trp Gly Asp Tyr Tyr Phe Ser Leu Glu Pro Lys 
        275                 280                 285             


Asn Gly Val Asp Lys Pro Gln Leu Ala Ile Leu Met Lys Tyr Glu Ile 
    290                 295                 300                 


Gly Pro Asn Gly Glu Glu Val Ala Thr Pro Val Val Val Glu Pro Glu 
305                 310                 315                 320 


Thr Ala Glu Ser Asp Glu Pro Val Gln Glu Glu Ala Glu Gln Ala Val 
                325                 330                 335     


Pro Pro Val Thr Ala Glu Asp Gly Asp Ala Pro Ser Glu Asp Glu Ala 
            340                 345                 350         


Ala Ala Pro Pro Ala Glu Glu Asp Ala Pro Thr Asp Glu Glu Asp Asp 
        355                 360                 365             


Ala Glu Asp Asp Gly Gly Asp Glu Gln Pro Val Ser Arg Leu Pro Glu 
    370                 375                 380                 


Ala Ser Glu Asp Pro Val Asp Glu Thr Thr Pro Asp Glu Glu Pro Val 
385                 390                 395                 400 


Ala Ala Ala Pro Glu Val Glu Asp Glu Val Glu Asp Glu Ala Glu Ile 
                405                 410                 415     


Glu Pro Glu Thr Pro Val Ala Val Glu Glu Pro Val Val Asp Glu Pro 
            420                 425                 430         


Glu Val Thr Gln Gly Pro Val Ala Ala Gln Pro Ser Thr Val Arg Ile 
        435                 440                 445             


Asp His Ala Asp Gly Glu Gly Asp His Tyr Arg Glu Gly Glu Arg Met 
    450                 455                 460                 


Gly Tyr Lys Ile Thr Val Glu Asn Ala Gln Ala Gly Gln Thr Phe His 
465                 470                 475                 480 


Ile Trp Met Ala Asn Thr Ala Gly Ala Asp Asp Phe Ser Gly Gly Phe 
                485                 490                 495     


Leu Glu Asp Leu Gln Ala Ala Ala Pro Glu Gly Val Gly Val Arg Met 
            500                 505                 510         


Ile Ser Pro Thr Lys Ala Glu Ile Thr Leu Lys Glu Gly Ser Tyr Asp 
        515                 520                 525             


Phe Thr Phe Phe Arg Asp Thr Met Ile Asn Ala Gly Ala Glu Lys Ala 
    530                 535                 540                 


Asp Gln Ala Pro Trp Asn Ala Glu Ser Phe Gln Gln Ile Asp Phe Trp 
545                 550                 555                 560 


Ile Ser Asp Phe Thr Gly Gly Leu Thr Pro Thr Ala Arg Val Ser Ser 
                565                 570                 575     


Ser Trp Ile Glu Asp Thr Arg Pro Asp Glu Ala Pro Ile Val Ser Ala 
            580                 585                 590         


Pro Glu Ala Val Asn Thr Glu Thr Pro Glu Val Glu Ala Ala Pro Ala 
        595                 600                 605             


Glu Ala Ala Gln Pro Ala Pro Val Ala Glu Ala Pro Ala Ala Thr Val 
    610                 615                 620                 


Asp Trp Gln Lys Leu Leu Ala Asp Leu Leu Ala Asn Ala Gly Glu Gly 
625                 630                 635                 640 


Ala Pro Glu Gly Gly Phe Ser Glu Thr Lys Ile Leu Phe Ser Pro Leu 
                645                 650                 655     


<210> 291
<211> 1893
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 291
atgcgcagat ttcgagttgt attccttggt ctttttgttt tttttggtat tgttatcgct     60

agccagtatg ggcagacagc agctgcatac gttccaactg tctgtgatgc agacttcaat    120

cacgacacca cagtcaacgt tttggacttt gcggtcatgg caaacaactt cttcaagtcg    180

cctctcatca atccagaagc tgacttaaac cgagatggtt acgtcaacat ttttgacttt    240

cactacatgc agatgttcct gtttgagaac tgtgttcctt ccagctctcc atcacctcca    300

gtcagtccgt caccagctgt tagtcccagt ccaagcgtct caccatctcc cttacctcca    360

actgacgtaa ccatgtccat cagcacttca gtcagtagtc ctatctcacc ctatatttat    420

ggaacgaact ccaagaacta cgctcgttca aagctgatca gggctggtgg taatcgttgg    480

acagcatata actgggaaaa taacgcctcc aacgcgggat ctgactggaa ccattcaagt    540

gataactact tgtgtgacgc cagcggtggt tatgtctgta caaactctac aaatcctgga    600

gaagctgtac gagttcgtat tgcagacagc cgaagtaaga atgcagcctc acttattact    660

gtccctatcg tcgactacgt ggctgcagac aaaaatggaa atgtctggac agctgccagt    720

cctactaatg aacggtgggc ccaaaatgct ccgaccaagc cagctgcaga agcttcagtt    780

atgggtgatc ggaaggtcta tcagaatgaa tttgtcaatt ttattgagac tacatttagc    840

gatgctcata atggcacagg acctgaaatc ttttatagct tagacaacga gccagcactc    900

tggccatcaa cacatccata cgtgcatggt gccaagccga cctacgcgga gatgacacaa    960

cgctccacta atacagccaa gatgataaaa gaccgagtcc ctaatgctaa agtttttggt   1020

aatgttgcat acgggtgggc tgagtacaag aatctgcagg atgctccaga ttccagtaca   1080

cgtgcccctg gccaagtagg taacacatat cttgattatt atctcgcttc gatgaagaat   1140

cagtcagata cagctggaaa acgactagtc gatgtccttg acctccactg gtacccagag   1200

gcacaggatc cagtcagcaa ctgccggatt actgactgtg ctagcgacac cgaaagtaag   1260

attcaagctc gcgtgcaggc gccacgctcg ctgtgggata cgacttatgt tgagaagagt   1320

tggatcactc aatgggacac gaacaatggc ccaattaagc tcattccaga cataaagacg   1380

aggattgcca acaactaccc tggtacctta ctcgccttca gtgagtacta ctacggtggt   1440

ggtaatcaca tttctggtgg aatcgctcaa gctgatgtgt taggtatttt cggtcgtgag   1500

ggagtctatg cagccaatct atgggagctg catggtgaca accagtttat ttggggggca   1560

ttcgatatgt acctcaacta taacggtagt ggcgccagtg ttggtgatcg gagcgtaaca   1620

gccagtacca gtgacgtagc aaagtcttct gtctatgcga tgacgaaaca gggtgataac   1680

tctcgagtat atgtgattgc gatcaataaa acagctagtc cagtttctac gcttgtccag   1740

atagatcatt ctagagcctt gacgactgct gaagtttatc agctgacgtc gacatcatcg   1800

acaccacaaa cgagaccaac aatcagcgtc tcaggcaacc gttttgtcta tgccatgccg   1860

gcatacagcg tgaccaccct ggtagtaagg tag                                1893

<210> 292
<211> 630
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SITE
<222> (69)...(81)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (170)...(173)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (179)...(182)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (197)...(200)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (288)...(291)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (385)...(388)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (536)...(539)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (577)...(580)
<223> N-glycosylation site. Prosite id = PS00001

<400> 292
Met Arg Arg Phe Arg Val Val Phe Leu Gly Leu Phe Val Phe Phe Gly 
1               5                   10                  15      


Ile Val Ile Ala Ser Gln Tyr Gly Gln Thr Ala Ala Ala Tyr Val Pro 
            20                  25                  30          


Thr Val Cys Asp Ala Asp Phe Asn His Asp Thr Thr Val Asn Val Leu 
        35                  40                  45              


Asp Phe Ala Val Met Ala Asn Asn Phe Phe Lys Ser Pro Leu Ile Asn 
    50                  55                  60                  


Pro Glu Ala Asp Leu Asn Arg Asp Gly Tyr Val Asn Ile Phe Asp Phe 
65                  70                  75                  80  


His Tyr Met Gln Met Phe Leu Phe Glu Asn Cys Val Pro Ser Ser Ser 
                85                  90                  95      


Pro Ser Pro Pro Val Ser Pro Ser Pro Ala Val Ser Pro Ser Pro Ser 
            100                 105                 110         


Val Ser Pro Ser Pro Leu Pro Pro Thr Asp Val Thr Met Ser Ile Ser 
        115                 120                 125             


Thr Ser Val Ser Ser Pro Ile Ser Pro Tyr Ile Tyr Gly Thr Asn Ser 
    130                 135                 140                 


Lys Asn Tyr Ala Arg Ser Lys Leu Ile Arg Ala Gly Gly Asn Arg Trp 
145                 150                 155                 160 


Thr Ala Tyr Asn Trp Glu Asn Asn Ala Ser Asn Ala Gly Ser Asp Trp 
                165                 170                 175     


Asn His Ser Ser Asp Asn Tyr Leu Cys Asp Ala Ser Gly Gly Tyr Val 
            180                 185                 190         


Cys Thr Asn Ser Thr Asn Pro Gly Glu Ala Val Arg Val Arg Ile Ala 
        195                 200                 205             


Asp Ser Arg Ser Lys Asn Ala Ala Ser Leu Ile Thr Val Pro Ile Val 
    210                 215                 220                 


Asp Tyr Val Ala Ala Asp Lys Asn Gly Asn Val Trp Thr Ala Ala Ser 
225                 230                 235                 240 


Pro Thr Asn Glu Arg Trp Ala Gln Asn Ala Pro Thr Lys Pro Ala Ala 
                245                 250                 255     


Glu Ala Ser Val Met Gly Asp Arg Lys Val Tyr Gln Asn Glu Phe Val 
            260                 265                 270         


Asn Phe Ile Glu Thr Thr Phe Ser Asp Ala His Asn Gly Thr Gly Pro 
        275                 280                 285             


Glu Ile Phe Tyr Ser Leu Asp Asn Glu Pro Ala Leu Trp Pro Ser Thr 
    290                 295                 300                 


His Pro Tyr Val His Gly Ala Lys Pro Thr Tyr Ala Glu Met Thr Gln 
305                 310                 315                 320 


Arg Ser Thr Asn Thr Ala Lys Met Ile Lys Asp Arg Val Pro Asn Ala 
                325                 330                 335     


Lys Val Phe Gly Asn Val Ala Tyr Gly Trp Ala Glu Tyr Lys Asn Leu 
            340                 345                 350         


Gln Asp Ala Pro Asp Ser Ser Thr Arg Ala Pro Gly Gln Val Gly Asn 
        355                 360                 365             


Thr Tyr Leu Asp Tyr Tyr Leu Ala Ser Met Lys Asn Gln Ser Asp Thr 
    370                 375                 380                 


Ala Gly Lys Arg Leu Val Asp Val Leu Asp Leu His Trp Tyr Pro Glu 
385                 390                 395                 400 


Ala Gln Asp Pro Val Ser Asn Cys Arg Ile Thr Asp Cys Ala Ser Asp 
                405                 410                 415     


Thr Glu Ser Lys Ile Gln Ala Arg Val Gln Ala Pro Arg Ser Leu Trp 
            420                 425                 430         


Asp Thr Thr Tyr Val Glu Lys Ser Trp Ile Thr Gln Trp Asp Thr Asn 
        435                 440                 445             


Asn Gly Pro Ile Lys Leu Ile Pro Asp Ile Lys Thr Arg Ile Ala Asn 
    450                 455                 460                 


Asn Tyr Pro Gly Thr Leu Leu Ala Phe Ser Glu Tyr Tyr Tyr Gly Gly 
465                 470                 475                 480 


Gly Asn His Ile Ser Gly Gly Ile Ala Gln Ala Asp Val Leu Gly Ile 
                485                 490                 495     


Phe Gly Arg Glu Gly Val Tyr Ala Ala Asn Leu Trp Glu Leu His Gly 
            500                 505                 510         


Asp Asn Gln Phe Ile Trp Gly Ala Phe Asp Met Tyr Leu Asn Tyr Asn 
        515                 520                 525             


Gly Ser Gly Ala Ser Val Gly Asp Arg Ser Val Thr Ala Ser Thr Ser 
    530                 535                 540                 


Asp Val Ala Lys Ser Ser Val Tyr Ala Met Thr Lys Gln Gly Asp Asn 
545                 550                 555                 560 


Ser Arg Val Tyr Val Ile Ala Ile Asn Lys Thr Ala Ser Pro Val Ser 
                565                 570                 575     


Thr Leu Val Gln Ile Asp His Ser Arg Ala Leu Thr Thr Ala Glu Val 
            580                 585                 590         


Tyr Gln Leu Thr Ser Thr Ser Ser Thr Pro Gln Thr Arg Pro Thr Ile 
        595                 600                 605             


Ser Val Ser Gly Asn Arg Phe Val Tyr Ala Met Pro Ala Tyr Ser Val 
    610                 615                 620                 


Thr Thr Leu Val Val Arg 
625                 630 


<210> 293
<211> 1125
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 293
atgaaaaaga ttattttaaa atcaggtata ctcttgttgg tagttatttt gattgtttct     60

attttgcaaa ttttacctgt gtttgcccag agcacaccat atgaaaaaga aaaatatcca    120

caccttcttg gcaatcaagc ggtcaaaaaa ccctcggttg ctggcagact gcaaattatt    180

gaaaagaacg gtaaaaagta tttagctgat caaaaaggtg aaataatcca gcttcgtggc    240

atgagtacac atgggcttca gtggtatggt gatattataa acaaaaatgc gtttgaagct    300

ctttcaaaag attgggagtg taatgttgta aggcttgcga tgtatgtggg tgaaggcggg    360

tatgcttcaa atccaagcat caaagaaaaa gttatagaag ggattaagct tgctattgag    420

aatgacatgt atgtaattgt tgactggcat gtattaaatc ccggcgaccc gaatgctgaa    480

atttataaag gggcaaaaga ctttttcaaa gagatagcta caagttttcc caatgactat    540

cacataatat atgaactttg caatgaacca aatccaaatg aaccgggagt agaaaatagc    600

ttggatggtt ggaaaaaggt aaaggcttat gctgaaccca tcataaagat gcttagaagt    660

ttggggaatc agaacattat aattgtaggt tcgccaaact ggagccagag acctgacttt    720

gcaattcaag accctataaa tgataaaaat gttatgtatt cagttcattt ttattctggg    780

actcacaaag ttaatggata tgtttttgaa aatatgaaga atgcatttga aaatggtgtg    840

cccatttttg taactgaatg gggaacaagt ttagcaagcg gtgacggtgg accatatctt    900

gatgaggcgg acaaatggct tgaatatcta aatagcaact atattagctg ggtgaactgg    960

tcactgtcaa acaaaaatga aacatcagct gcttttgttc catatgttag tggcatgcat   1020

gatgcaacat cacttgaccc gggcgatgat aaggtgtggg atataaaaga gctgagtata   1080

tctggagagt atgtgagggc aaggataaaa ggaattgcat ataag                   1125

<210> 294
<211> 375
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (66)...(330)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (184)...(193)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (236)...(239)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (331)...(334)
<223> N-glycosylation site. Prosite id = PS00001

<400> 294
Met Lys Lys Ile Ile Leu Lys Ser Gly Ile Leu Leu Leu Val Val Ile 
1               5                   10                  15      


Leu Ile Val Ser Ile Leu Gln Ile Leu Pro Val Phe Ala Gln Ser Thr 
            20                  25                  30          


Pro Tyr Glu Lys Glu Lys Tyr Pro His Leu Leu Gly Asn Gln Ala Val 
        35                  40                  45              


Lys Lys Pro Ser Val Ala Gly Arg Leu Gln Ile Ile Glu Lys Asn Gly 
    50                  55                  60                  


Lys Lys Tyr Leu Ala Asp Gln Lys Gly Glu Ile Ile Gln Leu Arg Gly 
65                  70                  75                  80  


Met Ser Thr His Gly Leu Gln Trp Tyr Gly Asp Ile Ile Asn Lys Asn 
                85                  90                  95      


Ala Phe Glu Ala Leu Ser Lys Asp Trp Glu Cys Asn Val Val Arg Leu 
            100                 105                 110         


Ala Met Tyr Val Gly Glu Gly Gly Tyr Ala Ser Asn Pro Ser Ile Lys 
        115                 120                 125             


Glu Lys Val Ile Glu Gly Ile Lys Leu Ala Ile Glu Asn Asp Met Tyr 
    130                 135                 140                 


Val Ile Val Asp Trp His Val Leu Asn Pro Gly Asp Pro Asn Ala Glu 
145                 150                 155                 160 


Ile Tyr Lys Gly Ala Lys Asp Phe Phe Lys Glu Ile Ala Thr Ser Phe 
                165                 170                 175     


Pro Asn Asp Tyr His Ile Ile Tyr Glu Leu Cys Asn Glu Pro Asn Pro 
            180                 185                 190         


Asn Glu Pro Gly Val Glu Asn Ser Leu Asp Gly Trp Lys Lys Val Lys 
        195                 200                 205             


Ala Tyr Ala Glu Pro Ile Ile Lys Met Leu Arg Ser Leu Gly Asn Gln 
    210                 215                 220                 


Asn Ile Ile Ile Val Gly Ser Pro Asn Trp Ser Gln Arg Pro Asp Phe 
225                 230                 235                 240 


Ala Ile Gln Asp Pro Ile Asn Asp Lys Asn Val Met Tyr Ser Val His 
                245                 250                 255     


Phe Tyr Ser Gly Thr His Lys Val Asn Gly Tyr Val Phe Glu Asn Met 
            260                 265                 270         


Lys Asn Ala Phe Glu Asn Gly Val Pro Ile Phe Val Thr Glu Trp Gly 
        275                 280                 285             


Thr Ser Leu Ala Ser Gly Asp Gly Gly Pro Tyr Leu Asp Glu Ala Asp 
    290                 295                 300                 


Lys Trp Leu Glu Tyr Leu Asn Ser Asn Tyr Ile Ser Trp Val Asn Trp 
305                 310                 315                 320 


Ser Leu Ser Asn Lys Asn Glu Thr Ser Ala Ala Phe Val Pro Tyr Val 
                325                 330                 335     


Ser Gly Met His Asp Ala Thr Ser Leu Asp Pro Gly Asp Asp Lys Val 
            340                 345                 350         


Trp Asp Ile Lys Glu Leu Ser Ile Ser Gly Glu Tyr Val Arg Ala Arg 
        355                 360                 365             


Ile Lys Gly Ile Ala Tyr Lys 
    370                 375 


<210> 295
<211> 1095
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 295
atgaagcgta ctcgatatgg tgtacgaagc ccgcgttccg caccccggtt cggtgtactg     60

ttcggtgcgg cggcggcagg agtgctcatg actggtgctg gtcaggaggc gcgttcggaa    120

ggcacccggc tgacggctgc agctgcggtc ccggcgtggg tcccgtcggt cccatcccct    180

ggttcgagcc gggcggtgct ggagacggcg gtgctgtacg tggatccgca ctcgacggcc    240

cggtcccagg tggaaagctg gcgcagctcc cgtccgcggg aagcagcgct gctggaccgg    300

gaagtggcga gtcagccctc ggggatctgg ttcggcgact ggaacaggtc ggtccgcaac    360

gacgtgagcg cggtgatgac cagtgccgcg cgccagggcg cgctgccggt tctggtagcg    420

tacaacatcc cgctgcggga ctgcggcagc cactctgcgg gtggtgccgg cagcgccggg    480

gcgtaccgga gctggatcgg cgagttcgcc cggggcctga acggccgccg cgccatcgtg    540

gtcctggaac cggacgcgct tgcgtccacc gaatgcctga gcacggcgca gcggaacgag    600

cggtttgcgc tgctgaagca cgcaacggag acgctctcgg cgcagggcgc tctggtctat    660

atcgacgcag ggcacgcgca gtggctgagt gccgcagaaa cggcgtcccg gctgatccag    720

gcgggcgtgc ggagtgcggc cggattctcg ctgaacgtga gcaacttcat cggcaatgaa    780

gcgaacatcc gcttcggcga tgacgtttcg cggcggacgg gcggagcgca ctacgtcatc    840

gacacgagcc gcaacggggc cggccctacg gcggacctgc agtggtgcaa cccgtcgggt    900

cgtgcgctgg gcacgcggcc cacgacgcgc acggcgcacg cgaagctgga tgcgctgctg    960

tggatcaaga agcccggaga gtcggacggc agctgcaacg gtggtccggc agccggtcag   1020

tggtgggcgg actatgcgct gggtcttgcc cagcggtcca cgccggtcat ggccctggcg   1080

gatgcccggc gctga                                                    1095

<210> 296
<211> 364
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (74)...(344)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (116)...(119)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (141)...(157)
<223> Glycosyl hydrolases family 6 signature 1. Prosite id = PS00655

<220> 
<221> SITE
<222> (181)...(190)
<223> Glycosyl hydrolases family 6 signature 2. Prosite id = PS00656

<220> 
<221> SITE
<222> (255)...(258)
<223> N-glycosylation site. Prosite id = PS00001

<400> 296
Met Lys Arg Thr Arg Tyr Gly Val Arg Ser Pro Arg Ser Ala Pro Arg 
1               5                   10                  15      


Phe Gly Val Leu Phe Gly Ala Ala Ala Ala Gly Val Leu Met Thr Gly 
            20                  25                  30          


Ala Gly Gln Glu Ala Arg Ser Glu Gly Thr Arg Leu Thr Ala Ala Ala 
        35                  40                  45              


Ala Val Pro Ala Trp Val Pro Ser Val Pro Ser Pro Gly Ser Ser Arg 
    50                  55                  60                  


Ala Val Leu Glu Thr Ala Val Leu Tyr Val Asp Pro His Ser Thr Ala 
65                  70                  75                  80  


Arg Ser Gln Val Glu Ser Trp Arg Ser Ser Arg Pro Arg Glu Ala Ala 
                85                  90                  95      


Leu Leu Asp Arg Glu Val Ala Ser Gln Pro Ser Gly Ile Trp Phe Gly 
            100                 105                 110         


Asp Trp Asn Arg Ser Val Arg Asn Asp Val Ser Ala Val Met Thr Ser 
        115                 120                 125             


Ala Ala Arg Gln Gly Ala Leu Pro Val Leu Val Ala Tyr Asn Ile Pro 
    130                 135                 140                 


Leu Arg Asp Cys Gly Ser His Ser Ala Gly Gly Ala Gly Ser Ala Gly 
145                 150                 155                 160 


Ala Tyr Arg Ser Trp Ile Gly Glu Phe Ala Arg Gly Leu Asn Gly Arg 
                165                 170                 175     


Arg Ala Ile Val Val Leu Glu Pro Asp Ala Leu Ala Ser Thr Glu Cys 
            180                 185                 190         


Leu Ser Thr Ala Gln Arg Asn Glu Arg Phe Ala Leu Leu Lys His Ala 
        195                 200                 205             


Thr Glu Thr Leu Ser Ala Gln Gly Ala Leu Val Tyr Ile Asp Ala Gly 
    210                 215                 220                 


His Ala Gln Trp Leu Ser Ala Ala Glu Thr Ala Ser Arg Leu Ile Gln 
225                 230                 235                 240 


Ala Gly Val Arg Ser Ala Ala Gly Phe Ser Leu Asn Val Ser Asn Phe 
                245                 250                 255     


Ile Gly Asn Glu Ala Asn Ile Arg Phe Gly Asp Asp Val Ser Arg Arg 
            260                 265                 270         


Thr Gly Gly Ala His Tyr Val Ile Asp Thr Ser Arg Asn Gly Ala Gly 
        275                 280                 285             


Pro Thr Ala Asp Leu Gln Trp Cys Asn Pro Ser Gly Arg Ala Leu Gly 
    290                 295                 300                 


Thr Arg Pro Thr Thr Arg Thr Ala His Ala Lys Leu Asp Ala Leu Leu 
305                 310                 315                 320 


Trp Ile Lys Lys Pro Gly Glu Ser Asp Gly Ser Cys Asn Gly Gly Pro 
                325                 330                 335     


Ala Ala Gly Gln Trp Trp Ala Asp Tyr Ala Leu Gly Leu Ala Gln Arg 
            340                 345                 350         


Ser Thr Pro Val Met Ala Leu Ala Asp Ala Arg Arg 
        355                 360                 


<210> 297
<211> 2601
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 297
atgagaaaac tactgactgc tttattagta acggtggcaa taggagcaaa cgcccaaatg     60

ctgccgtacc agaacccaca gctaaccgca gagcaacgtg ccgacgactt actaggtcgt    120

ctgacccttg acgagaaagt gaagctgatg atggacacct cgccagccat tccgcgactg    180

ggcatcccgc aattccagtg gtggaacgag gcgctgcatg gtgtgggacg aaacggattt    240

gccaccgtat tccccatcac aatggcaatg gcagcctcat gggacgatgc tttggtgcac    300

aaagtgttta ccgctgtcag cgacgaggct cgcgtgaagg cccaacaggc caagcgttct    360

ggaaatatcc aacgctatca gagtctgtca ttttggacac ccaacatcaa cattttccgc    420

gatcctcgtt ggggtcgtgg acaagagacc tatggcgaag acccgtatct taccactcaa    480

atgggactgg ctgtggtgcg cggtctacaa ggtgttggct atcagggcga agacctgggc    540

gtgagcaaat atcgtaagtt actggcctgt gccaagcact ttgccgtaca tagcggtcct    600

gagtggaacc gtcacacctt caacattgag aacctgcccg agcgcgacct ctgggaaacc    660

tatctgcctg ccttcaaggc tttggtgcaa gaaggaaacg tggcagaagt gatgtgtgcc    720

tatcaacgta ttgacgggca ggcctgctgc gcccagaccc gctacgaaca gcagatattg    780

cgtgacgaat ggggattcga cggactcatc accagcgact gtggtgctat ccgcgacttc    840

ctgcccaagt ggcacaatgt ggccaaagat ggtgcagaag ccagtgccaa agccgtattg    900

gctggtaccg acgtggaatg tggttcggaa tataagaacc tgcctgcagc catcaagcgt    960

ggcgacatca aggagtccga ccttgacaag agtctgcgac gtctgctcat cgcccgcttt   1020

gaactgggcg acttcgacag cgacgaagca aacgcatgga ctaagatacc agaaagcgtc   1080

atcgcttcga aggaacataa gaaactggcg ttggatatgg ctcagaaaag cattgtattg   1140

ctgaaaaaca atggagtatt gccattgact caatcgcagc ccgcagaact ggtggtaatg   1200

ggaccgaatg ccaacgattc cgtgatgatg tggggcaact attcgggcta ccccacccgt   1260

accatcaccg ctctggaagg tatcaataga tatttcaaag cacagacacc cacagccaag   1320

gttcgctata tccaaggctg cggactgacc cgcaacgaat cattcatcag tcgtttcgat   1380

aaggtacagg gccccttggg atatcagggt atgcaagcca tctattggaa caataccgag   1440

atgaagggag agcctgtgac taccgttcac atcaaagatc ccattcactt gagtaatgga   1500

ggcaacaccg tttttgcccc tggcgtaaac ttagagaatt tctctgctcg tctggatggt   1560

attttcaaac ccactgagga tgaaacgctc atcttcgaca tcagtgcaga tgacaaaatg   1620

cgattgattg tgaacagcga cacattggtt gacatctgga aggtgcgcca tcgtattcag   1680

ggagacaaga aagaactgaa ggtgaaggca ggcgagcact atcgcatcca gatagactac   1740

gtgcaggaaa cgggttatgg tgccttgaac ttcgacatca agaagaaggt aaacccaacc   1800

caacaggaat tgttggcaca gattggcaat gcagaaacca ttatcttcgt gggtggcatc   1860

tcacccagtc tggaaggtga agaaatgaaa gtcagcgaac ccggcttcaa gggtggcgac   1920

cgtaccagca ttgaactgcc tcaagcacag cgcgacatgc tggccatgct ccacaaggct   1980

ggcaagaagg ttgtcttcgt caactgctct ggctctgcaa tggcactgac tccagaactt   2040

gagacctgcg acgccatcat ccagtggtgg tatgctggag aactgggtgg atcagcatta   2100

gctggtgttc ttatgggtga cagcaatcct agtggtaaac tacccatcac gttctataag   2160

agcaccgaag aattgccaga cttcctggat tacacgatga agaaccgtac ctatcgctat   2220

tatacgggcg aggcactgtt ccccttcggt ttcggactga gctacaccac ctttgccatc   2280

tcaaagccag tctataaaaa taacaaggta cgtgtgaccg ttaagaatac tggtgctcgc   2340

aagggattgg aaaccgtaca ggtttatgtt aggaacatgg cagacaagca gggtccactg   2400

aaaactctga aagcctacaa gcaagtagaa gtggaggttg gcgaaagcaa ggtggtagat   2460

atcgatttgc cccgcaacag ctttgaggga tgggacgaga agaccaacac catgcgtgtt   2520

gtaccaggaa agtacgagct gatggtaggt tcgtcgagtg ccgataagga tttgaaaaag   2580

gtggttgtta cagtgaagta a                                             2601

<210> 298
<211> 866
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(18)

<220> 
<221> DOMAIN
<222> (54)...(308)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (378)...(757)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> DOMAIN
<222> (467)...(611)
<223> PA14 domain

<220> 
<221> SITE
<222> (411)...(414)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (419)...(422)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (458)...(461)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (484)...(487)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (520)...(523)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (678)...(681)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (746)...(749)
<223> N-glycosylation site. Prosite id = PS00001

<400> 298
Met Arg Lys Leu Leu Thr Ala Leu Leu Val Thr Val Ala Ile Gly Ala 
1               5                   10                  15      


Asn Ala Gln Met Leu Pro Tyr Gln Asn Pro Gln Leu Thr Ala Glu Gln 
            20                  25                  30          


Arg Ala Asp Asp Leu Leu Gly Arg Leu Thr Leu Asp Glu Lys Val Lys 
        35                  40                  45              


Leu Met Met Asp Thr Ser Pro Ala Ile Pro Arg Leu Gly Ile Pro Gln 
    50                  55                  60                  


Phe Gln Trp Trp Asn Glu Ala Leu His Gly Val Gly Arg Asn Gly Phe 
65                  70                  75                  80  


Ala Thr Val Phe Pro Ile Thr Met Ala Met Ala Ala Ser Trp Asp Asp 
                85                  90                  95      


Ala Leu Val His Lys Val Phe Thr Ala Val Ser Asp Glu Ala Arg Val 
            100                 105                 110         


Lys Ala Gln Gln Ala Lys Arg Ser Gly Asn Ile Gln Arg Tyr Gln Ser 
        115                 120                 125             


Leu Ser Phe Trp Thr Pro Asn Ile Asn Ile Phe Arg Asp Pro Arg Trp 
    130                 135                 140                 


Gly Arg Gly Gln Glu Thr Tyr Gly Glu Asp Pro Tyr Leu Thr Thr Gln 
145                 150                 155                 160 


Met Gly Leu Ala Val Val Arg Gly Leu Gln Gly Val Gly Tyr Gln Gly 
                165                 170                 175     


Glu Asp Leu Gly Val Ser Lys Tyr Arg Lys Leu Leu Ala Cys Ala Lys 
            180                 185                 190         


His Phe Ala Val His Ser Gly Pro Glu Trp Asn Arg His Thr Phe Asn 
        195                 200                 205             


Ile Glu Asn Leu Pro Glu Arg Asp Leu Trp Glu Thr Tyr Leu Pro Ala 
    210                 215                 220                 


Phe Lys Ala Leu Val Gln Glu Gly Asn Val Ala Glu Val Met Cys Ala 
225                 230                 235                 240 


Tyr Gln Arg Ile Asp Gly Gln Ala Cys Cys Ala Gln Thr Arg Tyr Glu 
                245                 250                 255     


Gln Gln Ile Leu Arg Asp Glu Trp Gly Phe Asp Gly Leu Ile Thr Ser 
            260                 265                 270         


Asp Cys Gly Ala Ile Arg Asp Phe Leu Pro Lys Trp His Asn Val Ala 
        275                 280                 285             


Lys Asp Gly Ala Glu Ala Ser Ala Lys Ala Val Leu Ala Gly Thr Asp 
    290                 295                 300                 


Val Glu Cys Gly Ser Glu Tyr Lys Asn Leu Pro Ala Ala Ile Lys Arg 
305                 310                 315                 320 


Gly Asp Ile Lys Glu Ser Asp Leu Asp Lys Ser Leu Arg Arg Leu Leu 
                325                 330                 335     


Ile Ala Arg Phe Glu Leu Gly Asp Phe Asp Ser Asp Glu Ala Asn Ala 
            340                 345                 350         


Trp Thr Lys Ile Pro Glu Ser Val Ile Ala Ser Lys Glu His Lys Lys 
        355                 360                 365             


Leu Ala Leu Asp Met Ala Gln Lys Ser Ile Val Leu Leu Lys Asn Asn 
    370                 375                 380                 


Gly Val Leu Pro Leu Thr Gln Ser Gln Pro Ala Glu Leu Val Val Met 
385                 390                 395                 400 


Gly Pro Asn Ala Asn Asp Ser Val Met Met Trp Gly Asn Tyr Ser Gly 
                405                 410                 415     


Tyr Pro Thr Arg Thr Ile Thr Ala Leu Glu Gly Ile Asn Arg Tyr Phe 
            420                 425                 430         


Lys Ala Gln Thr Pro Thr Ala Lys Val Arg Tyr Ile Gln Gly Cys Gly 
        435                 440                 445             


Leu Thr Arg Asn Glu Ser Phe Ile Ser Arg Phe Asp Lys Val Gln Gly 
    450                 455                 460                 


Pro Leu Gly Tyr Gln Gly Met Gln Ala Ile Tyr Trp Asn Asn Thr Glu 
465                 470                 475                 480 


Met Lys Gly Glu Pro Val Thr Thr Val His Ile Lys Asp Pro Ile His 
                485                 490                 495     


Leu Ser Asn Gly Gly Asn Thr Val Phe Ala Pro Gly Val Asn Leu Glu 
            500                 505                 510         


Asn Phe Ser Ala Arg Leu Asp Gly Ile Phe Lys Pro Thr Glu Asp Glu 
        515                 520                 525             


Thr Leu Ile Phe Asp Ile Ser Ala Asp Asp Lys Met Arg Leu Ile Val 
    530                 535                 540                 


Asn Ser Asp Thr Leu Val Asp Ile Trp Lys Val Arg His Arg Ile Gln 
545                 550                 555                 560 


Gly Asp Lys Lys Glu Leu Lys Val Lys Ala Gly Glu His Tyr Arg Ile 
                565                 570                 575     


Gln Ile Asp Tyr Val Gln Glu Thr Gly Tyr Gly Ala Leu Asn Phe Asp 
            580                 585                 590         


Ile Lys Lys Lys Val Asn Pro Thr Gln Gln Glu Leu Leu Ala Gln Ile 
        595                 600                 605             


Gly Asn Ala Glu Thr Ile Ile Phe Val Gly Gly Ile Ser Pro Ser Leu 
    610                 615                 620                 


Glu Gly Glu Glu Met Lys Val Ser Glu Pro Gly Phe Lys Gly Gly Asp 
625                 630                 635                 640 


Arg Thr Ser Ile Glu Leu Pro Gln Ala Gln Arg Asp Met Leu Ala Met 
                645                 650                 655     


Leu His Lys Ala Gly Lys Lys Val Val Phe Val Asn Cys Ser Gly Ser 
            660                 665                 670         


Ala Met Ala Leu Thr Pro Glu Leu Glu Thr Cys Asp Ala Ile Ile Gln 
        675                 680                 685             


Trp Trp Tyr Ala Gly Glu Leu Gly Gly Ser Ala Leu Ala Gly Val Leu 
    690                 695                 700                 


Met Gly Asp Ser Asn Pro Ser Gly Lys Leu Pro Ile Thr Phe Tyr Lys 
705                 710                 715                 720 


Ser Thr Glu Glu Leu Pro Asp Phe Leu Asp Tyr Thr Met Lys Asn Arg 
                725                 730                 735     


Thr Tyr Arg Tyr Tyr Thr Gly Glu Ala Leu Phe Pro Phe Gly Phe Gly 
            740                 745                 750         


Leu Ser Tyr Thr Thr Phe Ala Ile Ser Lys Pro Val Tyr Lys Asn Asn 
        755                 760                 765             


Lys Val Arg Val Thr Val Lys Asn Thr Gly Ala Arg Lys Gly Leu Glu 
    770                 775                 780                 


Thr Val Gln Val Tyr Val Arg Asn Met Ala Asp Lys Gln Gly Pro Leu 
785                 790                 795                 800 


Lys Thr Leu Lys Ala Tyr Lys Gln Val Glu Val Glu Val Gly Glu Ser 
                805                 810                 815     


Lys Val Val Asp Ile Asp Leu Pro Arg Asn Ser Phe Glu Gly Trp Asp 
            820                 825                 830         


Glu Lys Thr Asn Thr Met Arg Val Val Pro Gly Lys Tyr Glu Leu Met 
        835                 840                 845             


Val Gly Ser Ser Ser Ala Asp Lys Asp Leu Lys Lys Val Val Val Thr 
    850                 855                 860                 


Val Lys 
865     


<210> 299
<211> 1014
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 299
atgacatcaa aacacttttt caaaattacc cttatgtcaa tacttttatt caccacaaca     60

cttgcgcaaa ccttttccca aaccccggtt gaattacatg gccgtctcag ggtttctggc    120

aaccagatcg tggatgagca tggcaatcca ttacagttga tgggtatgag cctcttctgg    180

tcggtttggg gtgcagagaa atactacaac gccgatgtgg tgaactggct ggtaaaagac    240

tggaagatcg acctgattcg tgcagcaata gctgttgagg taaaccagga aggagatggg    300

aataaaggat ggcttttcaa caaagagggg cagtacaaac tggccgaaac cattatccag    360

gccgctatcg acaatggcat atatgtcctg atcgattggc acacccatcg tacccatacc    420

gatgcagcga ttgagttttt cggctacctg gcccaaaaat atggcaggta ccccaacctg    480

atctgggaaa cattcaacga accgataaac cagggctggc aggagatcgc tgactttacc    540

aatgcggtaa ccggagccat tcgcccacac agcgataatc tgatcattgc aggtacccgc    600

cgctggagcc agctggtgaa tgagcctgcc gacaatccgc tcccagataa aaacactgcc    660

tattcgctgc acttttatgc cggcacccat gggcaggaac tccgcgatat tggcgactat    720

gcactctcaa aaggaattgc ccttttcatc accgagtggg gcacctccca tgccgatggc    780

ggccgcgaca tgatcgtgca caaagaaaaa gcacaggagt ggatcgactg ggcagtggag    840

cgtaacctga gtatggccaa ctggtcactt tttgataagg aagaagcttc tgcagcactc    900

cagcccgagg cacccgtcaa cggaaactgg gatccggaaa aacacctatc agtatcaggc    960

cggtttgtca gagaccagat catccggatc aataacaaaa agtacaaaaa gtag         1014

<210> 300
<211> 337
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(22)

<220> 
<221> DOMAIN
<222> (41)...(298)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (162)...(171)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (286)...(289)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 300
Met Thr Ser Lys His Phe Phe Lys Ile Thr Leu Met Ser Ile Leu Leu 
1               5                   10                  15      


Phe Thr Thr Thr Leu Ala Gln Thr Phe Ser Gln Thr Pro Val Glu Leu 
            20                  25                  30          


His Gly Arg Leu Arg Val Ser Gly Asn Gln Ile Val Asp Glu His Gly 
        35                  40                  45              


Asn Pro Leu Gln Leu Met Gly Met Ser Leu Phe Trp Ser Val Trp Gly 
    50                  55                  60                  


Ala Glu Lys Tyr Tyr Asn Ala Asp Val Val Asn Trp Leu Val Lys Asp 
65                  70                  75                  80  


Trp Lys Ile Asp Leu Ile Arg Ala Ala Ile Ala Val Glu Val Asn Gln 
                85                  90                  95      


Glu Gly Asp Gly Asn Lys Gly Trp Leu Phe Asn Lys Glu Gly Gln Tyr 
            100                 105                 110         


Lys Leu Ala Glu Thr Ile Ile Gln Ala Ala Ile Asp Asn Gly Ile Tyr 
        115                 120                 125             


Val Leu Ile Asp Trp His Thr His Arg Thr His Thr Asp Ala Ala Ile 
    130                 135                 140                 


Glu Phe Phe Gly Tyr Leu Ala Gln Lys Tyr Gly Arg Tyr Pro Asn Leu 
145                 150                 155                 160 


Ile Trp Glu Thr Phe Asn Glu Pro Ile Asn Gln Gly Trp Gln Glu Ile 
                165                 170                 175     


Ala Asp Phe Thr Asn Ala Val Thr Gly Ala Ile Arg Pro His Ser Asp 
            180                 185                 190         


Asn Leu Ile Ile Ala Gly Thr Arg Arg Trp Ser Gln Leu Val Asn Glu 
        195                 200                 205             


Pro Ala Asp Asn Pro Leu Pro Asp Lys Asn Thr Ala Tyr Ser Leu His 
    210                 215                 220                 


Phe Tyr Ala Gly Thr His Gly Gln Glu Leu Arg Asp Ile Gly Asp Tyr 
225                 230                 235                 240 


Ala Leu Ser Lys Gly Ile Ala Leu Phe Ile Thr Glu Trp Gly Thr Ser 
                245                 250                 255     


His Ala Asp Gly Gly Arg Asp Met Ile Val His Lys Glu Lys Ala Gln 
            260                 265                 270         


Glu Trp Ile Asp Trp Ala Val Glu Arg Asn Leu Ser Met Ala Asn Trp 
        275                 280                 285             


Ser Leu Phe Asp Lys Glu Glu Ala Ser Ala Ala Leu Gln Pro Glu Ala 
    290                 295                 300                 


Pro Val Asn Gly Asn Trp Asp Pro Glu Lys His Leu Ser Val Ser Gly 
305                 310                 315                 320 


Arg Phe Val Arg Asp Gln Ile Ile Arg Ile Asn Asn Lys Lys Tyr Lys 
                325                 330                 335     


Lys 
    


<210> 301
<211> 1374
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 301
atgtttcaaa gtttgaagat gcgtacatta tcgttcctgc tcctgatggc gttgcttgct     60

tcgtttctgg cactacccac cgacgtggcc catgccatca acacgccctg gttgagcgtc    120

tcaggcaggt tcatcaagga cccggcgggc aataacgtgg tcctgcgagg ggtctcgctg    180

gtggatattg gtgaggtgaa ccttggacgg acgcgcaatg tcagccaggt gatcaatatg    240

gctaccaatg aagccgatgg ctggtatgcg cgtgtagtgc gcctgccagt ctatccgaat    300

gcgatcgata gttcgcccgg ctggctggca aacccagatg cttatttcaa taaccatctc    360

aacccggcta ttcagaactg tgtggcgcgc cagatctact gcatcatcga ctggcactat    420

atcgcggact ataacaacag cacgatcgac acaaacacac gcgccttctg gaactatgtg    480

gcaccacgat atgccaatac tccgaatgta atcttcgaat tgtacaatga accagtcaac    540

cctgataact ggtcaacgtg gaagcaatgg gcgcagccct gggtagacat catccgctcc    600

catgcgccga acaacttgat cctgatcggt ggtccgcgct ggtcgcagaa tctttcgagc    660

gcggcgagca gtccatttac tggcagtaat cttgtgtatg ttgcccacat ctatcctgaa    720

cacggcggac aaagcaactg ggattcatgg ttcggcaatg ccgcgaactc tgttcccttc    780

tttgtcacgg aatggggctg gatacagggc ggcgccaccc caactaatgg cacacagtct    840

ggctacggtg ttccgttcag taactacctt gaatcaaagg gcttgagttg gaccgcctgg    900

gtctttgatc aatattggga tcctaaaatg tgggatgaga actggaacct gctcggcggt    960

gagaattaca tgggacagtt caccaaagac ttcctgttcc agcaccgcaa cgacaacctg   1020

cccggcggca gcgcgaccaa cacacctcct ggtccaacct tcacgccgac gcgcaccaac   1080

acaccgggca gcggcaccct gaaagtgcag gtgtcggctg gcggcacgga caacaaccag   1140

cagacggcct tccgcttccg ggtgcagaac accggttcga gcgcggtctc gaacgtctcg   1200

acgcgcctct acttcacact ggatggcagc aatgcggcgt cgaactacac attggagaaa   1260

tattgggatc aatcgggagt cgcaacggtt tccggtccca cacaggcgtc cggttcgacc   1320

tattacttta ccgtgaacta tggcacggcc tctctggggg ctggcaattc atgg         1374

<210> 302
<211> 458
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (43)...(310)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (369)...(446)
<223> Cellulose binding domain

<220> 
<221> SITE
<222> (74)...(77)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (147)...(150)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (172)...(181)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (185)...(188)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (220)...(223)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (280)...(283)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (395)...(398)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (404)...(407)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (421)...(424)
<223> N-glycosylation site. Prosite id = PS00001

<400> 302
Met Phe Gln Ser Leu Lys Met Arg Thr Leu Ser Phe Leu Leu Leu Met 
1               5                   10                  15      


Ala Leu Leu Ala Ser Phe Leu Ala Leu Pro Thr Asp Val Ala His Ala 
            20                  25                  30          


Ile Asn Thr Pro Trp Leu Ser Val Ser Gly Arg Phe Ile Lys Asp Pro 
        35                  40                  45              


Ala Gly Asn Asn Val Val Leu Arg Gly Val Ser Leu Val Asp Ile Gly 
    50                  55                  60                  


Glu Val Asn Leu Gly Arg Thr Arg Asn Val Ser Gln Val Ile Asn Met 
65                  70                  75                  80  


Ala Thr Asn Glu Ala Asp Gly Trp Tyr Ala Arg Val Val Arg Leu Pro 
                85                  90                  95      


Val Tyr Pro Asn Ala Ile Asp Ser Ser Pro Gly Trp Leu Ala Asn Pro 
            100                 105                 110         


Asp Ala Tyr Phe Asn Asn His Leu Asn Pro Ala Ile Gln Asn Cys Val 
        115                 120                 125             


Ala Arg Gln Ile Tyr Cys Ile Ile Asp Trp His Tyr Ile Ala Asp Tyr 
    130                 135                 140                 


Asn Asn Ser Thr Ile Asp Thr Asn Thr Arg Ala Phe Trp Asn Tyr Val 
145                 150                 155                 160 


Ala Pro Arg Tyr Ala Asn Thr Pro Asn Val Ile Phe Glu Leu Tyr Asn 
                165                 170                 175     


Glu Pro Val Asn Pro Asp Asn Trp Ser Thr Trp Lys Gln Trp Ala Gln 
            180                 185                 190         


Pro Trp Val Asp Ile Ile Arg Ser His Ala Pro Asn Asn Leu Ile Leu 
        195                 200                 205             


Ile Gly Gly Pro Arg Trp Ser Gln Asn Leu Ser Ser Ala Ala Ser Ser 
    210                 215                 220                 


Pro Phe Thr Gly Ser Asn Leu Val Tyr Val Ala His Ile Tyr Pro Glu 
225                 230                 235                 240 


His Gly Gly Gln Ser Asn Trp Asp Ser Trp Phe Gly Asn Ala Ala Asn 
                245                 250                 255     


Ser Val Pro Phe Phe Val Thr Glu Trp Gly Trp Ile Gln Gly Gly Ala 
            260                 265                 270         


Thr Pro Thr Asn Gly Thr Gln Ser Gly Tyr Gly Val Pro Phe Ser Asn 
        275                 280                 285             


Tyr Leu Glu Ser Lys Gly Leu Ser Trp Thr Ala Trp Val Phe Asp Gln 
    290                 295                 300                 


Tyr Trp Asp Pro Lys Met Trp Asp Glu Asn Trp Asn Leu Leu Gly Gly 
305                 310                 315                 320 


Glu Asn Tyr Met Gly Gln Phe Thr Lys Asp Phe Leu Phe Gln His Arg 
                325                 330                 335     


Asn Asp Asn Leu Pro Gly Gly Ser Ala Thr Asn Thr Pro Pro Gly Pro 
            340                 345                 350         


Thr Phe Thr Pro Thr Arg Thr Asn Thr Pro Gly Ser Gly Thr Leu Lys 
        355                 360                 365             


Val Gln Val Ser Ala Gly Gly Thr Asp Asn Asn Gln Gln Thr Ala Phe 
    370                 375                 380                 


Arg Phe Arg Val Gln Asn Thr Gly Ser Ser Ala Val Ser Asn Val Ser 
385                 390                 395                 400 


Thr Arg Leu Tyr Phe Thr Leu Asp Gly Ser Asn Ala Ala Ser Asn Tyr 
                405                 410                 415     


Thr Leu Glu Lys Tyr Trp Asp Gln Ser Gly Val Ala Thr Val Ser Gly 
            420                 425                 430         


Pro Thr Gln Ala Ser Gly Ser Thr Tyr Tyr Phe Thr Val Asn Tyr Gly 
        435                 440                 445             


Thr Ala Ser Leu Gly Ala Gly Asn Ser Trp 
    450                 455             


<210> 303
<211> 2481
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 303
atggcattgt ccaccgtttc aaaagtcatg ctgctgacct gtgcagcggt cctgctgacc     60

ataccgggat gtaactccgc catggagcag ccggaaaaat ggccgtatga ggacgtttcc    120

gtcacgccgg agagctggcc gcagctgacg gtggcggcgc tcgacccggc gatcgaggcg    180

cagattgatg cgatcctgcc gaagctgaca ctggagcaga aggtcgggca ggtgatccag    240

ggcgacagcg aattcctgac gccggacgat gtgaaacgat accggttggg atcggtgttg    300

agcggcggga actcggcgcc gggcgagcag ccgtttgcgg atgcggcgac ctggctggcg    360

gcggcggatg cgtatttcga agcatcgctg gataccgagg gcgtagagat cgccattccg    420

gtgatctggg ggattgatgc cgtacacgga catacgaacc ttgtcggcgc gaccgtgttt    480

ccgcacaatg tcggccttgg cgcgacgaat gatccggagc tgatccggga tattgcggcg    540

gtgacggcgg cagagctggt tgtgtccggc catgactgga cgttcgcccc gacactggcc    600

gtgccgcgcg atgaccggtg gggccgggcg tatgaaggtt tctcggaaga cccggaaatt    660

gtgcgcaatt ttgccggaaa ggtggtcgag gggctgcagg gcgtgcaggg cgcggaaggc    720

tggctgcgcg aggggcgcgt gatctcgagt gcgaagcatt tcgttgccga tggcggcacg    780

gagaatggcc gcgaccaggg cgatgcgcgc atcagcgaag cagagctgcg cgacattcat    840

gcagcgggct atatgccggc aattgagtcc ggtgtgcaga cgatcatggc ttctttctcc    900

agctggaacg gcatcaagat ccatggcagc catgctctgc tcacggacgt tctcaaggat    960

cggctcgggt ttaccgggtt cattgtcggc gactggaatg cgcatgggca gattccggga   1020

tgtacgaaca cggattgtcc gcaggcactg ctggcgggga tcgacatgta catggcgccg   1080

gacagctgga aggggatgta tgagtcgacg ctgaagcatg ttcaggacgg caccattccg   1140

atggagcggc tggatgatgc cgtgcggcgc atcctgcgtg tgaagattgc ctatggcctg   1200

ttcgacaagc cgaagccgag tctgcgggcg ggcgcgggag acacttcgct gctcggctcg   1260

gcggcgcacc gggatgtggc gcggcgggct gtgcgccagt cgatggtgct gctcaagaac   1320

aatgacggca cgctgccgct ggcggcgaag cagacggtgc tggttgtggg cgacggggcc   1380

gacagtatca gcaaggtgtc gggcggctgg acgctgtcct ggcagggcgg cgggtatgac   1440

aatgcgcatt ttcccaacgg gcagtcgatc ctgtcgggca tccgcgaggt ggtcgaagcg   1500

gcaggcggca ccgtgatcca cgatccggcg ggcacgagcg gggccagggc ggatgtggtg   1560

atcgcggtat acggtgaaga tccgtatgca gagttccagg gcgaccggga caatgtggac   1620

tttgtgccgg agggctttga tacggggctg ctggcagggt accgggcaaa gggcgcgaag   1680

gtggtgtcgt tgttcctgtc gggccggcct ctgtggacca acccggaaat caatgcgtcg   1740

gatgcgttcg tggccgcgtg gtggccaggc tctgaaggcg gcggcgtggc ggacctcttg   1800

ttccggacga agccggaata tgacttcacg gggcggcttt ccttctcctg gccggcgagc   1860

gcggtgcaga cgccgctgaa ccggggcgac gcgaattatg caccgcaatt tgcctatggt   1920

tatgggcttt cctatgccgc gccgcaggtg gtgggtgtac tgacggaaga gtccgggctg   1980

gcggcggatg caaacggcgc tcgcggcgcg gtgtttgtgc gcgggcaggc ggtggcaccg   2040

tggagcatgc ggttcgaggg gtcggacggc ccgcccgcgc gggtggatca tggggcgcag   2100

gaagatgcgc tggcgctttc ggcgaacagt gcgccggcgt cgctctcctt cgagacggcg   2160

ggcgccgggc ttgactggtc gcgggagtcg aacggggcga tggaactcag cttctttgcg   2220

cgctcgatga cggcggagcc ggcgagcgtg aacgtgtcga tgggatgcgc gctcgagggg   2280

gcttgtgcgc ggcaggtgcc ggtggcggtc ggcgccgagt gggcggagca ccggatttcg   2340

ctcagctgtt ttgcggatgc cggtgtcgac atgtcaaagc tgacgagcgc ggttcgcttc   2400

gggctggacg gcgggcgtgt ggcgtttgcc ggcattgcgc tcgccgagga caaggacggc   2460

cagccgaact gcggcggctg a                                             2481

<210> 304
<211> 826
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (132)...(359)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (435)...(647)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (586)...(589)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (762)...(765)
<223> N-glycosylation site. Prosite id = PS00001

<400> 304
Met Ala Leu Ser Thr Val Ser Lys Val Met Leu Leu Thr Cys Ala Ala 
1               5                   10                  15      


Val Leu Leu Thr Ile Pro Gly Cys Asn Ser Ala Met Glu Gln Pro Glu 
            20                  25                  30          


Lys Trp Pro Tyr Glu Asp Val Ser Val Thr Pro Glu Ser Trp Pro Gln 
        35                  40                  45              


Leu Thr Val Ala Ala Leu Asp Pro Ala Ile Glu Ala Gln Ile Asp Ala 
    50                  55                  60                  


Ile Leu Pro Lys Leu Thr Leu Glu Gln Lys Val Gly Gln Val Ile Gln 
65                  70                  75                  80  


Gly Asp Ser Glu Phe Leu Thr Pro Asp Asp Val Lys Arg Tyr Arg Leu 
                85                  90                  95      


Gly Ser Val Leu Ser Gly Gly Asn Ser Ala Pro Gly Glu Gln Pro Phe 
            100                 105                 110         


Ala Asp Ala Ala Thr Trp Leu Ala Ala Ala Asp Ala Tyr Phe Glu Ala 
        115                 120                 125             


Ser Leu Asp Thr Glu Gly Val Glu Ile Ala Ile Pro Val Ile Trp Gly 
    130                 135                 140                 


Ile Asp Ala Val His Gly His Thr Asn Leu Val Gly Ala Thr Val Phe 
145                 150                 155                 160 


Pro His Asn Val Gly Leu Gly Ala Thr Asn Asp Pro Glu Leu Ile Arg 
                165                 170                 175     


Asp Ile Ala Ala Val Thr Ala Ala Glu Leu Val Val Ser Gly His Asp 
            180                 185                 190         


Trp Thr Phe Ala Pro Thr Leu Ala Val Pro Arg Asp Asp Arg Trp Gly 
        195                 200                 205             


Arg Ala Tyr Glu Gly Phe Ser Glu Asp Pro Glu Ile Val Arg Asn Phe 
    210                 215                 220                 


Ala Gly Lys Val Val Glu Gly Leu Gln Gly Val Gln Gly Ala Glu Gly 
225                 230                 235                 240 


Trp Leu Arg Glu Gly Arg Val Ile Ser Ser Ala Lys His Phe Val Ala 
                245                 250                 255     


Asp Gly Gly Thr Glu Asn Gly Arg Asp Gln Gly Asp Ala Arg Ile Ser 
            260                 265                 270         


Glu Ala Glu Leu Arg Asp Ile His Ala Ala Gly Tyr Met Pro Ala Ile 
        275                 280                 285             


Glu Ser Gly Val Gln Thr Ile Met Ala Ser Phe Ser Ser Trp Asn Gly 
    290                 295                 300                 


Ile Lys Ile His Gly Ser His Ala Leu Leu Thr Asp Val Leu Lys Asp 
305                 310                 315                 320 


Arg Leu Gly Phe Thr Gly Phe Ile Val Gly Asp Trp Asn Ala His Gly 
                325                 330                 335     


Gln Ile Pro Gly Cys Thr Asn Thr Asp Cys Pro Gln Ala Leu Leu Ala 
            340                 345                 350         


Gly Ile Asp Met Tyr Met Ala Pro Asp Ser Trp Lys Gly Met Tyr Glu 
        355                 360                 365             


Ser Thr Leu Lys His Val Gln Asp Gly Thr Ile Pro Met Glu Arg Leu 
    370                 375                 380                 


Asp Asp Ala Val Arg Arg Ile Leu Arg Val Lys Ile Ala Tyr Gly Leu 
385                 390                 395                 400 


Phe Asp Lys Pro Lys Pro Ser Leu Arg Ala Gly Ala Gly Asp Thr Ser 
                405                 410                 415     


Leu Leu Gly Ser Ala Ala His Arg Asp Val Ala Arg Arg Ala Val Arg 
            420                 425                 430         


Gln Ser Met Val Leu Leu Lys Asn Asn Asp Gly Thr Leu Pro Leu Ala 
        435                 440                 445             


Ala Lys Gln Thr Val Leu Val Val Gly Asp Gly Ala Asp Ser Ile Ser 
    450                 455                 460                 


Lys Val Ser Gly Gly Trp Thr Leu Ser Trp Gln Gly Gly Gly Tyr Asp 
465                 470                 475                 480 


Asn Ala His Phe Pro Asn Gly Gln Ser Ile Leu Ser Gly Ile Arg Glu 
                485                 490                 495     


Val Val Glu Ala Ala Gly Gly Thr Val Ile His Asp Pro Ala Gly Thr 
            500                 505                 510         


Ser Gly Ala Arg Ala Asp Val Val Ile Ala Val Tyr Gly Glu Asp Pro 
        515                 520                 525             


Tyr Ala Glu Phe Gln Gly Asp Arg Asp Asn Val Asp Phe Val Pro Glu 
    530                 535                 540                 


Gly Phe Asp Thr Gly Leu Leu Ala Gly Tyr Arg Ala Lys Gly Ala Lys 
545                 550                 555                 560 


Val Val Ser Leu Phe Leu Ser Gly Arg Pro Leu Trp Thr Asn Pro Glu 
                565                 570                 575     


Ile Asn Ala Ser Asp Ala Phe Val Ala Ala Trp Trp Pro Gly Ser Glu 
            580                 585                 590         


Gly Gly Gly Val Ala Asp Leu Leu Phe Arg Thr Lys Pro Glu Tyr Asp 
        595                 600                 605             


Phe Thr Gly Arg Leu Ser Phe Ser Trp Pro Ala Ser Ala Val Gln Thr 
    610                 615                 620                 


Pro Leu Asn Arg Gly Asp Ala Asn Tyr Ala Pro Gln Phe Ala Tyr Gly 
625                 630                 635                 640 


Tyr Gly Leu Ser Tyr Ala Ala Pro Gln Val Val Gly Val Leu Thr Glu 
                645                 650                 655     


Glu Ser Gly Leu Ala Ala Asp Ala Asn Gly Ala Arg Gly Ala Val Phe 
            660                 665                 670         


Val Arg Gly Gln Ala Val Ala Pro Trp Ser Met Arg Phe Glu Gly Ser 
        675                 680                 685             


Asp Gly Pro Pro Ala Arg Val Asp His Gly Ala Gln Glu Asp Ala Leu 
    690                 695                 700                 


Ala Leu Ser Ala Asn Ser Ala Pro Ala Ser Leu Ser Phe Glu Thr Ala 
705                 710                 715                 720 


Gly Ala Gly Leu Asp Trp Ser Arg Glu Ser Asn Gly Ala Met Glu Leu 
                725                 730                 735     


Ser Phe Phe Ala Arg Ser Met Thr Ala Glu Pro Ala Ser Val Asn Val 
            740                 745                 750         


Ser Met Gly Cys Ala Leu Glu Gly Ala Cys Ala Arg Gln Val Pro Val 
        755                 760                 765             


Ala Val Gly Ala Glu Trp Ala Glu His Arg Ile Ser Leu Ser Cys Phe 
    770                 775                 780                 


Ala Asp Ala Gly Val Asp Met Ser Lys Leu Thr Ser Ala Val Arg Phe 
785                 790                 795                 800 


Gly Leu Asp Gly Gly Arg Val Ala Phe Ala Gly Ile Ala Leu Ala Glu 
                805                 810                 815     


Asp Lys Asp Gly Gln Pro Asn Cys Gly Gly 
            820                 825     


<210> 305
<211> 1119
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 305
atggcgatcg gaatctcggc gacgatgctg ctcgcgatgc cccagcaggc cgacgggctg     60

aacggcgtcc ctgcctcgcg cttcgcccac ttccagaagg gcgtcaacat gagccactgg    120

ttcagccagt ggggcgacct gacccccgcc cggttcacct cctacgtgac cgagcgcgac    180

atggacctca tccggcaggc cggcttcgac cacgtgcggc tgccgttcaa cccggacccg    240

ttctggaacg atgcggaccc cgcgaagatc cgccccgttc cgctgtcgta ctgcaagaag    300

gcggtcgaag ggttcctgaa gcggggcctc gccgtggtgg tggacatgca cccggaggac    360

ccgttcaaga accgcatcgc caccgacgac gcgttcgtgg ccaaggccgc cgccttctgg    420

cgcgcgttcg ccaaggagat ggcctcgttc gacccggagc gggtgatgct cgagatcatg    480

aacgagccct cgatcgacgg tcgcggcgcc aaggacccgg tccgacgctg gcaccagatc    540

aacacccagc tggccaaggc gatccgcgag ggcgcgccgc ggcacaccat cgtggccgcc    600

ggcggaggat ggacgggcgt cgaccagttc gacacgctgg aaccgatccc gctgcggaac    660

gtggtctaca acttccactg ctacgacccg ttcgtgttca cacaccaggg agcgacctgg    720

ggatgggaca tgtcccgcct gatgaaggcc gttccctatc cctcttcgcc ggaggccgtg    780

caaccggcca tcgccgccag cgaccccaag gtccgcgaca tcctgatcgg ctacggcaac    840

gagcggtgga accgcgcgcg cctgcgctcc cacctcaaga aggccgcgga ctgggctgcc    900

aagcaccgcg tgtacctgac ctgcaacgag ttcggcgtgt acatcccgaa cgcaccgcgg    960

gagtcgcggc tggcatggct gcgcgacatg gcctcggtgc tgcgggagta ccgcatcggc   1020

tgggccatgt gggactacgc cggcgggttc gcggtggcgc tcggcgagcc gggcaagcgg   1080

accatggacc gggacgtgct gaaggcgctc gggctgtga                          1119

<210> 306
<211> 372
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(17)

<220> 
<221> DOMAIN
<222> (38)...(354)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (36)...(39)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (157)...(166)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<400> 306
Met Ala Ile Gly Ile Ser Ala Thr Met Leu Leu Ala Met Pro Gln Gln 
1               5                   10                  15      


Ala Asp Gly Leu Asn Gly Val Pro Ala Ser Arg Phe Ala His Phe Gln 
            20                  25                  30          


Lys Gly Val Asn Met Ser His Trp Phe Ser Gln Trp Gly Asp Leu Thr 
        35                  40                  45              


Pro Ala Arg Phe Thr Ser Tyr Val Thr Glu Arg Asp Met Asp Leu Ile 
    50                  55                  60                  


Arg Gln Ala Gly Phe Asp His Val Arg Leu Pro Phe Asn Pro Asp Pro 
65                  70                  75                  80  


Phe Trp Asn Asp Ala Asp Pro Ala Lys Ile Arg Pro Val Pro Leu Ser 
                85                  90                  95      


Tyr Cys Lys Lys Ala Val Glu Gly Phe Leu Lys Arg Gly Leu Ala Val 
            100                 105                 110         


Val Val Asp Met His Pro Glu Asp Pro Phe Lys Asn Arg Ile Ala Thr 
        115                 120                 125             


Asp Asp Ala Phe Val Ala Lys Ala Ala Ala Phe Trp Arg Ala Phe Ala 
    130                 135                 140                 


Lys Glu Met Ala Ser Phe Asp Pro Glu Arg Val Met Leu Glu Ile Met 
145                 150                 155                 160 


Asn Glu Pro Ser Ile Asp Gly Arg Gly Ala Lys Asp Pro Val Arg Arg 
                165                 170                 175     


Trp His Gln Ile Asn Thr Gln Leu Ala Lys Ala Ile Arg Glu Gly Ala 
            180                 185                 190         


Pro Arg His Thr Ile Val Ala Ala Gly Gly Gly Trp Thr Gly Val Asp 
        195                 200                 205             


Gln Phe Asp Thr Leu Glu Pro Ile Pro Leu Arg Asn Val Val Tyr Asn 
    210                 215                 220                 


Phe His Cys Tyr Asp Pro Phe Val Phe Thr His Gln Gly Ala Thr Trp 
225                 230                 235                 240 


Gly Trp Asp Met Ser Arg Leu Met Lys Ala Val Pro Tyr Pro Ser Ser 
                245                 250                 255     


Pro Glu Ala Val Gln Pro Ala Ile Ala Ala Ser Asp Pro Lys Val Arg 
            260                 265                 270         


Asp Ile Leu Ile Gly Tyr Gly Asn Glu Arg Trp Asn Arg Ala Arg Leu 
        275                 280                 285             


Arg Ser His Leu Lys Lys Ala Ala Asp Trp Ala Ala Lys His Arg Val 
    290                 295                 300                 


Tyr Leu Thr Cys Asn Glu Phe Gly Val Tyr Ile Pro Asn Ala Pro Arg 
305                 310                 315                 320 


Glu Ser Arg Leu Ala Trp Leu Arg Asp Met Ala Ser Val Leu Arg Glu 
                325                 330                 335     


Tyr Arg Ile Gly Trp Ala Met Trp Asp Tyr Ala Gly Gly Phe Ala Val 
            340                 345                 350         


Ala Leu Gly Glu Pro Gly Lys Arg Thr Met Asp Arg Asp Val Leu Lys 
        355                 360                 365             


Ala Leu Gly Leu 
    370         


<210> 307
<211> 2820
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 307
atgtcctgcc gcaccctgat gagtaggcgt gtaggatggg gacttttatt gtggggaggt     60

ttattcctca gaaccggttc ggttacagga caaacttaca attatgccga agtcctgcag    120

aaatctatgt ttttctacga atgtcaggag tctaaaattg ccccgggcaa tcgggtgaca    180

tggcgagcta atgcagccat gaacgatggg agcgatgttg gaaaagacct gacaggagga    240

tggtttgatg caggtgacca tgtgaaattt aattttccca tggcgtttac cgctacggcg    300

ctggcgtggg gagctattga ctttgctcag ggatacatta gttccgggca aatgcaatac    360

ctgaaacgta atctgcgcta cgtcaatgac tatttcatta aatgtcacac agcccccaac    420

gaattgtatg gtcaggtggg taatggaggc cttgaccatg ccttttgggg accacccgaa    480

gtcatgcgca tggctaggcc tgcctataaa attgatgcgt caaaacccgg atcagatctg    540

gctgccgaaa cagctgctgc aatggctgcc gccagcattg ttttcaaatc cgacgatcct    600

acctatagcg ctactttgct gaatcatgca aaacagctgt tttcttttgc cgaaacctat    660

aaaggaaaat attccgacgc tattaccgat gctgcaggat attataactc ctggagcggc    720

tataacgatg aactggtatg gggagctata tggctttacc gggctaccgg cgatgcaacc    780

tatctatcta aggcagaatc ctattacgac aatctgggta atcagggtca ggaacccgtt    840

aaagcctaca aatggaccat tgcatgggat gacaaatcct atggctgtta tgccctactg    900

gccaaattga caggtaagga aaaatacaaa attgacgccg aacgttttct cgactattgg    960

accgatggtt ataatggttc ccggattact tataccccgg gaggactcgc tttcctcgat   1020

atatggggat cgttgcgcta tgctatgaat actgcctttg ttgctgccta ctatgccgat   1080

gcagccactt cagctgctaa aaccacaaaa tatctcaact ttgctaaaca acaactgcat   1140

tatgctcttg gatccaatcc gagcaacaga agctatgtct gtggctttgg caataatcct   1200

cccgttaatc ctcaccatag aggtgcacac ggagcatggt ctaataatgt tcaaggacct   1260

cctaccgaaa cacgacatat cctctacggc gcattagtgg gtggaccagg cagtaatgac   1320

tcctatactg acgaccgatc caattacacc aataacgaag tagcatgtga ctacaatgct   1380

cttttctccg gactgcttgc aaagttcgtc attgattatg gaggcacacc gttagccaac   1440

ttccctgttc gtgaaacccc aaaagatgaa tatttcgttg aagcaaaagc aaacgctaca   1500

ggaaccaatt tctccgaatg gacggtatgg gtatacaacc acactgcatg gccagcccgt   1560

gaaggttctg aatataaatt cagattatac gtaaatattt cggaaggact ggctgcaggc   1620

tatactgcct caaattatgt tgtgcaaacc aataatgccg gtgtggtaaa ctttacccaa   1680

cttttagctg ctgatgcagc taacggcatc tattataccg aagtaacctt taaacctggt   1740

accgaaattt atcctggcgg gcaacagtat gacaagaagg aagctcagat gcgtattagt   1800

ttgcccaatg ctccggcttc tgcatgggat ccgactaacg acccgtcatg ggcgggaatc   1860

acctctacct tgaaacaaat gccgggtata cccatgtatg tagatggtgt aaaggtattt   1920

ggtaatgagc ctgtcccagg tcagacagtt cccgtcaccg gagtaaccgt atcgcctacc   1980

accctgagtc tgactgtagg acagaccagt acactcaccg ctaccgtatc gccggctaat   2040

gctaccaaca aaaacgtcac ctggagcagc agcaatacca gcgtagccac ggtaagctca   2100

acaggcgttg tcacagccgt agcagccggt tcggccacca tcaccgtaac cacagtcgat   2160

ggcgctaaaa cagccacctg cgccgtaacg gtaacaggca gcaccaacgt tcccgtcacc   2220

ggagtaaccg tatcgcccac cacgctgagt ctgaccgtag ggcagaccgc taccctcacc   2280

gctaccgtat cgccggctaa tgctaccaac aagaacgtta cctggagcag cagcaatacc   2340

agcgtagcca cggtaagttc aacaggcgta gttactgccg tagcggccgg ttcggccacc   2400

atcaccgtaa ccaccgtcga tggagctaaa accgctacct gcaccgtaac ggtaacgggc   2460

agcactaccg tacccgtcac cggcgtaact gtatcgccta ccaccctgag tctgaccgtt   2520

ggacaaaccg ctaccctgac cgctaccgta tcgccagctg atgctaccaa caagaacgtc   2580

acctggagca gcagcaatac cagcgtagcc acggtaagct caacaggcgt agtcactgcc   2640

gtagcggccg gttcagctac catcaccgtg accacagtcg atggggctaa aactgctacc   2700

tgtgccgtga ccgtaaccgc cggaggttcc accaccccct gcagtaatcc ggtaagcaaa   2760

accctacctc tggtacagga tggtgccggc gaattcaggt tgagtaatag ttttaattaa   2820

<210> 308
<211> 939
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(30)

<220> 
<221> DOMAIN
<222> (34)...(469)
<223> Glycosyl hydrolase family 9

<220> 
<221> DOMAIN
<222> (491)...(576)
<223> Cellulose binding domain

<220> 
<221> DOMAIN
<222> (651)...(729)
<223> Bacterial Ig-like domain (group 2)

<220> 
<221> DOMAIN
<222> (738)...(816)
<223> Bacterial Ig-like domain (group 2)

<220> 
<221> DOMAIN
<222> (825)...(903)
<223> Bacterial Ig-like domain (group 2)

<220> 
<221> SITE
<222> (445)...(448)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (454)...(457)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (505)...(508)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (510)...(513)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (520)...(523)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (540)...(543)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (565)...(568)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (690)...(693)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (695)...(698)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (702)...(705)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (778)...(781)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (783)...(786)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (872)...(875)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (879)...(882)
<223> N-glycosylation site. Prosite id = PS00001

<400> 308
Met Ser Cys Arg Thr Leu Met Ser Arg Arg Val Gly Trp Gly Leu Leu 
1               5                   10                  15      


Leu Trp Gly Gly Leu Phe Leu Arg Thr Gly Ser Val Thr Gly Gln Thr 
            20                  25                  30          


Tyr Asn Tyr Ala Glu Val Leu Gln Lys Ser Met Phe Phe Tyr Glu Cys 
        35                  40                  45              


Gln Glu Ser Lys Ile Ala Pro Gly Asn Arg Val Thr Trp Arg Ala Asn 
    50                  55                  60                  


Ala Ala Met Asn Asp Gly Ser Asp Val Gly Lys Asp Leu Thr Gly Gly 
65                  70                  75                  80  


Trp Phe Asp Ala Gly Asp His Val Lys Phe Asn Phe Pro Met Ala Phe 
                85                  90                  95      


Thr Ala Thr Ala Leu Ala Trp Gly Ala Ile Asp Phe Ala Gln Gly Tyr 
            100                 105                 110         


Ile Ser Ser Gly Gln Met Gln Tyr Leu Lys Arg Asn Leu Arg Tyr Val 
        115                 120                 125             


Asn Asp Tyr Phe Ile Lys Cys His Thr Ala Pro Asn Glu Leu Tyr Gly 
    130                 135                 140                 


Gln Val Gly Asn Gly Gly Leu Asp His Ala Phe Trp Gly Pro Pro Glu 
145                 150                 155                 160 


Val Met Arg Met Ala Arg Pro Ala Tyr Lys Ile Asp Ala Ser Lys Pro 
                165                 170                 175     


Gly Ser Asp Leu Ala Ala Glu Thr Ala Ala Ala Met Ala Ala Ala Ser 
            180                 185                 190         


Ile Val Phe Lys Ser Asp Asp Pro Thr Tyr Ser Ala Thr Leu Leu Asn 
        195                 200                 205             


His Ala Lys Gln Leu Phe Ser Phe Ala Glu Thr Tyr Lys Gly Lys Tyr 
    210                 215                 220                 


Ser Asp Ala Ile Thr Asp Ala Ala Gly Tyr Tyr Asn Ser Trp Ser Gly 
225                 230                 235                 240 


Tyr Asn Asp Glu Leu Val Trp Gly Ala Ile Trp Leu Tyr Arg Ala Thr 
                245                 250                 255     


Gly Asp Ala Thr Tyr Leu Ser Lys Ala Glu Ser Tyr Tyr Asp Asn Leu 
            260                 265                 270         


Gly Asn Gln Gly Gln Glu Pro Val Lys Ala Tyr Lys Trp Thr Ile Ala 
        275                 280                 285             


Trp Asp Asp Lys Ser Tyr Gly Cys Tyr Ala Leu Leu Ala Lys Leu Thr 
    290                 295                 300                 


Gly Lys Glu Lys Tyr Lys Ile Asp Ala Glu Arg Phe Leu Asp Tyr Trp 
305                 310                 315                 320 


Thr Asp Gly Tyr Asn Gly Ser Arg Ile Thr Tyr Thr Pro Gly Gly Leu 
                325                 330                 335     


Ala Phe Leu Asp Ile Trp Gly Ser Leu Arg Tyr Ala Met Asn Thr Ala 
            340                 345                 350         


Phe Val Ala Ala Tyr Tyr Ala Asp Ala Ala Thr Ser Ala Ala Lys Thr 
        355                 360                 365             


Thr Lys Tyr Leu Asn Phe Ala Lys Gln Gln Leu His Tyr Ala Leu Gly 
    370                 375                 380                 


Ser Asn Pro Ser Asn Arg Ser Tyr Val Cys Gly Phe Gly Asn Asn Pro 
385                 390                 395                 400 


Pro Val Asn Pro His His Arg Gly Ala His Gly Ala Trp Ser Asn Asn 
                405                 410                 415     


Val Gln Gly Pro Pro Thr Glu Thr Arg His Ile Leu Tyr Gly Ala Leu 
            420                 425                 430         


Val Gly Gly Pro Gly Ser Asn Asp Ser Tyr Thr Asp Asp Arg Ser Asn 
        435                 440                 445             


Tyr Thr Asn Asn Glu Val Ala Cys Asp Tyr Asn Ala Leu Phe Ser Gly 
    450                 455                 460                 


Leu Leu Ala Lys Phe Val Ile Asp Tyr Gly Gly Thr Pro Leu Ala Asn 
465                 470                 475                 480 


Phe Pro Val Arg Glu Thr Pro Lys Asp Glu Tyr Phe Val Glu Ala Lys 
                485                 490                 495     


Ala Asn Ala Thr Gly Thr Asn Phe Ser Glu Trp Thr Val Trp Val Tyr 
            500                 505                 510         


Asn His Thr Ala Trp Pro Ala Arg Glu Gly Ser Glu Tyr Lys Phe Arg 
        515                 520                 525             


Leu Tyr Val Asn Ile Ser Glu Gly Leu Ala Ala Gly Tyr Thr Ala Ser 
    530                 535                 540                 


Asn Tyr Val Val Gln Thr Asn Asn Ala Gly Val Val Asn Phe Thr Gln 
545                 550                 555                 560 


Leu Leu Ala Ala Asp Ala Ala Asn Gly Ile Tyr Tyr Thr Glu Val Thr 
                565                 570                 575     


Phe Lys Pro Gly Thr Glu Ile Tyr Pro Gly Gly Gln Gln Tyr Asp Lys 
            580                 585                 590         


Lys Glu Ala Gln Met Arg Ile Ser Leu Pro Asn Ala Pro Ala Ser Ala 
        595                 600                 605             


Trp Asp Pro Thr Asn Asp Pro Ser Trp Ala Gly Ile Thr Ser Thr Leu 
    610                 615                 620                 


Lys Gln Met Pro Gly Ile Pro Met Tyr Val Asp Gly Val Lys Val Phe 
625                 630                 635                 640 


Gly Asn Glu Pro Val Pro Gly Gln Thr Val Pro Val Thr Gly Val Thr 
                645                 650                 655     


Val Ser Pro Thr Thr Leu Ser Leu Thr Val Gly Gln Thr Ser Thr Leu 
            660                 665                 670         


Thr Ala Thr Val Ser Pro Ala Asn Ala Thr Asn Lys Asn Val Thr Trp 
        675                 680                 685             


Ser Ser Ser Asn Thr Ser Val Ala Thr Val Ser Ser Thr Gly Val Val 
    690                 695                 700                 


Thr Ala Val Ala Ala Gly Ser Ala Thr Ile Thr Val Thr Thr Val Asp 
705                 710                 715                 720 


Gly Ala Lys Thr Ala Thr Cys Ala Val Thr Val Thr Gly Ser Thr Asn 
                725                 730                 735     


Val Pro Val Thr Gly Val Thr Val Ser Pro Thr Thr Leu Ser Leu Thr 
            740                 745                 750         


Val Gly Gln Thr Ala Thr Leu Thr Ala Thr Val Ser Pro Ala Asn Ala 
        755                 760                 765             


Thr Asn Lys Asn Val Thr Trp Ser Ser Ser Asn Thr Ser Val Ala Thr 
    770                 775                 780                 


Val Ser Ser Thr Gly Val Val Thr Ala Val Ala Ala Gly Ser Ala Thr 
785                 790                 795                 800 


Ile Thr Val Thr Thr Val Asp Gly Ala Lys Thr Ala Thr Cys Thr Val 
                805                 810                 815     


Thr Val Thr Gly Ser Thr Thr Val Pro Val Thr Gly Val Thr Val Ser 
            820                 825                 830         


Pro Thr Thr Leu Ser Leu Thr Val Gly Gln Thr Ala Thr Leu Thr Ala 
        835                 840                 845             


Thr Val Ser Pro Ala Asp Ala Thr Asn Lys Asn Val Thr Trp Ser Ser 
    850                 855                 860                 


Ser Asn Thr Ser Val Ala Thr Val Ser Ser Thr Gly Val Val Thr Ala 
865                 870                 875                 880 


Val Ala Ala Gly Ser Ala Thr Ile Thr Val Thr Thr Val Asp Gly Ala 
                885                 890                 895     


Lys Thr Ala Thr Cys Ala Val Thr Val Thr Ala Gly Gly Ser Thr Thr 
            900                 905                 910         


Pro Cys Ser Asn Pro Val Ser Lys Thr Leu Pro Leu Val Gln Asp Gly 
        915                 920                 925             


Ala Gly Glu Phe Arg Leu Ser Asn Ser Phe Asn 
    930                 935                 


<210> 309
<211> 1725
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 309
atgctgaaat taagtgataa cctaactttc ttgaaaagca aaccattttt tcttaatgaa     60

aaagaaatga agtgggtgga gaaaacactt caatccatgt ccttacatga aaaagtaggg    120

caattatttt gtcccattgg cggttcagat aataaacaag aattagaagc ctttattaag    180

gaatatcatc ctggcggcat catgtaccgt cctaatacag gagcaaaaat acaggaaaca    240

catcggttgt tacaagagct atccccggta cctttattaa tttctgctaa cttagaggcc    300

ggtggtaatg ggattgctac ggatggtact tacttcggaa agcaaatgca ggtggctgca    360

acagataatg aagaaatggc ctataaatta ggattagttg ctggccgtga aggccgtgtg    420

gccggttgta actgggcttt tgcaccaatt gttgatattg atatgaacta tcgaaaccca    480

attacaaacg taagaacgta tgggtctgac ccaattagag ttgcccaaat gtctaaagct    540

tttatgaagg gaattcatga aagcggactc gcagcagctg ttaagcattt cccaggggat    600

ggagtggatg atagagatca gcatctttta tcatctgtaa acaccttatc taccgaagaa    660

tgggatcaaa cctttgggat ggtttatcaa gaaatgatag acagtggggc aaaatcgatt    720

atggcgggcc atatcatgct ccctgaatat tcaagagaac tattgccggg tattgaagac    780

gaacaaatca tgcccgccac actagcacca gagttactta atggtttatt aagggaaaag    840

ttaggtttta atggtttaat cgtgactgat gcatccccta tgttagggtt cactacttcg    900

gaaagaagag aaattgctgt tcctaaggcg attgcttcgg gctgtgatat gtttctcttc    960

aaccgtaaca taaaagaaga ttatgagttc atgctgaatg gaattgaaac tggaattcta   1020

accttggaaa gagtagatga agctgttact agagtacttg ctcttaaagc atctctaggt   1080

ctgaatgtac aaaaggaatt gggaatatta gtacctgaag aagcggaatt gtcggtatta   1140

caatctgaag aacatttgga ttgggcaaga gaatgtgcag accaatcggt tacattagta   1200

aaggatacac aaaaactgct gcctattagt gctgatcagt ataaacgggt tcgactttat   1260

gtattgggtg atcaagaagg agggctaaag gaaggcggct ccgtcactca accgtttatc   1320

gattctctta aaaatgctgg ctttgaagta gatttatata atgacaagca agttaatttc   1380

caagaactgt ttatgagtgt aaacgagttt aaaaagaact atgatctgat catttatgtc   1440

gccaaccttg aaaccgctag taaccaaacg acagtcagaa ttaattggca gcagccgcta   1500

aatgccaacg ctccatggtt tgttaaagat ataccgacat tatttatttc ggttgctaac   1560

ccataccatc tacaggacgt accaatggtt aagacctata taaatgctta ttcatctaat   1620

gaatatgtgg tagaagcaat tgtagataaa atcttaggaa aatcagagtt taaagggaag   1680

aatcccgtcg atccgttttg tgggaaatgg gataccagac tttaa                   1725

<210> 310
<211> 574
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (87)...(320)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> SITE
<222> (7)...(10)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (495)...(498)
<223> N-glycosylation site. Prosite id = PS00001

<400> 310
Met Leu Lys Leu Ser Asp Asn Leu Thr Phe Leu Lys Ser Lys Pro Phe 
1               5                   10                  15      


Phe Leu Asn Glu Lys Glu Met Lys Trp Val Glu Lys Thr Leu Gln Ser 
            20                  25                  30          


Met Ser Leu His Glu Lys Val Gly Gln Leu Phe Cys Pro Ile Gly Gly 
        35                  40                  45              


Ser Asp Asn Lys Gln Glu Leu Glu Ala Phe Ile Lys Glu Tyr His Pro 
    50                  55                  60                  


Gly Gly Ile Met Tyr Arg Pro Asn Thr Gly Ala Lys Ile Gln Glu Thr 
65                  70                  75                  80  


His Arg Leu Leu Gln Glu Leu Ser Pro Val Pro Leu Leu Ile Ser Ala 
                85                  90                  95      


Asn Leu Glu Ala Gly Gly Asn Gly Ile Ala Thr Asp Gly Thr Tyr Phe 
            100                 105                 110         


Gly Lys Gln Met Gln Val Ala Ala Thr Asp Asn Glu Glu Met Ala Tyr 
        115                 120                 125             


Lys Leu Gly Leu Val Ala Gly Arg Glu Gly Arg Val Ala Gly Cys Asn 
    130                 135                 140                 


Trp Ala Phe Ala Pro Ile Val Asp Ile Asp Met Asn Tyr Arg Asn Pro 
145                 150                 155                 160 


Ile Thr Asn Val Arg Thr Tyr Gly Ser Asp Pro Ile Arg Val Ala Gln 
                165                 170                 175     


Met Ser Lys Ala Phe Met Lys Gly Ile His Glu Ser Gly Leu Ala Ala 
            180                 185                 190         


Ala Val Lys His Phe Pro Gly Asp Gly Val Asp Asp Arg Asp Gln His 
        195                 200                 205             


Leu Leu Ser Ser Val Asn Thr Leu Ser Thr Glu Glu Trp Asp Gln Thr 
    210                 215                 220                 


Phe Gly Met Val Tyr Gln Glu Met Ile Asp Ser Gly Ala Lys Ser Ile 
225                 230                 235                 240 


Met Ala Gly His Ile Met Leu Pro Glu Tyr Ser Arg Glu Leu Leu Pro 
                245                 250                 255     


Gly Ile Glu Asp Glu Gln Ile Met Pro Ala Thr Leu Ala Pro Glu Leu 
            260                 265                 270         


Leu Asn Gly Leu Leu Arg Glu Lys Leu Gly Phe Asn Gly Leu Ile Val 
        275                 280                 285             


Thr Asp Ala Ser Pro Met Leu Gly Phe Thr Thr Ser Glu Arg Arg Glu 
    290                 295                 300                 


Ile Ala Val Pro Lys Ala Ile Ala Ser Gly Cys Asp Met Phe Leu Phe 
305                 310                 315                 320 


Asn Arg Asn Ile Lys Glu Asp Tyr Glu Phe Met Leu Asn Gly Ile Glu 
                325                 330                 335     


Thr Gly Ile Leu Thr Leu Glu Arg Val Asp Glu Ala Val Thr Arg Val 
            340                 345                 350         


Leu Ala Leu Lys Ala Ser Leu Gly Leu Asn Val Gln Lys Glu Leu Gly 
        355                 360                 365             


Ile Leu Val Pro Glu Glu Ala Glu Leu Ser Val Leu Gln Ser Glu Glu 
    370                 375                 380                 


His Leu Asp Trp Ala Arg Glu Cys Ala Asp Gln Ser Val Thr Leu Val 
385                 390                 395                 400 


Lys Asp Thr Gln Lys Leu Leu Pro Ile Ser Ala Asp Gln Tyr Lys Arg 
                405                 410                 415     


Val Arg Leu Tyr Val Leu Gly Asp Gln Glu Gly Gly Leu Lys Glu Gly 
            420                 425                 430         


Gly Ser Val Thr Gln Pro Phe Ile Asp Ser Leu Lys Asn Ala Gly Phe 
        435                 440                 445             


Glu Val Asp Leu Tyr Asn Asp Lys Gln Val Asn Phe Gln Glu Leu Phe 
    450                 455                 460                 


Met Ser Val Asn Glu Phe Lys Lys Asn Tyr Asp Leu Ile Ile Tyr Val 
465                 470                 475                 480 


Ala Asn Leu Glu Thr Ala Ser Asn Gln Thr Thr Val Arg Ile Asn Trp 
                485                 490                 495     


Gln Gln Pro Leu Asn Ala Asn Ala Pro Trp Phe Val Lys Asp Ile Pro 
            500                 505                 510         


Thr Leu Phe Ile Ser Val Ala Asn Pro Tyr His Leu Gln Asp Val Pro 
        515                 520                 525             


Met Val Lys Thr Tyr Ile Asn Ala Tyr Ser Ser Asn Glu Tyr Val Val 
    530                 535                 540                 


Glu Ala Ile Val Asp Lys Ile Leu Gly Lys Ser Glu Phe Lys Gly Lys 
545                 550                 555                 560 


Asn Pro Val Asp Pro Phe Cys Gly Lys Trp Asp Thr Arg Leu 
                565                 570                 


<210> 311
<211> 2298
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 311
atgatcaatc aagatataaa acaattaatc tcacaaatga ccttggaaga aaaagctggt     60

ctttgttctg gattagattt ttggaattta aaaggtatcg aaagactggg aataccctcg    120

ataatggtaa ccgatggtcc gcatggactc cgtaaacaaa aaatgggagc agatcattta    180

gggctgtttg acagtattcc tgcgacatgt ttcccatctg cagccggttt agctagtact    240

tggaataaag agttaatata tgaagttggg gttgcattag gaaaggaatg ccaggcagag    300

gatgtggcaa tacttcttgg ccctggagca aacattaagc gctcacccct ttgtggcaga    360

aactttgaat atttttcgga agatccattc ctttcatcag aaatggctgc gtcccatatc    420

aagggtgttc aaagtgaggg ggttgggaca tcacttaagc acttcgctgc aaataatcaa    480

gaacaccgaa gaatgtcgac agatgctatt gtggatgaaa ggacgttgcg agaaatatat    540

ttggccagct ttgaaaacgc tgtaaagaaa gcgcagccat ggactgtgat gtgcgcctac    600

aacaaggtca atggagactt tgcatcagaa aataaaacat tgttaactga catcctgcga    660

gatgagtggg gctttgaagg aattgttgtt tctgactggg gggcggttaa tgaacctgtt    720

gacggattaa atgccgggtt agacctggaa atgccttcaa gtagtgggat tggtgaaaag    780

aaaatcatca atgctgtaag aaatggtcag cttttagaag ataaactaga tcaggcagtt    840

gaaagaattc tacgtattat cttaatggca gtagaaaaca agaaagaaac cgctgactat    900

gataaagaac aacatcataa gcttgcaaga aaagcagcaa gtgaaagtat ggttttatta    960

aagaatgaag ataatatcct gccgttaaag aaagaaggaa ccatttcgat tattggttca   1020

tttgccaaaa aaccaaggta tcaaggcggt ggaagctcac acattaaccc gacaaagctt   1080

gaaaatatct atgaagaaat agagaaaaca gcgggccaaa atgtgaacgt tttatacgcg   1140

gaaggatatc atcttgaaaa ggatttaatc gatgatcaat taattgaaga ggcaaaaaaa   1200

acggcagcaa aatccgatgt aaccgtattg tttgtaggtc ttcctgaccg atatgaatct   1260

gaaggatatg atagagagca cctgaatata ccggagaatc accgtctttt agtcgaagcg   1320

gttgcggaag tacaaaagaa tatagttgtt gtactaagta atggggcacc gcttgttatg   1380

ccatggcttg ataaggtgaa ggggctgctg gaaagttacc tgggaggtca ggcactagga   1440

ggtgcgattg cagacatcct attcggagaa gttaatccaa gtggaaagct tgccgaaact   1500

tttcccgtaa aattaggtga caatccttct tatctcaact ttccaggaga gagggataaa   1560

gttgagtata aagaaggcat ctttgttggt tatcgttatt acgatacaaa acagattgag   1620

ccgctgtttc catttggata tggtttaagc tatacaaact ttgaatataa aaaccttgta   1680

attgataaaa aagaaataaa agatacagaa attgtcacag ttaccgtgaa tgtgaaaaat   1740

acaggaaaag tgcctgggaa agaaatcatc cagttatatg taaaagatat aaaaagcagt   1800

gtagttcgtc ctgaaaaaga gttaaaaggc tttggaaagg tttccttaca gcctggggaa   1860

gacaaaacta tttcctttaa attggataaa cgcgcatttg catattacaa cacggaattg   1920

aaggattggt atgtagaatc aggagaattt gaaattttgg tggggaaatc gtccagagaa   1980

attgaactaa cagaaaaaat tatggttcac tctacttccc cagttttctt ggaggttcac   2040

cgaaattcca cggtcggaga tcttttaact gatccaattc taggtgaaaa agctaatgct   2100

ctaattagag agctaacaaa aggaagtcca ttatttgatg ctgggtcaga tcacggagag   2160

ggtgcagaaa tgatggaagc gatgttaaaa tacatgcctt tgcgtgctct tatgaatttt   2220

agtggtggag acattaccga agagaaacta actgaattta ttaaggaact taattcaact   2280

aattttgtaa gcctttaa                                                 2298

<210> 312
<211> 765
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (30)...(252)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (317)...(531)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (214)...(217)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (221)...(238)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (692)...(695)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (750)...(753)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (769)...(772)
<223> N-glycosylation site. Prosite id = PS00001

<400> 312
Met Ile Asn Gln Asp Ile Lys Gln Leu Ile Ser Gln Met Thr Leu Glu 
1               5                   10                  15      


Glu Lys Ala Gly Leu Cys Ser Gly Leu Asp Phe Trp Asn Leu Lys Gly 
            20                  25                  30          


Ile Glu Arg Leu Gly Ile Pro Ser Ile Met Val Thr Asp Gly Pro His 
        35                  40                  45              


Gly Leu Arg Lys Gln Lys Met Gly Ala Asp His Leu Gly Leu Phe Asp 
    50                  55                  60                  


Ser Ile Pro Ala Thr Cys Phe Pro Ser Ala Ala Gly Leu Ala Ser Thr 
65                  70                  75                  80  


Trp Asn Lys Glu Leu Ile Tyr Glu Val Gly Val Ala Leu Gly Lys Glu 
                85                  90                  95      


Cys Gln Ala Glu Asp Val Ala Ile Leu Leu Gly Pro Gly Ala Asn Ile 
            100                 105                 110         


Lys Arg Ser Pro Leu Cys Gly Arg Asn Phe Glu Tyr Phe Ser Glu Asp 
        115                 120                 125             


Pro Phe Leu Ser Ser Glu Met Ala Ala Ser His Ile Lys Gly Val Gln 
    130                 135                 140                 


Ser Glu Gly Val Gly Thr Ser Leu Lys His Phe Ala Ala Asn Asn Gln 
145                 150                 155                 160 


Glu His Arg Arg Met Ser Thr Asp Ala Ile Val Asp Glu Arg Thr Leu 
                165                 170                 175     


Arg Glu Ile Tyr Leu Ala Ser Phe Glu Asn Ala Val Lys Lys Ala Gln 
            180                 185                 190         


Pro Trp Thr Val Met Cys Ala Tyr Asn Lys Val Asn Gly Asp Phe Ala 
        195                 200                 205             


Ser Glu Asn Lys Thr Leu Leu Thr Asp Ile Leu Arg Asp Glu Trp Gly 
    210                 215                 220                 


Phe Glu Gly Ile Val Val Ser Asp Trp Gly Ala Val Asn Glu Pro Val 
225                 230                 235                 240 


Asp Gly Leu Asn Ala Gly Leu Asp Leu Glu Met Pro Ser Ser Ser Gly 
                245                 250                 255     


Ile Gly Glu Lys Lys Ile Ile Asn Ala Val Arg Asn Gly Gln Leu Leu 
            260                 265                 270         


Glu Asp Lys Leu Asp Gln Ala Val Glu Arg Ile Leu Arg Ile Ile Leu 
        275                 280                 285             


Met Ala Val Glu Asn Lys Lys Glu Thr Ala Asp Tyr Asp Lys Glu Gln 
    290                 295                 300                 


His His Lys Leu Ala Arg Lys Ala Ala Ser Glu Ser Met Val Leu Leu 
305                 310                 315                 320 


Lys Asn Glu Asp Asn Ile Leu Pro Leu Lys Lys Glu Gly Thr Ile Ser 
                325                 330                 335     


Ile Ile Gly Ser Phe Ala Lys Lys Pro Arg Tyr Gln Gly Gly Gly Ser 
            340                 345                 350         


Ser His Ile Asn Pro Thr Lys Leu Glu Asn Ile Tyr Glu Glu Ile Glu 
        355                 360                 365             


Lys Thr Ala Gly Gln Asn Val Asn Val Leu Tyr Ala Glu Gly Tyr His 
    370                 375                 380                 


Leu Glu Lys Asp Leu Ile Asp Asp Gln Leu Ile Glu Glu Ala Lys Lys 
385                 390                 395                 400 


Thr Ala Ala Lys Ser Asp Val Thr Val Leu Phe Val Gly Leu Pro Asp 
                405                 410                 415     


Arg Tyr Glu Ser Glu Gly Tyr Asp Arg Glu His Leu Asn Ile Pro Glu 
            420                 425                 430         


Asn His Arg Leu Leu Val Glu Ala Val Ala Glu Val Gln Lys Asn Ile 
        435                 440                 445             


Val Val Val Leu Ser Asn Gly Ala Pro Leu Val Met Pro Trp Leu Asp 
    450                 455                 460                 


Lys Val Lys Gly Leu Leu Glu Ser Tyr Leu Gly Gly Gln Ala Leu Gly 
465                 470                 475                 480 


Gly Ala Ile Ala Asp Ile Leu Phe Gly Glu Val Asn Pro Ser Gly Lys 
                485                 490                 495     


Leu Ala Glu Thr Phe Pro Val Lys Leu Gly Asp Asn Pro Ser Tyr Leu 
            500                 505                 510         


Asn Phe Pro Gly Glu Arg Asp Lys Val Glu Tyr Lys Glu Gly Ile Phe 
        515                 520                 525             


Val Gly Tyr Arg Tyr Tyr Asp Thr Lys Gln Ile Glu Pro Leu Phe Pro 
    530                 535                 540                 


Phe Gly Tyr Gly Leu Ser Tyr Thr Asn Phe Glu Tyr Lys Asn Leu Val 
545                 550                 555                 560 


Ile Asp Lys Lys Glu Ile Lys Asp Thr Glu Ile Val Thr Val Thr Val 
                565                 570                 575     


Asn Val Lys Asn Thr Gly Lys Val Pro Gly Lys Glu Ile Ile Gln Leu 
            580                 585                 590         


Tyr Val Lys Asp Ile Lys Ser Ser Val Val Arg Pro Glu Lys Glu Leu 
        595                 600                 605             


Lys Gly Phe Gly Lys Val Ser Leu Gln Pro Gly Glu Asp Lys Thr Ile 
    610                 615                 620                 


Ser Phe Lys Leu Asp Lys Arg Ala Phe Ala Tyr Tyr Asn Thr Glu Leu 
625                 630                 635                 640 


Lys Asp Trp Tyr Val Glu Ser Gly Glu Phe Glu Ile Leu Val Gly Lys 
                645                 650                 655     


Ser Ser Arg Glu Ile Glu Leu Thr Glu Lys Ile Met Val His Ser Thr 
            660                 665                 670         


Ser Pro Val Phe Leu Glu Val His Arg Asn Ser Thr Val Gly Asp Leu 
        675                 680                 685             


Leu Thr Asp Pro Ile Leu Gly Glu Lys Ala Asn Ala Leu Ile Arg Glu 
    690                 695                 700                 


Leu Thr Lys Gly Ser Pro Leu Phe Asp Ala Gly Ser Asp His Gly Glu 
705                 710                 715                 720 


Gly Ala Glu Met Met Glu Ala Met Leu Lys Tyr Met Pro Leu Arg Ala 
                725                 730                 735     


Leu Met Asn Phe Ser Gly Gly Asp Ile Thr Glu Glu Lys Leu Thr Glu 
            740                 745                 750         


Phe Ile Lys Glu Leu Asn Ser Thr Asn Phe Val Ser Leu 
        755                 760                 765 


<210> 313
<211> 2166
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 313
atgaagacca aggctgtagt gctctctcta ctcctgctgc tctccatgtt cggtcccatg     60

ggggcagaga gagcctatgg aactactgac tctacccagg ggcctggagt ttactacaag    120

gttgttgggg ataccatata catggtaaac ctcagcagtg gaactgagaa gccaatacac    180

ctttacggcg tcaactggtt cggttttgaa acgcccaacc atgtcgttca tggattatgg    240

tcgaggaact gggtggacat gctccagcag ataaagtccc tcggctttaa cgccatcagg    300

cttcccttct gccccgcgtc ccttgacccg tcaatctatc ccacgggcat cgactactct    360

aagaaccccg acctcaaggg actttccagc ccggagataa tggaaaggat cattaagaag    420

gccggcgatc tgggaatctt cgtcctcctc gactttcaca ggattggatg caactacata    480

gaacccctct ggtatactga ctccttcagc gagcaggact acatagacac gtgggtgaga    540

gttgccaaga agtttggaaa gtactggaac gtcatagggg cggacatcaa aaacgagccc    600

cacagtgaga gccaggttcc gtccgcatat accgacggta agggtgcaac ctggggtatg    660

ggtaacgagg caacggactg gaacctcgcc gctgagagga ttggaagggc aatcctacag    720

gttgcacccc actggttgat attcgtcgag ggaacccagt ttacgaaccc tgaaactgac    780

ggggcctaca agtggggcta caacgcgtgg tggggcggaa accttatggc ggtcagggat    840

tacccgataa accttccacg ggagaaactc gtctacagcc cacacgtcta tgggcctgac    900

gtttacaacc agccctattt caacgaagac gacttcccaa acaacatgcc tgacatctgg    960

taccaccact tcggctacgt gaagaccgac ctcggctatc cggttgtcat aggggaattc   1020

gggggaaaat acggccacgg tggaagcgac aaggatccgg tatggcagaa ggccctagtt   1080

gactggatga taaagaacaa tttctgcgac ttcttctact ggagctggaa tccgaacagc   1140

ggagacaccg gaggaattct ccaagatgac tggactcaca tatgggatga taagtacagg   1200

aacctgaaga ggttgatgga tcactgttct ggaagcgact cttccagctc cccgtcggga   1260

gataccggct ctggaaactc gacgccttcc tctaacgata cttccagtga accaaacgcc   1320

tccaacttca atgtgatcaa actcctgcca acatcatccc agtatgaggg ggcttcatcc   1380

acggtgacct gtgacggaac gaagtgttcc tccagcgtct ggggaactcc gaacctctgg   1440

ggcgttgtcc agataggaaa cgcgagcatc gacccgaacg tctggggctg ggaggatctc   1500

tatcagaccg atccagagaa aataggaacc ggaacctcaa agatgtggat tgaaaaggga   1560

attcttcacg tggacaaccg ctggacgata aagaccagtc ccttgtacaa cgttatggcc   1620

tacaacgagg ttatctacgg aagcaaaccc tgggggaacc agccggtgaa cgctcccggc   1680

tttgaacttc cgatggactt taattctctc ccgaggatac tcgttggcgt gaactacacc   1740

ctgacgaagg gaattccagg caacaacttc gccttcgagg catggctctt caaggacacc   1800

aacagcaacc gcgcccccgg tgaaggcgac tacgagataa tggttcagct ctacatcaaa   1860

aacggctttc cggccggtta cgaacacgga cccgtcgcta cctttaacgt tccgatggta   1920

gtcaacggca ccctgataaa ccagaccttt gagctttacg acgttgtcgc cgatcccggc   1980

tggaggttcc tgacgtttaa atcaaccaaa aactacgtca acgcgagcgt tgtttttgac   2040

tacacccact tcatcgagat ggccaacgac tatctaaacg actcgctgaa cagtacttac   2100

ctcatgtcct tggagtttgg aaccgagatc caaagaattc aaaaagcttc tcgagagtac   2160

ttctag                                                              2166

<210> 314
<211> 721
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (49)...(384)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (559)...(714)
<223> Glycosyl hydrolase family 12

<220> 
<221> SITE
<222> (50)...(53)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (438)...(441)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (445)...(448)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (494)...(497)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (586)...(589)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (651)...(654)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (656)...(659)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (684)...(687)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (703)...(706)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (707)...(710)
<223> N-glycosylation site. Prosite id = PS00001

<400> 314
Met Lys Thr Lys Ala Val Val Leu Ser Leu Leu Leu Leu Leu Ser Met 
1               5                   10                  15      


Phe Gly Pro Met Gly Ala Glu Arg Ala Tyr Gly Thr Thr Asp Ser Thr 
            20                  25                  30          


Gln Gly Pro Gly Val Tyr Tyr Lys Val Val Gly Asp Thr Ile Tyr Met 
        35                  40                  45              


Val Asn Leu Ser Ser Gly Thr Glu Lys Pro Ile His Leu Tyr Gly Val 
    50                  55                  60                  


Asn Trp Phe Gly Phe Glu Thr Pro Asn His Val Val His Gly Leu Trp 
65                  70                  75                  80  


Ser Arg Asn Trp Val Asp Met Leu Gln Gln Ile Lys Ser Leu Gly Phe 
                85                  90                  95      


Asn Ala Ile Arg Leu Pro Phe Cys Pro Ala Ser Leu Asp Pro Ser Ile 
            100                 105                 110         


Tyr Pro Thr Gly Ile Asp Tyr Ser Lys Asn Pro Asp Leu Lys Gly Leu 
        115                 120                 125             


Ser Ser Pro Glu Ile Met Glu Arg Ile Ile Lys Lys Ala Gly Asp Leu 
    130                 135                 140                 


Gly Ile Phe Val Leu Leu Asp Phe His Arg Ile Gly Cys Asn Tyr Ile 
145                 150                 155                 160 


Glu Pro Leu Trp Tyr Thr Asp Ser Phe Ser Glu Gln Asp Tyr Ile Asp 
                165                 170                 175     


Thr Trp Val Arg Val Ala Lys Lys Phe Gly Lys Tyr Trp Asn Val Ile 
            180                 185                 190         


Gly Ala Asp Ile Lys Asn Glu Pro His Ser Glu Ser Gln Val Pro Ser 
        195                 200                 205             


Ala Tyr Thr Asp Gly Lys Gly Ala Thr Trp Gly Met Gly Asn Glu Ala 
    210                 215                 220                 


Thr Asp Trp Asn Leu Ala Ala Glu Arg Ile Gly Arg Ala Ile Leu Gln 
225                 230                 235                 240 


Val Ala Pro His Trp Leu Ile Phe Val Glu Gly Thr Gln Phe Thr Asn 
                245                 250                 255     


Pro Glu Thr Asp Gly Ala Tyr Lys Trp Gly Tyr Asn Ala Trp Trp Gly 
            260                 265                 270         


Gly Asn Leu Met Ala Val Arg Asp Tyr Pro Ile Asn Leu Pro Arg Glu 
        275                 280                 285             


Lys Leu Val Tyr Ser Pro His Val Tyr Gly Pro Asp Val Tyr Asn Gln 
    290                 295                 300                 


Pro Tyr Phe Asn Glu Asp Asp Phe Pro Asn Asn Met Pro Asp Ile Trp 
305                 310                 315                 320 


Tyr His His Phe Gly Tyr Val Lys Thr Asp Leu Gly Tyr Pro Val Val 
                325                 330                 335     


Ile Gly Glu Phe Gly Gly Lys Tyr Gly His Gly Gly Ser Asp Lys Asp 
            340                 345                 350         


Pro Val Trp Gln Lys Ala Leu Val Asp Trp Met Ile Lys Asn Asn Phe 
        355                 360                 365             


Cys Asp Phe Phe Tyr Trp Ser Trp Asn Pro Asn Ser Gly Asp Thr Gly 
    370                 375                 380                 


Gly Ile Leu Gln Asp Asp Trp Thr His Ile Trp Asp Asp Lys Tyr Arg 
385                 390                 395                 400 


Asn Leu Lys Arg Leu Met Asp His Cys Ser Gly Ser Asp Ser Ser Ser 
                405                 410                 415     


Ser Pro Ser Gly Asp Thr Gly Ser Gly Asn Ser Thr Pro Ser Ser Asn 
            420                 425                 430         


Asp Thr Ser Ser Glu Pro Asn Ala Ser Asn Phe Asn Val Ile Lys Leu 
        435                 440                 445             


Leu Pro Thr Ser Ser Gln Tyr Glu Gly Ala Ser Ser Thr Val Thr Cys 
    450                 455                 460                 


Asp Gly Thr Lys Cys Ser Ser Ser Val Trp Gly Thr Pro Asn Leu Trp 
465                 470                 475                 480 


Gly Val Val Gln Ile Gly Asn Ala Ser Ile Asp Pro Asn Val Trp Gly 
                485                 490                 495     


Trp Glu Asp Leu Tyr Gln Thr Asp Pro Glu Lys Ile Gly Thr Gly Thr 
            500                 505                 510         


Ser Lys Met Trp Ile Glu Lys Gly Ile Leu His Val Asp Asn Arg Trp 
        515                 520                 525             


Thr Ile Lys Thr Ser Pro Leu Tyr Asn Val Met Ala Tyr Asn Glu Val 
    530                 535                 540                 


Ile Tyr Gly Ser Lys Pro Trp Gly Asn Gln Pro Val Asn Ala Pro Gly 
545                 550                 555                 560 


Phe Glu Leu Pro Met Asp Phe Asn Ser Leu Pro Arg Ile Leu Val Gly 
                565                 570                 575     


Val Asn Tyr Thr Leu Thr Lys Gly Ile Pro Gly Asn Asn Phe Ala Phe 
            580                 585                 590         


Glu Ala Trp Leu Phe Lys Asp Thr Asn Ser Asn Arg Ala Pro Gly Glu 
        595                 600                 605             


Gly Asp Tyr Glu Ile Met Val Gln Leu Tyr Ile Lys Asn Gly Phe Pro 
    610                 615                 620                 


Ala Gly Tyr Glu His Gly Pro Val Ala Thr Phe Asn Val Pro Met Val 
625                 630                 635                 640 


Val Asn Gly Thr Leu Ile Asn Gln Thr Phe Glu Leu Tyr Asp Val Val 
                645                 650                 655     


Ala Asp Pro Gly Trp Arg Phe Leu Thr Phe Lys Ser Thr Lys Asn Tyr 
            660                 665                 670         


Val Asn Ala Ser Val Val Phe Asp Tyr Thr His Phe Ile Glu Met Ala 
        675                 680                 685             


Asn Asp Tyr Leu Asn Asp Ser Leu Asn Ser Thr Tyr Leu Met Ser Leu 
    690                 695                 700                 


Glu Phe Gly Thr Glu Ile Gln Arg Ile Gln Lys Ala Ser Arg Glu Tyr 
705                 710                 715                 720 


Phe 
    


<210> 315
<211> 1032
<212> DNA
<213> Clostridium thermocellum

<400> 315
atggtgagtt ttaaagcagg tataaattta ggcggatgga tatcacaata tcaagttttc     60

agcaaagagc atttcgatac attcattacg gagaaggaca ttgaaactat tgcagaagca    120

gggtttgacc atgtcagact gccttttgat tatccaatta tcgagtctga tgacaatgtg    180

ggagaatata aagaagatgg gctttcttat attgaccggt gccttgagtg gtgtaaaaaa    240

tacaatttgg ggcttgtgtt ggatatgcat cacgctcccg ggtaccgctt tcaagatttt    300

aagacaagca ccttgtttga agatccgaac cagcaaaaga gatttgttga catatggaga    360

tttttagcca agcgttacat aaatgaacgg gaacatattg cctttgaact gttaaatgaa    420

gttgttgagc ctgacagtac ccgctggaac aagttgatgc ttgagtgtgt aaaagcaatc    480

agggaaattg attccaccag gtggctttac attgggggca ataactataa cagtcctgat    540

gagcttaaaa accttgcaga tattgatgat gattacatag tttacaattt ccatttttac    600

aatccttttt tctttacgca tcagaaagcc cactggtcgg aaagtgccat ggcgtacaac    660

aggactgtaa aatatccggg acaatatgag ggaattgaag agtttgtgaa aaataatcct    720

aagtacagtt ttatgatgga attgaataac ctgaagctga ataaagagct tttgcgcaaa    780

gatttaaaac cagcaattga gttcagggaa aagaaaaaat gcaaactata ttgcggggag    840

tttggcgtaa ttgccattgc tgacctggag tccaggataa aatggcatga agattatata    900

agtcttctag aggagtatga tatcggcggc gcggtgtgga actacaaaaa aatggatttt    960

gaaatttata atgaggatag aaaacctgtc tcgcaagaat tggtaaatat actggcgaga   1020

agaaaaactt ga                                                       1032

<210> 316
<211> 343
<212> PRT
<213> Clostridium thermocellum

<220> 
<221> DOMAIN
<222> (1)...(323)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (15)...(32)
<223> Cytosolic fatty-acid binding proteins signature. Prosite id = PS00214

<220> 
<221> SITE
<222> (135)...(144)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (223)...(226)
<223> N-glycosylation site. Prosite id = PS00001

<400> 316
Met Val Ser Phe Lys Ala Gly Ile Asn Leu Gly Gly Trp Ile Ser Gln 
1               5                   10                  15      


Tyr Gln Val Phe Ser Lys Glu His Phe Asp Thr Phe Ile Thr Glu Lys 
            20                  25                  30          


Asp Ile Glu Thr Ile Ala Glu Ala Gly Phe Asp His Val Arg Leu Pro 
        35                  40                  45              


Phe Asp Tyr Pro Ile Ile Glu Ser Asp Asp Asn Val Gly Glu Tyr Lys 
    50                  55                  60                  


Glu Asp Gly Leu Ser Tyr Ile Asp Arg Cys Leu Glu Trp Cys Lys Lys 
65                  70                  75                  80  


Tyr Asn Leu Gly Leu Val Leu Asp Met His His Ala Pro Gly Tyr Arg 
                85                  90                  95      


Phe Gln Asp Phe Lys Thr Ser Thr Leu Phe Glu Asp Pro Asn Gln Gln 
            100                 105                 110         


Lys Arg Phe Val Asp Ile Trp Arg Phe Leu Ala Lys Arg Tyr Ile Asn 
        115                 120                 125             


Glu Arg Glu His Ile Ala Phe Glu Leu Leu Asn Glu Val Val Glu Pro 
    130                 135                 140                 


Asp Ser Thr Arg Trp Asn Lys Leu Met Leu Glu Cys Val Lys Ala Ile 
145                 150                 155                 160 


Arg Glu Ile Asp Ser Thr Arg Trp Leu Tyr Ile Gly Gly Asn Asn Tyr 
                165                 170                 175     


Asn Ser Pro Asp Glu Leu Lys Asn Leu Ala Asp Ile Asp Asp Asp Tyr 
            180                 185                 190         


Ile Val Tyr Asn Phe His Phe Tyr Asn Pro Phe Phe Phe Thr His Gln 
        195                 200                 205             


Lys Ala His Trp Ser Glu Ser Ala Met Ala Tyr Asn Arg Thr Val Lys 
    210                 215                 220                 


Tyr Pro Gly Gln Tyr Glu Gly Ile Glu Glu Phe Val Lys Asn Asn Pro 
225                 230                 235                 240 


Lys Tyr Ser Phe Met Met Glu Leu Asn Asn Leu Lys Leu Asn Lys Glu 
                245                 250                 255     


Leu Leu Arg Lys Asp Leu Lys Pro Ala Ile Glu Phe Arg Glu Lys Lys 
            260                 265                 270         


Lys Cys Lys Leu Tyr Cys Gly Glu Phe Gly Val Ile Ala Ile Ala Asp 
        275                 280                 285             


Leu Glu Ser Arg Ile Lys Trp His Glu Asp Tyr Ile Ser Leu Leu Glu 
    290                 295                 300                 


Glu Tyr Asp Ile Gly Gly Ala Val Trp Asn Tyr Lys Lys Met Asp Phe 
305                 310                 315                 320 


Glu Ile Tyr Asn Glu Asp Arg Lys Pro Val Ser Gln Glu Leu Val Asn 
                325                 330                 335     


Ile Leu Ala Arg Arg Lys Thr 
            340             


<210> 317
<211> 1374
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 317
atgaggggat ctctcagagc tatgctagat tacagcgata aggcagaact aggacagcct     60

ttaagtaata ctagtataca gctagatagc ttgcctgtat actatgaggt tagaggagac    120

actatttaca tgatcaacat gacaaacggt gaggagaaac cgatccactt gttcggagta    180

aactggtttg ggtttgagac tccagatcat gttgtccatg ggctctgggc tagaaactgg    240

gaggacatgt taatccaaat aaagagcctc ggatttaacg ctataaggct gccattctgc    300

acagagtcag tgcagcccgg aactatgccc gcaacaatag actatagtaa gaatccagac    360

ctccaggggc ttactagctt ggaaataatg gagaagatcg ttcagaaagc cggggaacta    420

ggcatattca tcctattaga ctatcataga ataggatgcc aatacatcga gccattatgg    480

tacactgata cgttcaccga gcaagattat ataaacactt ggataagcgt tgcagagagg    540

ttcggaaagt attggaatgt tataggagcg gatctaaaaa acgagcctca tagcattagt    600

cagcctccag gagcctatac cgatggtaca ggcgccacct gggggatggg caataacgct    660

acagactgga atctggcggc tgagaggatc ggaagggcta tactagaagt agctccccac    720

tggctgatat tcgttgaggg aacacagtac acgaggcccg acatcgacgg ctcctaccag    780

tggggctata atgcatggtg gggcggcaat ttgatggctg tcagagacta cccggttaac    840

ctgccaagga ataagctggt atacagcccc cacgtctatg ggcctgacgt ctacgaccag    900

ccgtacttca gcgatccaaa cttccccaac aacatgccag acatctggta ccatcacttc    960

ggctacgtta agatagacct gggatacccg gtagtcatag gagagtttgg aggaagatac   1020

ggtcatggcg gcgatcctag agacgtcgcc tggcagaaca agatagttga ctggatgata   1080

gagaacaact tctgcagctt cttctactgg agctggaacc ctaacagtgg cgatacagga   1140

ggcatactac aagacgactg gacaaacata tggcaggata aatatgacaa tctgaagagg   1200

cttatggacc actgctcggc gcagcaaggg ttgccggatg tatatctctc cgtcaacgcc   1260

acgacggtaa gtcccgggga tccaaagaat tcaaaaagct tctcgagagt acttctagag   1320

cggccgcggg cccatcgatt ttccacccgg gtggggtacc aggtaagtgt accc         1374

<210> 318
<211> 458
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (45)...(380)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (23)...(26)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (46)...(49)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (222)...(225)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (425)...(428)
<223> N-glycosylation site. Prosite id = PS00001

<400> 318
Met Arg Gly Ser Leu Arg Ala Met Leu Asp Tyr Ser Asp Lys Ala Glu 
1               5                   10                  15      


Leu Gly Gln Pro Leu Ser Asn Thr Ser Ile Gln Leu Asp Ser Leu Pro 
            20                  25                  30          


Val Tyr Tyr Glu Val Arg Gly Asp Thr Ile Tyr Met Ile Asn Met Thr 
        35                  40                  45              


Asn Gly Glu Glu Lys Pro Ile His Leu Phe Gly Val Asn Trp Phe Gly 
    50                  55                  60                  


Phe Glu Thr Pro Asp His Val Val His Gly Leu Trp Ala Arg Asn Trp 
65                  70                  75                  80  


Glu Asp Met Leu Ile Gln Ile Lys Ser Leu Gly Phe Asn Ala Ile Arg 
                85                  90                  95      


Leu Pro Phe Cys Thr Glu Ser Val Gln Pro Gly Thr Met Pro Ala Thr 
            100                 105                 110         


Ile Asp Tyr Ser Lys Asn Pro Asp Leu Gln Gly Leu Thr Ser Leu Glu 
        115                 120                 125             


Ile Met Glu Lys Ile Val Gln Lys Ala Gly Glu Leu Gly Ile Phe Ile 
    130                 135                 140                 


Leu Leu Asp Tyr His Arg Ile Gly Cys Gln Tyr Ile Glu Pro Leu Trp 
145                 150                 155                 160 


Tyr Thr Asp Thr Phe Thr Glu Gln Asp Tyr Ile Asn Thr Trp Ile Ser 
                165                 170                 175     


Val Ala Glu Arg Phe Gly Lys Tyr Trp Asn Val Ile Gly Ala Asp Leu 
            180                 185                 190         


Lys Asn Glu Pro His Ser Ile Ser Gln Pro Pro Gly Ala Tyr Thr Asp 
        195                 200                 205             


Gly Thr Gly Ala Thr Trp Gly Met Gly Asn Asn Ala Thr Asp Trp Asn 
    210                 215                 220                 


Leu Ala Ala Glu Arg Ile Gly Arg Ala Ile Leu Glu Val Ala Pro His 
225                 230                 235                 240 


Trp Leu Ile Phe Val Glu Gly Thr Gln Tyr Thr Arg Pro Asp Ile Asp 
                245                 250                 255     


Gly Ser Tyr Gln Trp Gly Tyr Asn Ala Trp Trp Gly Gly Asn Leu Met 
            260                 265                 270         


Ala Val Arg Asp Tyr Pro Val Asn Leu Pro Arg Asn Lys Leu Val Tyr 
        275                 280                 285             


Ser Pro His Val Tyr Gly Pro Asp Val Tyr Asp Gln Pro Tyr Phe Ser 
    290                 295                 300                 


Asp Pro Asn Phe Pro Asn Asn Met Pro Asp Ile Trp Tyr His His Phe 
305                 310                 315                 320 


Gly Tyr Val Lys Ile Asp Leu Gly Tyr Pro Val Val Ile Gly Glu Phe 
                325                 330                 335     


Gly Gly Arg Tyr Gly His Gly Gly Asp Pro Arg Asp Val Ala Trp Gln 
            340                 345                 350         


Asn Lys Ile Val Asp Trp Met Ile Glu Asn Asn Phe Cys Ser Phe Phe 
        355                 360                 365             


Tyr Trp Ser Trp Asn Pro Asn Ser Gly Asp Thr Gly Gly Ile Leu Gln 
    370                 375                 380                 


Asp Asp Trp Thr Asn Ile Trp Gln Asp Lys Tyr Asp Asn Leu Lys Arg 
385                 390                 395                 400 


Leu Met Asp His Cys Ser Ala Gln Gln Gly Leu Pro Asp Val Tyr Leu 
                405                 410                 415     


Ser Val Asn Ala Thr Thr Val Ser Pro Gly Asp Pro Lys Asn Ser Lys 
            420                 425                 430         


Ser Phe Ser Arg Val Leu Leu Glu Arg Pro Arg Ala His Arg Phe Ser 
        435                 440                 445             


Thr Arg Val Gly Tyr Gln Val Ser Val Pro 
    450                 455             


<210> 319
<211> 1347
<212> DNA
<213> Clostridium thermocellum

<400> 319
atgtcaaaga taactttccc aaaagatttc atatggggtt ctgcaacagc agcatatcag     60

attgaaggtg catacaacga agacggcaaa ggtgaatcta tatgggaccg tttttcccac    120

acgccaggaa atatagcaga cggacatacc ggcgatgttg catgcgacca ctatcatcgt    180

tatgaagaag atatcaaaat aatgaaagaa atcggtatta aatcatacag gttttccatc    240

tcatggccca gaatctttcc tgaaggaaca ggtaaattaa atcaaaaggg actggatttt    300

tacaaaaggc tcacaaatct gcttctggaa aacggaatta tgcctgcaat cactctttat    360

cactgggacc ttccccaaaa gcttcaggat aaaggcggat ggaaaaaccg ggacaccacc    420

gattatttta cagaatactc tgaagtaata tttaaaaatc tcggagatat cgttccaata    480

tggtttactc acaatgaacc cggtgttgtt tctttgcttg gccacttttt aggaattcat    540

gcccctggga taaaagacct ccgcacttca ttggaagtct cgcacaatct tcttttgtcc    600

cacggcaagg ccgtgaaact gtttagagaa atgaatattg acgcccaaat tggaatagct    660

ctcaatttat cttaccatta tcccgcatcc gaaaaagctg aggatattga agcagcggaa    720

ttgtcatttt ctctggcggg aaggtggtat ctggatcctg tgctaaaagg ccggtatcct    780

gaaaacgcat tgaaacttta taaaaagaag ggtattgagc tttctttccc tgaagatgac    840

ctgaaactta tcagtcagcc aatagacttc atagcattca acaattattc ttcggaattt    900

ataaaatatg atccgtccag tgagtcaggt ttttcacctg caaactccat attagaaaag    960

ttcgaaaaaa cagatatggg ctggatcata tatcctgaag gcttgtatga tctgcttatg   1020

ctccttgaca gggattatgg aaagccaaac attgttatca gcgaaaacgg agccgccttc   1080

aaagatgaaa taggtagcaa cggaaagata gaagacacaa agagaatcca atatcttaaa   1140

gattatctga cccaggctca cagggcaatt caggacggtg taaacttaaa agcatactac   1200

ttgtggtcgc ttttggacaa ctttgaatgg gcttacgggt acaacaagag attcggaatc   1260

gttcacgtaa attttgatac gttggaaaga aaaataaagg atagcggcta ctggtacaaa   1320

gaagtaatca aaaacaacgg tttttaa                                       1347

<210> 320
<211> 448
<212> PRT
<213> Clostridium thermocellum

<220> 
<221> DOMAIN
<222> (2)...(448)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (225)...(228)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (299)...(302)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (356)...(364)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 320
Met Ser Lys Ile Thr Phe Pro Lys Asp Phe Ile Trp Gly Ser Ala Thr 
1               5                   10                  15      


Ala Ala Tyr Gln Ile Glu Gly Ala Tyr Asn Glu Asp Gly Lys Gly Glu 
            20                  25                  30          


Ser Ile Trp Asp Arg Phe Ser His Thr Pro Gly Asn Ile Ala Asp Gly 
        35                  40                  45              


His Thr Gly Asp Val Ala Cys Asp His Tyr His Arg Tyr Glu Glu Asp 
    50                  55                  60                  


Ile Lys Ile Met Lys Glu Ile Gly Ile Lys Ser Tyr Arg Phe Ser Ile 
65                  70                  75                  80  


Ser Trp Pro Arg Ile Phe Pro Glu Gly Thr Gly Lys Leu Asn Gln Lys 
                85                  90                  95      


Gly Leu Asp Phe Tyr Lys Arg Leu Thr Asn Leu Leu Leu Glu Asn Gly 
            100                 105                 110         


Ile Met Pro Ala Ile Thr Leu Tyr His Trp Asp Leu Pro Gln Lys Leu 
        115                 120                 125             


Gln Asp Lys Gly Gly Trp Lys Asn Arg Asp Thr Thr Asp Tyr Phe Thr 
    130                 135                 140                 


Glu Tyr Ser Glu Val Ile Phe Lys Asn Leu Gly Asp Ile Val Pro Ile 
145                 150                 155                 160 


Trp Phe Thr His Asn Glu Pro Gly Val Val Ser Leu Leu Gly His Phe 
                165                 170                 175     


Leu Gly Ile His Ala Pro Gly Ile Lys Asp Leu Arg Thr Ser Leu Glu 
            180                 185                 190         


Val Ser His Asn Leu Leu Leu Ser His Gly Lys Ala Val Lys Leu Phe 
        195                 200                 205             


Arg Glu Met Asn Ile Asp Ala Gln Ile Gly Ile Ala Leu Asn Leu Ser 
    210                 215                 220                 


Tyr His Tyr Pro Ala Ser Glu Lys Ala Glu Asp Ile Glu Ala Ala Glu 
225                 230                 235                 240 


Leu Ser Phe Ser Leu Ala Gly Arg Trp Tyr Leu Asp Pro Val Leu Lys 
                245                 250                 255     


Gly Arg Tyr Pro Glu Asn Ala Leu Lys Leu Tyr Lys Lys Lys Gly Ile 
            260                 265                 270         


Glu Leu Ser Phe Pro Glu Asp Asp Leu Lys Leu Ile Ser Gln Pro Ile 
        275                 280                 285             


Asp Phe Ile Ala Phe Asn Asn Tyr Ser Ser Glu Phe Ile Lys Tyr Asp 
    290                 295                 300                 


Pro Ser Ser Glu Ser Gly Phe Ser Pro Ala Asn Ser Ile Leu Glu Lys 
305                 310                 315                 320 


Phe Glu Lys Thr Asp Met Gly Trp Ile Ile Tyr Pro Glu Gly Leu Tyr 
                325                 330                 335     


Asp Leu Leu Met Leu Leu Asp Arg Asp Tyr Gly Lys Pro Asn Ile Val 
            340                 345                 350         


Ile Ser Glu Asn Gly Ala Ala Phe Lys Asp Glu Ile Gly Ser Asn Gly 
        355                 360                 365             


Lys Ile Glu Asp Thr Lys Arg Ile Gln Tyr Leu Lys Asp Tyr Leu Thr 
    370                 375                 380                 


Gln Ala His Arg Ala Ile Gln Asp Gly Val Asn Leu Lys Ala Tyr Tyr 
385                 390                 395                 400 


Leu Trp Ser Leu Leu Asp Asn Phe Glu Trp Ala Tyr Gly Tyr Asn Lys 
                405                 410                 415     


Arg Phe Gly Ile Val His Val Asn Phe Asp Thr Leu Glu Arg Lys Ile 
            420                 425                 430         


Lys Asp Ser Gly Tyr Trp Tyr Lys Glu Val Ile Lys Asn Asn Gly Phe 
        435                 440                 445             


<210> 321
<211> 1362
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 321
atggcaaaca agataacctt tcctgaaaat tttctgtggg gcgcggcaac ggcttcgtac     60

cagatcgaag gcgcctggaa caaacatggt aaaggcgaat ccacctggga tcgcttttca    120

cacacgcccg gtaagatcag gaacaacgat acgggcgatg tagcaaatga ccattatcgc    180

ctctggaaaa aagacattgg cttgatgaag aagatcgggt tgaaggctta tcgattttcc    240

atttcgtggc cgcgtattct tcctgctgga agaggcaagg tcaatcaaag agggctggat    300

ttttacaaca agatcgtaga tgagctgctg aaagcagata tcatcccatt tgttactctc    360

aatcactggg acctgcccca aaaactggaa gatgagggcg gctggccggc ccgttctact    420

gccgatgctt ttattgaata cacagatgtg atcacccgct cccttggcga ccgcgcaaag    480

aattggatca ctcacaatga acctgccgtc gttgcctgga tgggatactc cactggccaa    540

cacgcacccg gactgaagga ctatgggctt ggtgcccgcg ccgcgcatca cctgttgctc    600

tcacatggac aggctgtacc ggtcattcgc agcaatagcg cgggggcaga agtgggaatt    660

acgctcgata ttagctggcg gatcgctgcc tcaaacagcc gcgccgaccg ggagctggtc    720

cgtgaggatg atgggaggtg gttccgctgg tttgccgacc cgctttacgg gcgcggatat    780

ccctccgata aggtgtctga tttcactaag ttgggagcac tgcccaacgg acttgatttt    840

gtgcaggcag gcgacatgga cacgatcgcg acaccgactg attttatggg gctaaactac    900

tactcccgaa atgtctaccg cgcggacggt gcagataatg atccgcaaac tgttttccca    960

caaccgaaga tgcccgaaca ctggaccgag atgggctggg aaatttaccc ggatgggctg   1020

accaacattc tgggacgcgt ctatttcaac tatcagccgc gcaaactata cgtcacagaa   1080

aacggcgcca gttactccac gcctcctgat gataagggga atgtcgcgga tgaactccgc   1140

atccattatc tgaggacaca ttttgcagct gcctatcggg ccattcaaat gggcgtgcct   1200

ctggcaggat acttcgtctg gtccctcatg gacaactttg agtggtcatg gggctatatg   1260

caacgctttg gactcatctg ggtggattat gagacccaaa aacgcacttt aaaggatagc   1320

gcaaaatggt ataagcgcgt gatcaagaag aatgggctct aa                      1362

<210> 322
<211> 453
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(453)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (11)...(25)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (49)...(52)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (361)...(369)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 322
Met Ala Asn Lys Ile Thr Phe Pro Glu Asn Phe Leu Trp Gly Ala Ala 
1               5                   10                  15      


Thr Ala Ser Tyr Gln Ile Glu Gly Ala Trp Asn Lys His Gly Lys Gly 
            20                  25                  30          


Glu Ser Thr Trp Asp Arg Phe Ser His Thr Pro Gly Lys Ile Arg Asn 
        35                  40                  45              


Asn Asp Thr Gly Asp Val Ala Asn Asp His Tyr Arg Leu Trp Lys Lys 
    50                  55                  60                  


Asp Ile Gly Leu Met Lys Lys Ile Gly Leu Lys Ala Tyr Arg Phe Ser 
65                  70                  75                  80  


Ile Ser Trp Pro Arg Ile Leu Pro Ala Gly Arg Gly Lys Val Asn Gln 
                85                  90                  95      


Arg Gly Leu Asp Phe Tyr Asn Lys Ile Val Asp Glu Leu Leu Lys Ala 
            100                 105                 110         


Asp Ile Ile Pro Phe Val Thr Leu Asn His Trp Asp Leu Pro Gln Lys 
        115                 120                 125             


Leu Glu Asp Glu Gly Gly Trp Pro Ala Arg Ser Thr Ala Asp Ala Phe 
    130                 135                 140                 


Ile Glu Tyr Thr Asp Val Ile Thr Arg Ser Leu Gly Asp Arg Ala Lys 
145                 150                 155                 160 


Asn Trp Ile Thr His Asn Glu Pro Ala Val Val Ala Trp Met Gly Tyr 
                165                 170                 175     


Ser Thr Gly Gln His Ala Pro Gly Leu Lys Asp Tyr Gly Leu Gly Ala 
            180                 185                 190         


Arg Ala Ala His His Leu Leu Leu Ser His Gly Gln Ala Val Pro Val 
        195                 200                 205             


Ile Arg Ser Asn Ser Ala Gly Ala Glu Val Gly Ile Thr Leu Asp Ile 
    210                 215                 220                 


Ser Trp Arg Ile Ala Ala Ser Asn Ser Arg Ala Asp Arg Glu Leu Val 
225                 230                 235                 240 


Arg Glu Asp Asp Gly Arg Trp Phe Arg Trp Phe Ala Asp Pro Leu Tyr 
                245                 250                 255     


Gly Arg Gly Tyr Pro Ser Asp Lys Val Ser Asp Phe Thr Lys Leu Gly 
            260                 265                 270         


Ala Leu Pro Asn Gly Leu Asp Phe Val Gln Ala Gly Asp Met Asp Thr 
        275                 280                 285             


Ile Ala Thr Pro Thr Asp Phe Met Gly Leu Asn Tyr Tyr Ser Arg Asn 
    290                 295                 300                 


Val Tyr Arg Ala Asp Gly Ala Asp Asn Asp Pro Gln Thr Val Phe Pro 
305                 310                 315                 320 


Gln Pro Lys Met Pro Glu His Trp Thr Glu Met Gly Trp Glu Ile Tyr 
                325                 330                 335     


Pro Asp Gly Leu Thr Asn Ile Leu Gly Arg Val Tyr Phe Asn Tyr Gln 
            340                 345                 350         


Pro Arg Lys Leu Tyr Val Thr Glu Asn Gly Ala Ser Tyr Ser Thr Pro 
        355                 360                 365             


Pro Asp Asp Lys Gly Asn Val Ala Asp Glu Leu Arg Ile His Tyr Leu 
    370                 375                 380                 


Arg Thr His Phe Ala Ala Ala Tyr Arg Ala Ile Gln Met Gly Val Pro 
385                 390                 395                 400 


Leu Ala Gly Tyr Phe Val Trp Ser Leu Met Asp Asn Phe Glu Trp Ser 
                405                 410                 415     


Trp Gly Tyr Met Gln Arg Phe Gly Leu Ile Trp Val Asp Tyr Glu Thr 
            420                 425                 430         


Gln Lys Arg Thr Leu Lys Asp Ser Ala Lys Trp Tyr Lys Arg Val Ile 
        435                 440                 445             


Lys Lys Asn Gly Leu 
    450             


<210> 323
<211> 1362
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 323
atggcgaaca aaattacctt tcccgaaaat tttctttggg gcgcggcaac agcctcctac     60

cagatcgaag gtgcgtggga caaacatggc aagggtgaat ccatctggga tcgcttttcg    120

catacccctg gcaagatcag aaataatgat acgggcgatg ttgccaatga tcattatcgt    180

ctctggaaaa aagacattgg cttgatgaag aagatcggct tgaaggcata tcgtttttcc    240

atttcgtggc cgcgtgttct tcccgccgga cgcggcaaag tcaatcagaa gggactggat    300

ttctataaca ggctggtaga tgctctgttg aaagaagata tcatcccatt tgtgactctc    360

aatcactggg acctgcccca aaagctggag gaggaaggcg gttggccggt tcgctccacc    420

gcagatgcct ttgtggaata cacagacgtg gtcacacgtt ccctcggcga ccgcgtaaag    480

aattggatca cgcataatga gcctgccgtc gttgcctgga tgggatattc cacaggtcaa    540

cacgcacccg gtttgaagga ctatgggctt ggtgtgcgcg ccgcgcatca tctgctgctc    600

tcccacgggc aggcggtgcc agtcatccgc agtaacagcg ccgatgcaga agtgggcatt    660

acgctggata ttagctggcg gattcctgcc tccaatagcc gagcagaccg ggaattggtc    720

cgtaaagatg acggactatg gttccgctgg ttcgccgatc cgctttatgg gcgcggatac    780

ccctcggata aagtcaccga ttttacaaag atcggcgcgc tgcccaatgg tctggacttt    840

atgcaagccg gtgatatgga tgcgatcgcc acgccaaccg atttcatggg gctgaactat    900

tatttccgaa atgtctaccg cgcgaatggc gaagacaatg atccgcaggt cgttttccca    960

caaccaaaga tgcccgaaca ctggacggag atgggctggg aaatctatcc ggatggactg   1020

acgaacatcc tgggacgcgt ttatttcaat taccagccac ataaactgta tatcacagag   1080

aacggcgcga gctactccac cccgcccgat gaaaagggga atgtcgccga tgagctccgc   1140

actcattatt tacggacaca cttcgcggct gcctaccggg cgattcagat gggcgtgcct   1200

ctggcaggat actttgtctg gtccctcatg gacaactttg agtggtcctg gggatatatg   1260

cagcgctttg ggctcatctg ggtggactac gagacacaga aacgcaccct gaaggatagc   1320

gccaagtggt acaaacgtgt gatcaggaag aatgggtttt ag                      1362

<210> 324
<211> 453
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(453)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (11)...(25)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (49)...(52)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (361)...(369)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 324
Met Ala Asn Lys Ile Thr Phe Pro Glu Asn Phe Leu Trp Gly Ala Ala 
1               5                   10                  15      


Thr Ala Ser Tyr Gln Ile Glu Gly Ala Trp Asp Lys His Gly Lys Gly 
            20                  25                  30          


Glu Ser Ile Trp Asp Arg Phe Ser His Thr Pro Gly Lys Ile Arg Asn 
        35                  40                  45              


Asn Asp Thr Gly Asp Val Ala Asn Asp His Tyr Arg Leu Trp Lys Lys 
    50                  55                  60                  


Asp Ile Gly Leu Met Lys Lys Ile Gly Leu Lys Ala Tyr Arg Phe Ser 
65                  70                  75                  80  


Ile Ser Trp Pro Arg Val Leu Pro Ala Gly Arg Gly Lys Val Asn Gln 
                85                  90                  95      


Lys Gly Leu Asp Phe Tyr Asn Arg Leu Val Asp Ala Leu Leu Lys Glu 
            100                 105                 110         


Asp Ile Ile Pro Phe Val Thr Leu Asn His Trp Asp Leu Pro Gln Lys 
        115                 120                 125             


Leu Glu Glu Glu Gly Gly Trp Pro Val Arg Ser Thr Ala Asp Ala Phe 
    130                 135                 140                 


Val Glu Tyr Thr Asp Val Val Thr Arg Ser Leu Gly Asp Arg Val Lys 
145                 150                 155                 160 


Asn Trp Ile Thr His Asn Glu Pro Ala Val Val Ala Trp Met Gly Tyr 
                165                 170                 175     


Ser Thr Gly Gln His Ala Pro Gly Leu Lys Asp Tyr Gly Leu Gly Val 
            180                 185                 190         


Arg Ala Ala His His Leu Leu Leu Ser His Gly Gln Ala Val Pro Val 
        195                 200                 205             


Ile Arg Ser Asn Ser Ala Asp Ala Glu Val Gly Ile Thr Leu Asp Ile 
    210                 215                 220                 


Ser Trp Arg Ile Pro Ala Ser Asn Ser Arg Ala Asp Arg Glu Leu Val 
225                 230                 235                 240 


Arg Lys Asp Asp Gly Leu Trp Phe Arg Trp Phe Ala Asp Pro Leu Tyr 
                245                 250                 255     


Gly Arg Gly Tyr Pro Ser Asp Lys Val Thr Asp Phe Thr Lys Ile Gly 
            260                 265                 270         


Ala Leu Pro Asn Gly Leu Asp Phe Met Gln Ala Gly Asp Met Asp Ala 
        275                 280                 285             


Ile Ala Thr Pro Thr Asp Phe Met Gly Leu Asn Tyr Tyr Phe Arg Asn 
    290                 295                 300                 


Val Tyr Arg Ala Asn Gly Glu Asp Asn Asp Pro Gln Val Val Phe Pro 
305                 310                 315                 320 


Gln Pro Lys Met Pro Glu His Trp Thr Glu Met Gly Trp Glu Ile Tyr 
                325                 330                 335     


Pro Asp Gly Leu Thr Asn Ile Leu Gly Arg Val Tyr Phe Asn Tyr Gln 
            340                 345                 350         


Pro His Lys Leu Tyr Ile Thr Glu Asn Gly Ala Ser Tyr Ser Thr Pro 
        355                 360                 365             


Pro Asp Glu Lys Gly Asn Val Ala Asp Glu Leu Arg Thr His Tyr Leu 
    370                 375                 380                 


Arg Thr His Phe Ala Ala Ala Tyr Arg Ala Ile Gln Met Gly Val Pro 
385                 390                 395                 400 


Leu Ala Gly Tyr Phe Val Trp Ser Leu Met Asp Asn Phe Glu Trp Ser 
                405                 410                 415     


Trp Gly Tyr Met Gln Arg Phe Gly Leu Ile Trp Val Asp Tyr Glu Thr 
            420                 425                 430         


Gln Lys Arg Thr Leu Lys Asp Ser Ala Lys Trp Tyr Lys Arg Val Ile 
        435                 440                 445             


Arg Lys Asn Gly Phe 
    450             


<210> 325
<211> 1362
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 325
atggcaaata aaattctctt ccccgagaac tttctctggg gcacggcgac cgcatcctac     60

cagatcgagg gggcttggga taaacatggt aagggcgagt cgacctggga ccgttttacg    120

catacacctg gaaagatcaa aaacaatgat acgggcgatg tagcagatga ccattatcga    180

ttatggaaaa aagatatcgg cttgatgaag aagctcggct tgaaggctta tcgtttttcg    240

acttcctggc cgcgggtgct gccggccggg cgcggtaaga gcaatcaaaa aggactcgat    300

ttctacagca agctggttga tgagttgcta aaagcaaata tcatcccatt cgtgacattg    360

aatcactggg acatcccaca aaagttggag gacgagggtg gctgggccgt gcgctcaacg    420

gctgaggcat ttgtggaata tgccgatctc atgtcgcgca cgcttggaga ccgcgtcaag    480

aactggatca cgcacaacga accggccgtc gtcgcctgga tgggatacgg gatgggcatc    540

cacgcgccgg gcttaacgga tttctcgatt gcggtgccgg tctcgcatca tctgctcctt    600

tcgcacggat gggccgtgcc tgtgattcgc ggtaacagcc cggatgccga ggtgggcatt    660

accctcaaca ttcaatgggg cgaagcagca tccaacagcc gggccgacct aaacgccctg    720

cgcctgaacg atggacagtg gttccgctgg tttgccgatc cggtttatgg ccgcggctat    780

ccttccgacg tggtggctga tttcgagaaa atgggcgcgc tgccgaacgg catgaatttc    840

gtgcaacctg gcgatatgga tgtcatcgcc acgccaaccg atttcctcgg gctcaattat    900

tattcccgcc atgtgcatcg cgtcaacaca ccggataacg atcaacaggt tgtgtttgcc    960

aaacagcagg gtcccgagaa ctggaccgag atgggctggg agatccatcc tgatggattg   1020

gccggaattt tatccagagc gtatttcaat taccagccgc gcaaagtata tgtgactgaa   1080

aacggtgcca gctattccac cgcgcccgat gagaatggta ttgtcaacga cattcaccgc   1140

gtcaattatc tacggacgca cttcgcggct gcccatcgcg ccctgcaggc gggcgtgcca   1200

ttggcaggat acttcgtctg gtcaatgctc gataacttcg aatggagtca cgggtacagc   1260

cagcgctttg gcatcgttta tgtggactat caaacccaga agcgttactt gaaagacagc   1320

gccaagtggt acaaaggtgt catcaaaaag aatgggttct aa                      1362

<210> 326
<211> 453
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(453)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (11)...(25)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (49)...(52)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (332)...(335)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (361)...(369)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 326
Met Ala Asn Lys Ile Leu Phe Pro Glu Asn Phe Leu Trp Gly Thr Ala 
1               5                   10                  15      


Thr Ala Ser Tyr Gln Ile Glu Gly Ala Trp Asp Lys His Gly Lys Gly 
            20                  25                  30          


Glu Ser Thr Trp Asp Arg Phe Thr His Thr Pro Gly Lys Ile Lys Asn 
        35                  40                  45              


Asn Asp Thr Gly Asp Val Ala Asp Asp His Tyr Arg Leu Trp Lys Lys 
    50                  55                  60                  


Asp Ile Gly Leu Met Lys Lys Leu Gly Leu Lys Ala Tyr Arg Phe Ser 
65                  70                  75                  80  


Thr Ser Trp Pro Arg Val Leu Pro Ala Gly Arg Gly Lys Ser Asn Gln 
                85                  90                  95      


Lys Gly Leu Asp Phe Tyr Ser Lys Leu Val Asp Glu Leu Leu Lys Ala 
            100                 105                 110         


Asn Ile Ile Pro Phe Val Thr Leu Asn His Trp Asp Ile Pro Gln Lys 
        115                 120                 125             


Leu Glu Asp Glu Gly Gly Trp Ala Val Arg Ser Thr Ala Glu Ala Phe 
    130                 135                 140                 


Val Glu Tyr Ala Asp Leu Met Ser Arg Thr Leu Gly Asp Arg Val Lys 
145                 150                 155                 160 


Asn Trp Ile Thr His Asn Glu Pro Ala Val Val Ala Trp Met Gly Tyr 
                165                 170                 175     


Gly Met Gly Ile His Ala Pro Gly Leu Thr Asp Phe Ser Ile Ala Val 
            180                 185                 190         


Pro Val Ser His His Leu Leu Leu Ser His Gly Trp Ala Val Pro Val 
        195                 200                 205             


Ile Arg Gly Asn Ser Pro Asp Ala Glu Val Gly Ile Thr Leu Asn Ile 
    210                 215                 220                 


Gln Trp Gly Glu Ala Ala Ser Asn Ser Arg Ala Asp Leu Asn Ala Leu 
225                 230                 235                 240 


Arg Leu Asn Asp Gly Gln Trp Phe Arg Trp Phe Ala Asp Pro Val Tyr 
                245                 250                 255     


Gly Arg Gly Tyr Pro Ser Asp Val Val Ala Asp Phe Glu Lys Met Gly 
            260                 265                 270         


Ala Leu Pro Asn Gly Met Asn Phe Val Gln Pro Gly Asp Met Asp Val 
        275                 280                 285             


Ile Ala Thr Pro Thr Asp Phe Leu Gly Leu Asn Tyr Tyr Ser Arg His 
    290                 295                 300                 


Val His Arg Val Asn Thr Pro Asp Asn Asp Gln Gln Val Val Phe Ala 
305                 310                 315                 320 


Lys Gln Gln Gly Pro Glu Asn Trp Thr Glu Met Gly Trp Glu Ile His 
                325                 330                 335     


Pro Asp Gly Leu Ala Gly Ile Leu Ser Arg Ala Tyr Phe Asn Tyr Gln 
            340                 345                 350         


Pro Arg Lys Val Tyr Val Thr Glu Asn Gly Ala Ser Tyr Ser Thr Ala 
        355                 360                 365             


Pro Asp Glu Asn Gly Ile Val Asn Asp Ile His Arg Val Asn Tyr Leu 
    370                 375                 380                 


Arg Thr His Phe Ala Ala Ala His Arg Ala Leu Gln Ala Gly Val Pro 
385                 390                 395                 400 


Leu Ala Gly Tyr Phe Val Trp Ser Met Leu Asp Asn Phe Glu Trp Ser 
                405                 410                 415     


His Gly Tyr Ser Gln Arg Phe Gly Ile Val Tyr Val Asp Tyr Gln Thr 
            420                 425                 430         


Gln Lys Arg Tyr Leu Lys Asp Ser Ala Lys Trp Tyr Lys Gly Val Ile 
        435                 440                 445             


Lys Lys Asn Gly Phe 
    450             


<210> 327
<211> 1398
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 327
atgccgatga gcacagaaac gacttttcct tctgatttca cctggggcgc agcaacagcc     60

gcctaccaga tcgaaggggg cgatcgcgct ggcgggcgcg gccgttccgt gtgggacatg    120

ttttgcgaga aacgaggagc tatttgggag gggcatacgg ggcagcgagc gagtctgcat    180

cttcagcgct ggcgtgagga cgtaatgttg atgcaacagc tcggactgcg gggctatcgt    240

tttagcgtca gctggccgcg cgtcttcccg acaggagtcg gcaaagtcaa ccgtgaaggg    300

ttggcctttt acgatcagct cgtagacgcc ttgctcgagg ccggcatcac cccctttata    360

acgctatttc attgggactt cccgctcgat ttgtaccacc gaggcggctg gttgaatcgc    420

gacagcgccg actggtttgc ctcctacgcc gagtgcctcg gcaaggcact gggcgacagg    480

gtcaagcact gggtgaccct caacgagccg caggttttca taggcctcgg tcattacgaa    540

gggcgtcatg ccccggggtt gaagctctcc atcgcggaaa tgctgcgctg cgggcaccac    600

gccttgctcg cgcacgggaa ggccgtgcaa gccctgcgcg cttccgtcga cggcccctgc    660

aagattggat ttgctccggt ggggattccc aagcttccgg cgagtgagag ctcagaggat    720

atcgccgcgg cacgaaaggc ccagttcgcg gcgggagcgc cgccgtattg gacgctgagc    780

tggtgggcgg atccggtgtt tcaggggaca tatcccgctg atgcctgcca ggctctcgga    840

gcggacgcgc cgcaggtggc cgatcacgac atgagcatca tcagcgagcc gactgatttc    900

ctgggcctca acctttatca aggggtggtg gtgcgtgccg atcacacggg tcaaccagaa    960

acggtgccgt ttccgccggg attccccgtg actgcgctca actgggccgt aaccccagag   1020

gcgctgtatt ggggcccgcg ctttgccttc gaacgctaca aaaagccgat tcacatcacg   1080

gaaaacgggc tatcctgtcg tgactggccg tcgctcgacg ggcacgtcca cgacgccgac   1140

cgcatcgact tcatggcccg gcacttgcgc gcagcgcatc gagccattcg cgatgggata   1200

ccgatcgaag gctacttcca ctggtctgcg atcgacaact tcgagtgggc agaaggctac   1260

aaggaacgct tcgggctcat ttacgtcgac tatacgagcg gcgagcgcat tccgaaggac   1320

tcgtaccact ggtaccagaa ggtcattgcc tccgaggggc gggcagcgct cggcgcgccc   1380

agtgctgctc gcccataa                                                 1398

<210> 328
<211> 465
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (5)...(454)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (13)...(27)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<400> 328
Met Pro Met Ser Thr Glu Thr Thr Phe Pro Ser Asp Phe Thr Trp Gly 
1               5                   10                  15      


Ala Ala Thr Ala Ala Tyr Gln Ile Glu Gly Gly Asp Arg Ala Gly Gly 
            20                  25                  30          


Arg Gly Arg Ser Val Trp Asp Met Phe Cys Glu Lys Arg Gly Ala Ile 
        35                  40                  45              


Trp Glu Gly His Thr Gly Gln Arg Ala Ser Leu His Leu Gln Arg Trp 
    50                  55                  60                  


Arg Glu Asp Val Met Leu Met Gln Gln Leu Gly Leu Arg Gly Tyr Arg 
65                  70                  75                  80  


Phe Ser Val Ser Trp Pro Arg Val Phe Pro Thr Gly Val Gly Lys Val 
                85                  90                  95      


Asn Arg Glu Gly Leu Ala Phe Tyr Asp Gln Leu Val Asp Ala Leu Leu 
            100                 105                 110         


Glu Ala Gly Ile Thr Pro Phe Ile Thr Leu Phe His Trp Asp Phe Pro 
        115                 120                 125             


Leu Asp Leu Tyr His Arg Gly Gly Trp Leu Asn Arg Asp Ser Ala Asp 
    130                 135                 140                 


Trp Phe Ala Ser Tyr Ala Glu Cys Leu Gly Lys Ala Leu Gly Asp Arg 
145                 150                 155                 160 


Val Lys His Trp Val Thr Leu Asn Glu Pro Gln Val Phe Ile Gly Leu 
                165                 170                 175     


Gly His Tyr Glu Gly Arg His Ala Pro Gly Leu Lys Leu Ser Ile Ala 
            180                 185                 190         


Glu Met Leu Arg Cys Gly His His Ala Leu Leu Ala His Gly Lys Ala 
        195                 200                 205             


Val Gln Ala Leu Arg Ala Ser Val Asp Gly Pro Cys Lys Ile Gly Phe 
    210                 215                 220                 


Ala Pro Val Gly Ile Pro Lys Leu Pro Ala Ser Glu Ser Ser Glu Asp 
225                 230                 235                 240 


Ile Ala Ala Ala Arg Lys Ala Gln Phe Ala Ala Gly Ala Pro Pro Tyr 
                245                 250                 255     


Trp Thr Leu Ser Trp Trp Ala Asp Pro Val Phe Gln Gly Thr Tyr Pro 
            260                 265                 270         


Ala Asp Ala Cys Gln Ala Leu Gly Ala Asp Ala Pro Gln Val Ala Asp 
        275                 280                 285             


His Asp Met Ser Ile Ile Ser Glu Pro Thr Asp Phe Leu Gly Leu Asn 
    290                 295                 300                 


Leu Tyr Gln Gly Val Val Val Arg Ala Asp His Thr Gly Gln Pro Glu 
305                 310                 315                 320 


Thr Val Pro Phe Pro Pro Gly Phe Pro Val Thr Ala Leu Asn Trp Ala 
                325                 330                 335     


Val Thr Pro Glu Ala Leu Tyr Trp Gly Pro Arg Phe Ala Phe Glu Arg 
            340                 345                 350         


Tyr Lys Lys Pro Ile His Ile Thr Glu Asn Gly Leu Ser Cys Arg Asp 
        355                 360                 365             


Trp Pro Ser Leu Asp Gly His Val His Asp Ala Asp Arg Ile Asp Phe 
    370                 375                 380                 


Met Ala Arg His Leu Arg Ala Ala His Arg Ala Ile Arg Asp Gly Ile 
385                 390                 395                 400 


Pro Ile Glu Gly Tyr Phe His Trp Ser Ala Ile Asp Asn Phe Glu Trp 
                405                 410                 415     


Ala Glu Gly Tyr Lys Glu Arg Phe Gly Leu Ile Tyr Val Asp Tyr Thr 
            420                 425                 430         


Ser Gly Glu Arg Ile Pro Lys Asp Ser Tyr His Trp Tyr Gln Lys Val 
        435                 440                 445             


Ile Ala Ser Glu Gly Arg Ala Ala Leu Gly Ala Pro Ser Ala Ala Arg 
    450                 455                 460                 


Pro 
465 


<210> 329
<211> 1350
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 329
atgtcagatg ccgccccgac tgatccgaaa tccgcaatgc ccagacgctc ggacttcccc     60

gagggttttg tcttcggcgc ggccaccgcg gcctatcaga tcgagggcca tgccttcggc    120

ggcgcgggcc cctgccattg ggacagcttc gccgcaaccg ggcgtaacgt ggtcggcaat    180

gaggatggcg cgcgcgcctg cgagcattac acccgctggc cgcaggatct ggacctgatc    240

cgcgaggccg ggctcgacgc ctaccgcttc tcgacctcct gggcgcgggt gatgcccgat    300

ggcgtgaccc tgaaccccga ggggctggat ttctacgacc gcctcgtcga tggcatgctc    360

gagcgcgggc taaagcccta tctcaccctc taccattggg aattgccctc ggcgcttgcc    420

gacaggggcg gctggaccaa tcgcgacacg gccgagcgct ttgccgattt cgcagcggtg    480

gtgatggagc ggttgggcag ccgcgtcgcc cgcacggcca ccatcaacga gccatggtgc    540

gtgagctggc tctcgcattt cgaaggccat cacgcgccgg gcctgcgcga catccgtgcc    600

accgcacgcg ccatgcatca tgtgcaactg gcgcacggcc tcgcgctcgg gaagctgcgc    660

gcgcaggggc atggcaatct cggcatcgtg ctgaatttct cggaaatcat tcccgccggg    720

cgagagcacg cgaaggcggc tgatctcggc gacgcaatct cgaaccgctg gttcatcgag    780

tcagtcgcgc gtggcaccta tcccgatgtg gtcctcgagg gtctgggcaa gcacatgccc    840

gagggctggc aggatgacat gaaaaccatc gcggccccgc tcgactggct gggtgtgaac    900

tactacaccc gcggcatcgt cgcgcatgac ccggacgcgt cctggccctc gacccgagcg    960

gaggaggggc ccctgcccaa gacgcagatg ggctgggaga tctaccccga gggcttgcgc   1020

aacctgctgg tgcgcatggc gcgcgactat gtgggcgacc ttcccatggt cgtgaccgaa   1080

aacgggatgg cctgggccga cgaggtcgcg gatggcgccg tcagagatac gatccgcacc   1140

gaatatgtcg cagcccatct caacgcgacc cgcgaggcgc tggccggcgg ggcgaatatc   1200

gaaggtttct tctattggtc gctgctcgac aattacgaat gggccttcgg ctatgccaag   1260

cgcttcggcc tcgtccatgt cgatttcgac acgatggcac gcacgccgaa agcctcctac   1320

cacgcgctga gggccgcgct gcagggttga                                    1350

<210> 330
<211> 449
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (15)...(443)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (235)...(238)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (361)...(369)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<220> 
<221> SITE
<222> (393)...(396)
<223> N-glycosylation site. Prosite id = PS00001

<400> 330
Met Ser Asp Ala Ala Pro Thr Asp Pro Lys Ser Ala Met Pro Arg Arg 
1               5                   10                  15      


Ser Asp Phe Pro Glu Gly Phe Val Phe Gly Ala Ala Thr Ala Ala Tyr 
            20                  25                  30          


Gln Ile Glu Gly His Ala Phe Gly Gly Ala Gly Pro Cys His Trp Asp 
        35                  40                  45              


Ser Phe Ala Ala Thr Gly Arg Asn Val Val Gly Asn Glu Asp Gly Ala 
    50                  55                  60                  


Arg Ala Cys Glu His Tyr Thr Arg Trp Pro Gln Asp Leu Asp Leu Ile 
65                  70                  75                  80  


Arg Glu Ala Gly Leu Asp Ala Tyr Arg Phe Ser Thr Ser Trp Ala Arg 
                85                  90                  95      


Val Met Pro Asp Gly Val Thr Leu Asn Pro Glu Gly Leu Asp Phe Tyr 
            100                 105                 110         


Asp Arg Leu Val Asp Gly Met Leu Glu Arg Gly Leu Lys Pro Tyr Leu 
        115                 120                 125             


Thr Leu Tyr His Trp Glu Leu Pro Ser Ala Leu Ala Asp Arg Gly Gly 
    130                 135                 140                 


Trp Thr Asn Arg Asp Thr Ala Glu Arg Phe Ala Asp Phe Ala Ala Val 
145                 150                 155                 160 


Val Met Glu Arg Leu Gly Ser Arg Val Ala Arg Thr Ala Thr Ile Asn 
                165                 170                 175     


Glu Pro Trp Cys Val Ser Trp Leu Ser His Phe Glu Gly His His Ala 
            180                 185                 190         


Pro Gly Leu Arg Asp Ile Arg Ala Thr Ala Arg Ala Met His His Val 
        195                 200                 205             


Gln Leu Ala His Gly Leu Ala Leu Gly Lys Leu Arg Ala Gln Gly His 
    210                 215                 220                 


Gly Asn Leu Gly Ile Val Leu Asn Phe Ser Glu Ile Ile Pro Ala Gly 
225                 230                 235                 240 


Arg Glu His Ala Lys Ala Ala Asp Leu Gly Asp Ala Ile Ser Asn Arg 
                245                 250                 255     


Trp Phe Ile Glu Ser Val Ala Arg Gly Thr Tyr Pro Asp Val Val Leu 
            260                 265                 270         


Glu Gly Leu Gly Lys His Met Pro Glu Gly Trp Gln Asp Asp Met Lys 
        275                 280                 285             


Thr Ile Ala Ala Pro Leu Asp Trp Leu Gly Val Asn Tyr Tyr Thr Arg 
    290                 295                 300                 


Gly Ile Val Ala His Asp Pro Asp Ala Ser Trp Pro Ser Thr Arg Ala 
305                 310                 315                 320 


Glu Glu Gly Pro Leu Pro Lys Thr Gln Met Gly Trp Glu Ile Tyr Pro 
                325                 330                 335     


Glu Gly Leu Arg Asn Leu Leu Val Arg Met Ala Arg Asp Tyr Val Gly 
            340                 345                 350         


Asp Leu Pro Met Val Val Thr Glu Asn Gly Met Ala Trp Ala Asp Glu 
        355                 360                 365             


Val Ala Asp Gly Ala Val Arg Asp Thr Ile Arg Thr Glu Tyr Val Ala 
    370                 375                 380                 


Ala His Leu Asn Ala Thr Arg Glu Ala Leu Ala Gly Gly Ala Asn Ile 
385                 390                 395                 400 


Glu Gly Phe Phe Tyr Trp Ser Leu Leu Asp Asn Tyr Glu Trp Ala Phe 
                405                 410                 415     


Gly Tyr Ala Lys Arg Phe Gly Leu Val His Val Asp Phe Asp Thr Met 
            420                 425                 430         


Ala Arg Thr Pro Lys Ala Ser Tyr His Ala Leu Arg Ala Ala Leu Gln 
        435                 440                 445             


Gly 
    


<210> 331
<211> 1116
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 331
atgaataaaa tcctcaaact cttcagcagc ctgctgcttt ttgcaggcat ctgtcccgcg     60

cttcaggcag agccagtaga aacctacttt cccctgtccc gcgggatcaa catgagccac    120

tggctctctc aagtgaatga aaacattccc gaccgttcca cctatgtgac ggagcgggat    180

ttgcaatttc tgcgggcagc cggtttcgac catgtgcgtc tgccaatcga tgaggtcgaa    240

ctctgggatg aagagggcaa tcagatcgag gaggcctggc aatacatgca taactttctc    300

cgttggagcc gaaagaacga tctccgggtc attctcgacc tgcacacggt attgtcccac    360

cacttcaacg cggtaaatat gggagaggtc aatacactct tcaatgatcc cagggaacag    420

gaaaagttcc tcaacctatg ggaacaaatc atggatgccg tgggtcacca tccgaatgag    480

tttctcgcct atgaaatgct caatgaggcg gtcgcggaag atgatgaaga ctggaatctg    540

ctcctcaacc gcgccattgt ccgcatccgg gaccgtgagc cttatcgggt gctgattgcg    600

gggtcgaact ggtggcagca tgccgaccgg gtccccaacc tgaggctccc gaaaggagac    660

cccaatatca tcatcagttt tcatttttat tccccttttc tcttcaccca ctaccgcagt    720

agctggactg cgatgcaggc gtaccagggc ttcgtccaat accctggcaa aaccatacct    780

tccatacatc tcgaaggcat gaactacccg gagtccttcg ttcatatgtg ggaagcgcac    840

aatcggtact atgacatcca ttccatgtat gccgaaatgg tcccggcggt gcgttttgcc    900

gaaaagttgg gacttcggct ctattgcgga gaattcgggg ccatgaagac cgttgatcgc    960

gcccagatgc tgcagtggta tcgggatgtt gtcactgtat ttaataaatt gggtattccc   1020

tatactgcct gggattatca gggaaccttc ggaatccgcg atgagctgac cggtgagccc   1080

gatcatgaaa tgatcgatat tctcctcggg cgctga                             1116

<210> 332
<211> 371
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(23)

<220> 
<221> DOMAIN
<222> (39)...(350)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (37)...(40)
<223> N-glycosylation site. Prosite id = PS00001

<400> 332
Met Asn Lys Ile Leu Lys Leu Phe Ser Ser Leu Leu Leu Phe Ala Gly 
1               5                   10                  15      


Ile Cys Pro Ala Leu Gln Ala Glu Pro Val Glu Thr Tyr Phe Pro Leu 
            20                  25                  30          


Ser Arg Gly Ile Asn Met Ser His Trp Leu Ser Gln Val Asn Glu Asn 
        35                  40                  45              


Ile Pro Asp Arg Ser Thr Tyr Val Thr Glu Arg Asp Leu Gln Phe Leu 
    50                  55                  60                  


Arg Ala Ala Gly Phe Asp His Val Arg Leu Pro Ile Asp Glu Val Glu 
65                  70                  75                  80  


Leu Trp Asp Glu Glu Gly Asn Gln Ile Glu Glu Ala Trp Gln Tyr Met 
                85                  90                  95      


His Asn Phe Leu Arg Trp Ser Arg Lys Asn Asp Leu Arg Val Ile Leu 
            100                 105                 110         


Asp Leu His Thr Val Leu Ser His His Phe Asn Ala Val Asn Met Gly 
        115                 120                 125             


Glu Val Asn Thr Leu Phe Asn Asp Pro Arg Glu Gln Glu Lys Phe Leu 
    130                 135                 140                 


Asn Leu Trp Glu Gln Ile Met Asp Ala Val Gly His His Pro Asn Glu 
145                 150                 155                 160 


Phe Leu Ala Tyr Glu Met Leu Asn Glu Ala Val Ala Glu Asp Asp Glu 
                165                 170                 175     


Asp Trp Asn Leu Leu Leu Asn Arg Ala Ile Val Arg Ile Arg Asp Arg 
            180                 185                 190         


Glu Pro Tyr Arg Val Leu Ile Ala Gly Ser Asn Trp Trp Gln His Ala 
        195                 200                 205             


Asp Arg Val Pro Asn Leu Arg Leu Pro Lys Gly Asp Pro Asn Ile Ile 
    210                 215                 220                 


Ile Ser Phe His Phe Tyr Ser Pro Phe Leu Phe Thr His Tyr Arg Ser 
225                 230                 235                 240 


Ser Trp Thr Ala Met Gln Ala Tyr Gln Gly Phe Val Gln Tyr Pro Gly 
                245                 250                 255     


Lys Thr Ile Pro Ser Ile His Leu Glu Gly Met Asn Tyr Pro Glu Ser 
            260                 265                 270         


Phe Val His Met Trp Glu Ala His Asn Arg Tyr Tyr Asp Ile His Ser 
        275                 280                 285             


Met Tyr Ala Glu Met Val Pro Ala Val Arg Phe Ala Glu Lys Leu Gly 
    290                 295                 300                 


Leu Arg Leu Tyr Cys Gly Glu Phe Gly Ala Met Lys Thr Val Asp Arg 
305                 310                 315                 320 


Ala Gln Met Leu Gln Trp Tyr Arg Asp Val Val Thr Val Phe Asn Lys 
                325                 330                 335     


Leu Gly Ile Pro Tyr Thr Ala Trp Asp Tyr Gln Gly Thr Phe Gly Ile 
            340                 345                 350         


Arg Asp Glu Leu Thr Gly Glu Pro Asp His Glu Met Ile Asp Ile Leu 
        355                 360                 365             


Leu Gly Arg 
    370     


<210> 333
<211> 1383
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 333
atgagcaaac tccccaaatt cctctttgga gccggcacct caagttatca gatcgaaggt     60

gcctggaata tagatggcaa aggtccctcc atttgggatt tccacactcg ccatcccggc    120

gcggtttatc ggatgcacaa cggggatatg gcctgcgatc attatcatcg gtatcgaacg    180

gatatcgagc tgatgcagaa gatcggccta gaggcttacc gcttttccat aaactggccc    240

cgggttctgc cggaagggac cggtgccgcc aatgaagcag gtctggactt ttacgaccgg    300

ctggtggacg cactgttgga agcgggaatt cagccttgga tcacccttta tcactgggaa    360

ctcccctggg ctctccacct gcgcgggggt tggctcaatc gggacatgcc cgaccacatt    420

gagaactacg ccgccttggt cgccaggtgc ctcggtgacc gggtgaaaaa ctggattact    480

ttgaatgagc ctcaggtttt catcgggctt ggctatgcca gcggggttca tgcccccggc    540

tataagttgt ccttgcggga gtgcctggtc ggttcccacc atgccgtgct ttcccaccac    600

cgggcagtca aggcgatccg ggccaactgc gaaggcagcg tccagatcgg ctcagccccg    660

gtgggtgttg tctgccgacc ggaaacggag tcggcagcag acattgaggc tgcccgccag    720

gccacctacc atatcaacac tcccagcacc cacactcccg acaatctgat cggctgcctc    780

tggaacagca cttggtggat agatccaatg gttctgggga agtatccgga acacgggctg    840

aaagcctttg aaagctatct gccggacaac attcaggccg aactggatgc cgtattcgaa    900

ccgacggact ttgtcggttc caacatctac cacggccgca cggtgcgggc caagcaggat    960

ggtggttttg agtttatcga ccttccgccc ggcagccccc gcaccaccat gggctgggac   1020

atcaccccgg acatcctcta ctggggagga aagtatcttt acgaacgcta tggcaagccg   1080

atgtttatca cggaaaacgg cattgccgtc ccggaactgg tgaatgatga aggccaggtc   1140

gaggataccg tccgtgagca atacatgaag ctgcacctgc gtgggctgca gcgggcccgc   1200

gatgaaggca tcccctatgc cggatacttc cactggtccc tgctcgacaa cttcgagtgg   1260

gaacaaggct actcccagcg ctttggcatg gtctacgtcg actaccagac ccaggaacgt   1320

atcctcaaac gttcgggcca gcatttcgct gccatcgtcc gggaaatcac cggaaccgcc   1380

taa                                                                 1383

<210> 334
<211> 460
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(458)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (7)...(21)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (266)...(269)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (366)...(374)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 334
Met Ser Lys Leu Pro Lys Phe Leu Phe Gly Ala Gly Thr Ser Ser Tyr 
1               5                   10                  15      


Gln Ile Glu Gly Ala Trp Asn Ile Asp Gly Lys Gly Pro Ser Ile Trp 
            20                  25                  30          


Asp Phe His Thr Arg His Pro Gly Ala Val Tyr Arg Met His Asn Gly 
        35                  40                  45              


Asp Met Ala Cys Asp His Tyr His Arg Tyr Arg Thr Asp Ile Glu Leu 
    50                  55                  60                  


Met Gln Lys Ile Gly Leu Glu Ala Tyr Arg Phe Ser Ile Asn Trp Pro 
65                  70                  75                  80  


Arg Val Leu Pro Glu Gly Thr Gly Ala Ala Asn Glu Ala Gly Leu Asp 
                85                  90                  95      


Phe Tyr Asp Arg Leu Val Asp Ala Leu Leu Glu Ala Gly Ile Gln Pro 
            100                 105                 110         


Trp Ile Thr Leu Tyr His Trp Glu Leu Pro Trp Ala Leu His Leu Arg 
        115                 120                 125             


Gly Gly Trp Leu Asn Arg Asp Met Pro Asp His Ile Glu Asn Tyr Ala 
    130                 135                 140                 


Ala Leu Val Ala Arg Cys Leu Gly Asp Arg Val Lys Asn Trp Ile Thr 
145                 150                 155                 160 


Leu Asn Glu Pro Gln Val Phe Ile Gly Leu Gly Tyr Ala Ser Gly Val 
                165                 170                 175     


His Ala Pro Gly Tyr Lys Leu Ser Leu Arg Glu Cys Leu Val Gly Ser 
            180                 185                 190         


His His Ala Val Leu Ser His His Arg Ala Val Lys Ala Ile Arg Ala 
        195                 200                 205             


Asn Cys Glu Gly Ser Val Gln Ile Gly Ser Ala Pro Val Gly Val Val 
    210                 215                 220                 


Cys Arg Pro Glu Thr Glu Ser Ala Ala Asp Ile Glu Ala Ala Arg Gln 
225                 230                 235                 240 


Ala Thr Tyr His Ile Asn Thr Pro Ser Thr His Thr Pro Asp Asn Leu 
                245                 250                 255     


Ile Gly Cys Leu Trp Asn Ser Thr Trp Trp Ile Asp Pro Met Val Leu 
            260                 265                 270         


Gly Lys Tyr Pro Glu His Gly Leu Lys Ala Phe Glu Ser Tyr Leu Pro 
        275                 280                 285             


Asp Asn Ile Gln Ala Glu Leu Asp Ala Val Phe Glu Pro Thr Asp Phe 
    290                 295                 300                 


Val Gly Ser Asn Ile Tyr His Gly Arg Thr Val Arg Ala Lys Gln Asp 
305                 310                 315                 320 


Gly Gly Phe Glu Phe Ile Asp Leu Pro Pro Gly Ser Pro Arg Thr Thr 
                325                 330                 335     


Met Gly Trp Asp Ile Thr Pro Asp Ile Leu Tyr Trp Gly Gly Lys Tyr 
            340                 345                 350         


Leu Tyr Glu Arg Tyr Gly Lys Pro Met Phe Ile Thr Glu Asn Gly Ile 
        355                 360                 365             


Ala Val Pro Glu Leu Val Asn Asp Glu Gly Gln Val Glu Asp Thr Val 
    370                 375                 380                 


Arg Glu Gln Tyr Met Lys Leu His Leu Arg Gly Leu Gln Arg Ala Arg 
385                 390                 395                 400 


Asp Glu Gly Ile Pro Tyr Ala Gly Tyr Phe His Trp Ser Leu Leu Asp 
                405                 410                 415     


Asn Phe Glu Trp Glu Gln Gly Tyr Ser Gln Arg Phe Gly Met Val Tyr 
            420                 425                 430         


Val Asp Tyr Gln Thr Gln Glu Arg Ile Leu Lys Arg Ser Gly Gln His 
        435                 440                 445             


Phe Ala Ala Ile Val Arg Glu Ile Thr Gly Thr Ala 
    450                 455                 460 


<210> 335
<211> 1353
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 335
atgaaaaaat acctttttcc tgaaaatttt ttatggggtg ctgccacagc ttcgtatcaa     60

atcgaaggtt ctccctctgc tgatggcaaa ggtgaatcga tatgggaccg tttttctcac    120

acaccgggga acatttggaa cgctgaaacc ggggatatcg cctgcgatca ttaccggcgt    180

tacgtggatg atgtaaagct gatttcacaa atcgggctta acgcgtaccg tttttcaatt    240

tcctggccca gggtatttcc agaggggaga ggaaaagcaa atgaaaaagg actcgatttt    300

taccgcaggt tgattgaaca gctgcagcaa catcgaatca aaacggcagt gacactttac    360

cactgggatc ttccacaagt tctgcaggat cgcggcgggt gggcaaaccg tgatacggcg    420

aagtattttt ctgagtatgc cacctttctc tttgaaaaac tcgatctccc cgttgacatg    480

tggattactc ttaacgaacc atgggttatc gctattctgg ggcatgcttt tggtatccac    540

gctccaggga tgagtgactt cagcacagcc ctccaggtct cgcataacct gcttctgggg    600

cacgggttgg cggttaaagc atttcgggag tctaagaggg gtgatgaacc ggtaggtatt    660

acccttaacc ttgccccggt tgaaccgctg accgaaaagc ccgccgatct aaaggcagct    720

ttactttctg acggttttat gaaccgctgg taccttgatc ccctgttcaa aggtggttac    780

cctgaagata tgatggatat ctattcccgg aactttgaac tgcccaaaat tgaaaagggg    840

gatgctcagg ttattgccga accgatcgac ttcctgggca taaataacta taccagggtt    900

ctcgtggaag ccagcggtga tgaaaatgcc tttatgggca accctgtcaa cccccagggc    960

tctgaatata ctgaaatggg ttgggaggtt tatccgcagg gtctctacga cctgctgacc   1020

agggttcacc gggattacgg gccaatgccg ctatatataa ctgaaaacgg ggcagccttt   1080

cccgatgaac ttgacagcaa tgggcagata gatgatccaa ggcggataaa ttacctggaa   1140

acttatcttc atcagtgctg gaaggcagtt caggacggtg tgcctctaaa aggctatttt   1200

gtctggaccc tgatggataa cttcgagtgg gctttcggtt tcagcaagcg atttgggctc   1260

atatacgtag attaccagga tcagaaacgt tacttgaaaa acagcgccta ctggtatagc   1320

aaggttattg ggcgaaacgg cctcgagcta taa                                1353

<210> 336
<211> 450
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (4)...(448)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (300)...(303)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (356)...(364)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 336
Met Lys Lys Tyr Leu Phe Pro Glu Asn Phe Leu Trp Gly Ala Ala Thr 
1               5                   10                  15      


Ala Ser Tyr Gln Ile Glu Gly Ser Pro Ser Ala Asp Gly Lys Gly Glu 
            20                  25                  30          


Ser Ile Trp Asp Arg Phe Ser His Thr Pro Gly Asn Ile Trp Asn Ala 
        35                  40                  45              


Glu Thr Gly Asp Ile Ala Cys Asp His Tyr Arg Arg Tyr Val Asp Asp 
    50                  55                  60                  


Val Lys Leu Ile Ser Gln Ile Gly Leu Asn Ala Tyr Arg Phe Ser Ile 
65                  70                  75                  80  


Ser Trp Pro Arg Val Phe Pro Glu Gly Arg Gly Lys Ala Asn Glu Lys 
                85                  90                  95      


Gly Leu Asp Phe Tyr Arg Arg Leu Ile Glu Gln Leu Gln Gln His Arg 
            100                 105                 110         


Ile Lys Thr Ala Val Thr Leu Tyr His Trp Asp Leu Pro Gln Val Leu 
        115                 120                 125             


Gln Asp Arg Gly Gly Trp Ala Asn Arg Asp Thr Ala Lys Tyr Phe Ser 
    130                 135                 140                 


Glu Tyr Ala Thr Phe Leu Phe Glu Lys Leu Asp Leu Pro Val Asp Met 
145                 150                 155                 160 


Trp Ile Thr Leu Asn Glu Pro Trp Val Ile Ala Ile Leu Gly His Ala 
                165                 170                 175     


Phe Gly Ile His Ala Pro Gly Met Ser Asp Phe Ser Thr Ala Leu Gln 
            180                 185                 190         


Val Ser His Asn Leu Leu Leu Gly His Gly Leu Ala Val Lys Ala Phe 
        195                 200                 205             


Arg Glu Ser Lys Arg Gly Asp Glu Pro Val Gly Ile Thr Leu Asn Leu 
    210                 215                 220                 


Ala Pro Val Glu Pro Leu Thr Glu Lys Pro Ala Asp Leu Lys Ala Ala 
225                 230                 235                 240 


Leu Leu Ser Asp Gly Phe Met Asn Arg Trp Tyr Leu Asp Pro Leu Phe 
                245                 250                 255     


Lys Gly Gly Tyr Pro Glu Asp Met Met Asp Ile Tyr Ser Arg Asn Phe 
            260                 265                 270         


Glu Leu Pro Lys Ile Glu Lys Gly Asp Ala Gln Val Ile Ala Glu Pro 
        275                 280                 285             


Ile Asp Phe Leu Gly Ile Asn Asn Tyr Thr Arg Val Leu Val Glu Ala 
    290                 295                 300                 


Ser Gly Asp Glu Asn Ala Phe Met Gly Asn Pro Val Asn Pro Gln Gly 
305                 310                 315                 320 


Ser Glu Tyr Thr Glu Met Gly Trp Glu Val Tyr Pro Gln Gly Leu Tyr 
                325                 330                 335     


Asp Leu Leu Thr Arg Val His Arg Asp Tyr Gly Pro Met Pro Leu Tyr 
            340                 345                 350         


Ile Thr Glu Asn Gly Ala Ala Phe Pro Asp Glu Leu Asp Ser Asn Gly 
        355                 360                 365             


Gln Ile Asp Asp Pro Arg Arg Ile Asn Tyr Leu Glu Thr Tyr Leu His 
    370                 375                 380                 


Gln Cys Trp Lys Ala Val Gln Asp Gly Val Pro Leu Lys Gly Tyr Phe 
385                 390                 395                 400 


Val Trp Thr Leu Met Asp Asn Phe Glu Trp Ala Phe Gly Phe Ser Lys 
                405                 410                 415     


Arg Phe Gly Leu Ile Tyr Val Asp Tyr Gln Asp Gln Lys Arg Tyr Leu 
            420                 425                 430         


Lys Asn Ser Ala Tyr Trp Tyr Ser Lys Val Ile Gly Arg Asn Gly Leu 
        435                 440                 445             


Glu Leu 
    450 


<210> 337
<211> 1014
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 337
atgtccaggg gcatcctgat cctcgtcatg ctgtctgttc tgagcggcgc ggcgctggcc     60

caaccggccg ggctgccgcc gcgttcgccg gtgcagcgct gcatcaacct gggcaatatg    120

ctggaagcgc cggaggaggg ctggtggggg ctgcgcgtcg agcgcgacta cctgacgacg    180

atcgccgggg ccgggttcga tgcggtgcgc atcccgataa gctggtcaac ccatgctgcc    240

agcgagccgc cctacaccat cgatccggct ttcttcgccc gcgttgatga agtcgtcggc    300

tgggcgctgg cggacgggct gaaggccatc atcaacgtgc accactacga ggagatgatg    360

agcgatccgg cggggcattt cccccggctg cgcgcgctgt gggcgcagat cgcggagcac    420

tacgccgact acccgcccgc gctgatgttc gagctgctca acgaaccgtt cgaggcgctg    480

acgccgctgc ggtggaacga gtacgccgcc gatctgatcg cgctgatccg ccagaccaac    540

ccggggcgca ccctgatcgt cggcgggggc tggtggaaca gtgtggaagg gctgatgcag    600

ctccgcctgc cggatgatcc cgatctgctg gcgacgttcc attactacca cccgttcgag    660

ttcacgcatc agggggcgga gtggtcaccg gaagtgactg acctgagcgg gatcgcctgg    720

gggacgggcg aggaacggct cgatctggag tccaatatcc gtattgcggc ggcctgggcg    780

gtgtacaacc ggcgcccgct gctgttgggc gaattcggcg tctatggccg ggtggccgat    840

ctcgattcgc gcctgcgctg gacgacggcg gtgcgcgccg aggccgaggc gcagggcatc    900

ggctggtgct actgggaatt cgccgccggc ttcggcattt acgacccgga aagccggacg    960

ttcaacccgc tgtaccgcgc gctgatcccg caggccgggc cggcgcgccc ctga         1014

<210> 338
<211> 337
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(20)

<220> 
<221> DOMAIN
<222> (38)...(314)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (150)...(159)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<400> 338
Met Ser Arg Gly Ile Leu Ile Leu Val Met Leu Ser Val Leu Ser Gly 
1               5                   10                  15      


Ala Ala Leu Ala Gln Pro Ala Gly Leu Pro Pro Arg Ser Pro Val Gln 
            20                  25                  30          


Arg Cys Ile Asn Leu Gly Asn Met Leu Glu Ala Pro Glu Glu Gly Trp 
        35                  40                  45              


Trp Gly Leu Arg Val Glu Arg Asp Tyr Leu Thr Thr Ile Ala Gly Ala 
    50                  55                  60                  


Gly Phe Asp Ala Val Arg Ile Pro Ile Ser Trp Ser Thr His Ala Ala 
65                  70                  75                  80  


Ser Glu Pro Pro Tyr Thr Ile Asp Pro Ala Phe Phe Ala Arg Val Asp 
                85                  90                  95      


Glu Val Val Gly Trp Ala Leu Ala Asp Gly Leu Lys Ala Ile Ile Asn 
            100                 105                 110         


Val His His Tyr Glu Glu Met Met Ser Asp Pro Ala Gly His Phe Pro 
        115                 120                 125             


Arg Leu Arg Ala Leu Trp Ala Gln Ile Ala Glu His Tyr Ala Asp Tyr 
    130                 135                 140                 


Pro Pro Ala Leu Met Phe Glu Leu Leu Asn Glu Pro Phe Glu Ala Leu 
145                 150                 155                 160 


Thr Pro Leu Arg Trp Asn Glu Tyr Ala Ala Asp Leu Ile Ala Leu Ile 
                165                 170                 175     


Arg Gln Thr Asn Pro Gly Arg Thr Leu Ile Val Gly Gly Gly Trp Trp 
            180                 185                 190         


Asn Ser Val Glu Gly Leu Met Gln Leu Arg Leu Pro Asp Asp Pro Asp 
        195                 200                 205             


Leu Leu Ala Thr Phe His Tyr Tyr His Pro Phe Glu Phe Thr His Gln 
    210                 215                 220                 


Gly Ala Glu Trp Ser Pro Glu Val Thr Asp Leu Ser Gly Ile Ala Trp 
225                 230                 235                 240 


Gly Thr Gly Glu Glu Arg Leu Asp Leu Glu Ser Asn Ile Arg Ile Ala 
                245                 250                 255     


Ala Ala Trp Ala Val Tyr Asn Arg Arg Pro Leu Leu Leu Gly Glu Phe 
            260                 265                 270         


Gly Val Tyr Gly Arg Val Ala Asp Leu Asp Ser Arg Leu Arg Trp Thr 
        275                 280                 285             


Thr Ala Val Arg Ala Glu Ala Glu Ala Gln Gly Ile Gly Trp Cys Tyr 
    290                 295                 300                 


Trp Glu Phe Ala Ala Gly Phe Gly Ile Tyr Asp Pro Glu Ser Arg Thr 
305                 310                 315                 320 


Phe Asn Pro Leu Tyr Arg Ala Leu Ile Pro Gln Ala Gly Pro Ala Arg 
                325                 330                 335     


Pro 
    


<210> 339
<211> 1389
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 339
atgagcgctt cgagtccctc ccgccccctg tccttcccag agcagttcgt ctggggtgct     60

gccgcggcct cctaccaagt cgagggcgcc gtccacgagg acgggaaggg cccctccgtc    120

tgggacatgt tctgcgagaa gcccggagcg gtcttccagg ggcacgacgg ggcggtggct    180

tgcgaccact atcaccgcta ccgagaggac gtggcgttga tgcgacaggt gggcctgcac    240

gcctaccgcc tgagcgtgtg ctggccccga gtgctcccgg agggcgtcgg gcagcccaac    300

gagaagggcc tcgacttcta ctcgcggttg gtggacgcgc tgctcgaggc agggattacg    360

ccctgggtaa cgctttttca ttgggactac cccttggctc tctatcaccg ggggggctgg    420

ctcaaccggg atagcgcgga ttggtttgcc gagtacgcgg gcctaatcgc cgatcgcctc    480

tccgaccggg tgcagcattt cttcactcag aacgagcccc aggtctatat cggcttcgga    540

cacctcgagg gtaagcatgc tccaggagac accttgccca tgtcccaggt gctgcttgcg    600

gggcatcata gcctactggc gcacggcaag gccgtgcagg cgctccgcgc ccaggcgaag    660

cagcagctgc gcgtcggcta cgctcccgtc ggcatgcccc tccatccctt cacggactcg    720

gccgaggacg tggccgctgc gcggaaggcg accttttggg ttcgggagaa gaactcctgg    780

aacaacgcct ggtggatgga cccggtgttc ttgggtgagt acccggctca gggcctcgcc    840

ttcttcggcc gggacgtgcc gcaggtgcgc gagggagaca tgcagctcat cgcgcagccc    900

ttggacttct ttggggtcaa catctaccag agcacccccg tgcgcgcgtc tagcgccgaa    960

agcggcttcg aggtcgtccc ccatccaacg ggctatccta tcactgcctt caactggccg   1020

atcacgcccc aggccctcta ctggggtccg cgcttcttct acgagcgcta ccagaagccg   1080

atcgtcatca cggagaacgg actgtcctgt cgggacgtcg tcgctgtgga cgggaaggtt   1140

cacgatccgg ctcgcatcga tttcaccacc cgctatctgc gcgagctcca ccgagccgtc   1200

gcggacggcg tcgcggtcga gggctacttc cactggtcca tcatggacaa cttcgaatgg   1260

gctgccggct accgcgagcg gttcgggctc attcacgtcg actacgagac cctggcgcgg   1320

acgcccaagg cgtccgctgc gtggtatcgc aaggtaatcg agagcaacgg agcgaccctt   1380

ttcggatga                                                           1389

<210> 340
<211> 462
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (8)...(458)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (16)...(30)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (366)...(374)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 340
Met Ser Ala Ser Ser Pro Ser Arg Pro Leu Ser Phe Pro Glu Gln Phe 
1               5                   10                  15      


Val Trp Gly Ala Ala Ala Ala Ser Tyr Gln Val Glu Gly Ala Val His 
            20                  25                  30          


Glu Asp Gly Lys Gly Pro Ser Val Trp Asp Met Phe Cys Glu Lys Pro 
        35                  40                  45              


Gly Ala Val Phe Gln Gly His Asp Gly Ala Val Ala Cys Asp His Tyr 
    50                  55                  60                  


His Arg Tyr Arg Glu Asp Val Ala Leu Met Arg Gln Val Gly Leu His 
65                  70                  75                  80  


Ala Tyr Arg Leu Ser Val Cys Trp Pro Arg Val Leu Pro Glu Gly Val 
                85                  90                  95      


Gly Gln Pro Asn Glu Lys Gly Leu Asp Phe Tyr Ser Arg Leu Val Asp 
            100                 105                 110         


Ala Leu Leu Glu Ala Gly Ile Thr Pro Trp Val Thr Leu Phe His Trp 
        115                 120                 125             


Asp Tyr Pro Leu Ala Leu Tyr His Arg Gly Gly Trp Leu Asn Arg Asp 
    130                 135                 140                 


Ser Ala Asp Trp Phe Ala Glu Tyr Ala Gly Leu Ile Ala Asp Arg Leu 
145                 150                 155                 160 


Ser Asp Arg Val Gln His Phe Phe Thr Gln Asn Glu Pro Gln Val Tyr 
                165                 170                 175     


Ile Gly Phe Gly His Leu Glu Gly Lys His Ala Pro Gly Asp Thr Leu 
            180                 185                 190         


Pro Met Ser Gln Val Leu Leu Ala Gly His His Ser Leu Leu Ala His 
        195                 200                 205             


Gly Lys Ala Val Gln Ala Leu Arg Ala Gln Ala Lys Gln Gln Leu Arg 
    210                 215                 220                 


Val Gly Tyr Ala Pro Val Gly Met Pro Leu His Pro Phe Thr Asp Ser 
225                 230                 235                 240 


Ala Glu Asp Val Ala Ala Ala Arg Lys Ala Thr Phe Trp Val Arg Glu 
                245                 250                 255     


Lys Asn Ser Trp Asn Asn Ala Trp Trp Met Asp Pro Val Phe Leu Gly 
            260                 265                 270         


Glu Tyr Pro Ala Gln Gly Leu Ala Phe Phe Gly Arg Asp Val Pro Gln 
        275                 280                 285             


Val Arg Glu Gly Asp Met Gln Leu Ile Ala Gln Pro Leu Asp Phe Phe 
    290                 295                 300                 


Gly Val Asn Ile Tyr Gln Ser Thr Pro Val Arg Ala Ser Ser Ala Glu 
305                 310                 315                 320 


Ser Gly Phe Glu Val Val Pro His Pro Thr Gly Tyr Pro Ile Thr Ala 
                325                 330                 335     


Phe Asn Trp Pro Ile Thr Pro Gln Ala Leu Tyr Trp Gly Pro Arg Phe 
            340                 345                 350         


Phe Tyr Glu Arg Tyr Gln Lys Pro Ile Val Ile Thr Glu Asn Gly Leu 
        355                 360                 365             


Ser Cys Arg Asp Val Val Ala Val Asp Gly Lys Val His Asp Pro Ala 
    370                 375                 380                 


Arg Ile Asp Phe Thr Thr Arg Tyr Leu Arg Glu Leu His Arg Ala Val 
385                 390                 395                 400 


Ala Asp Gly Val Ala Val Glu Gly Tyr Phe His Trp Ser Ile Met Asp 
                405                 410                 415     


Asn Phe Glu Trp Ala Ala Gly Tyr Arg Glu Arg Phe Gly Leu Ile His 
            420                 425                 430         


Val Asp Tyr Glu Thr Leu Ala Arg Thr Pro Lys Ala Ser Ala Ala Trp 
        435                 440                 445             


Tyr Arg Lys Val Ile Glu Ser Asn Gly Ala Thr Leu Phe Gly 
    450                 455                 460         


<210> 341
<211> 1377
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 341
atgacacaac tggcttttcc atctaacttc atctggggaa cagctacttc cgcttaccaa     60

atcgaaggcg cctggaacgc agacggcaag ggcgaatcta tttgggatcg cttttcccat    120

acgcagggga agatcattga cggcagcaac ggcgatgtgg cctgcgatca ctaccaccgc    180

tggcgcgagg acgtggccct catgagagac ttgggtatgc aggcatatcg cttctccatc    240

tcctggccac gcatcctgcc caccggtcat ggacagatca atcaggctgg gctggacttt    300

tacaatcgcc tggtggacgg gttgctggaa gctggcatca agccctttgc caccctctac    360

cactgggacc tgccgctggc gctacaggct gacggcggct ggccggagcg ctccacggcc    420

aaggcctttg tcgaatacgc cgacgtggtc agccgcgcgc tgggcgatcg ggtgaagagc    480

tggatcaccc ataacgaacc gtggtgcatc agcatgctga gccatcaaat tggggagcat    540

gcgcccggct ggcgggactg gcaggctgcg ttggcggccg cgcaccacgt cctcctttcg    600

catggttggg ccgtgccgga actgcgtcgc aacagccgcg atgcagaaat cggcatcacg    660

ttgaacttta ccccggcgga gccagcttcg aacagcgcag ccgatttcaa ggcctatcgc    720

cagttcgatg gctacttcaa ccgctggttc ctggacccgc tctatggccg ccactatccg    780

gcagatatgg tgcacgatta catcgcgcaa ggctacctgc catcacaggg tttgactttc    840

gtggaagctg gtgacctgga cgcgatcgcg acgcgcaccg atttcctggg tgtgaactat    900

tacacgcgcg aagtggtccg tagccaggaa atcccagaga gtgagaacgc gccgcgcaca    960

gtcttgcgcg cgccacagga agagtggaca gagatgggct gggaagtgta tcctgagggc   1020

ctctacaggt tgctcaatcg gttgcacttt gaataccagc cgcgcaagct ctacgtgacc   1080

gagagcggtt gcagctactc cgatggaccc ggccccaacg gtcggatacc ggaccaacgc   1140

cgtatcaact acctgcgcga tcacttcgca gcggcgcatc aggcgataca atgcggcgtc   1200

ccgctggccg gctacttcgt ctggtcgttc atggacaact tcgagtgggc caaagggtac   1260

acccaacgtt ttggtatcgt atgggtggat tatcaatcgc aacgacggat accgaaagac   1320

agcgcctact ggtatcgcga tgtcgtcgcc gccaacgcgg tgcaagttcc tgattag      1377

<210> 342
<211> 458
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (2)...(454)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<400> 342
Met Thr Gln Leu Ala Phe Pro Ser Asn Phe Ile Trp Gly Thr Ala Thr 
1               5                   10                  15      


Ser Ala Tyr Gln Ile Glu Gly Ala Trp Asn Ala Asp Gly Lys Gly Glu 
            20                  25                  30          


Ser Ile Trp Asp Arg Phe Ser His Thr Gln Gly Lys Ile Ile Asp Gly 
        35                  40                  45              


Ser Asn Gly Asp Val Ala Cys Asp His Tyr His Arg Trp Arg Glu Asp 
    50                  55                  60                  


Val Ala Leu Met Arg Asp Leu Gly Met Gln Ala Tyr Arg Phe Ser Ile 
65                  70                  75                  80  


Ser Trp Pro Arg Ile Leu Pro Thr Gly His Gly Gln Ile Asn Gln Ala 
                85                  90                  95      


Gly Leu Asp Phe Tyr Asn Arg Leu Val Asp Gly Leu Leu Glu Ala Gly 
            100                 105                 110         


Ile Lys Pro Phe Ala Thr Leu Tyr His Trp Asp Leu Pro Leu Ala Leu 
        115                 120                 125             


Gln Ala Asp Gly Gly Trp Pro Glu Arg Ser Thr Ala Lys Ala Phe Val 
    130                 135                 140                 


Glu Tyr Ala Asp Val Val Ser Arg Ala Leu Gly Asp Arg Val Lys Ser 
145                 150                 155                 160 


Trp Ile Thr His Asn Glu Pro Trp Cys Ile Ser Met Leu Ser His Gln 
                165                 170                 175     


Ile Gly Glu His Ala Pro Gly Trp Arg Asp Trp Gln Ala Ala Leu Ala 
            180                 185                 190         


Ala Ala His His Val Leu Leu Ser His Gly Trp Ala Val Pro Glu Leu 
        195                 200                 205             


Arg Arg Asn Ser Arg Asp Ala Glu Ile Gly Ile Thr Leu Asn Phe Thr 
    210                 215                 220                 


Pro Ala Glu Pro Ala Ser Asn Ser Ala Ala Asp Phe Lys Ala Tyr Arg 
225                 230                 235                 240 


Gln Phe Asp Gly Tyr Phe Asn Arg Trp Phe Leu Asp Pro Leu Tyr Gly 
                245                 250                 255     


Arg His Tyr Pro Ala Asp Met Val His Asp Tyr Ile Ala Gln Gly Tyr 
            260                 265                 270         


Leu Pro Ser Gln Gly Leu Thr Phe Val Glu Ala Gly Asp Leu Asp Ala 
        275                 280                 285             


Ile Ala Thr Arg Thr Asp Phe Leu Gly Val Asn Tyr Tyr Thr Arg Glu 
    290                 295                 300                 


Val Val Arg Ser Gln Glu Ile Pro Glu Ser Glu Asn Ala Pro Arg Thr 
305                 310                 315                 320 


Val Leu Arg Ala Pro Gln Glu Glu Trp Thr Glu Met Gly Trp Glu Val 
                325                 330                 335     


Tyr Pro Glu Gly Leu Tyr Arg Leu Leu Asn Arg Leu His Phe Glu Tyr 
            340                 345                 350         


Gln Pro Arg Lys Leu Tyr Val Thr Glu Ser Gly Cys Ser Tyr Ser Asp 
        355                 360                 365             


Gly Pro Gly Pro Asn Gly Arg Ile Pro Asp Gln Arg Arg Ile Asn Tyr 
    370                 375                 380                 


Leu Arg Asp His Phe Ala Ala Ala His Gln Ala Ile Gln Cys Gly Val 
385                 390                 395                 400 


Pro Leu Ala Gly Tyr Phe Val Trp Ser Phe Met Asp Asn Phe Glu Trp 
                405                 410                 415     


Ala Lys Gly Tyr Thr Gln Arg Phe Gly Ile Val Trp Val Asp Tyr Gln 
            420                 425                 430         


Ser Gln Arg Arg Ile Pro Lys Asp Ser Ala Tyr Trp Tyr Arg Asp Val 
        435                 440                 445             


Val Ala Ala Asn Ala Val Gln Val Pro Asp 
    450                 455             


<210> 343
<211> 987
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 343
atggttgagc ctgccgatca gagtcatttt tcagatgctt ttcaggtaaa tcgcactctt     60

ggaaaaggca tcaatcttgg taacacactg gaggctccaa atgaaggcga gtggggattg    120

acaattcgcg aggagtattt tgatgaagtg aaacaagccg gatttgaatc cgtgcgtatt    180

ccgatacgat ggaatgctca tgctctggaa ggttttccat atacgataga tgaatctttt    240

tttgaccggg ttgatgaagt tattggctgg gcttttgatc gtgatcttgc agtcatgatt    300

aacattcatc actacaacga attgatggag cagccacagg atcaccggga tcgctttttg    360

aaactttggg agcaaattgc tgcgcactat aaagagtacc cggaagaact ggtattcgag    420

attttaaacg aaccccacga taatctgacc ccggctatct ggaatagctt tttggctgat    480

gctctcggta ttatacgcca aaccaatcca ggaagggtta ttgcagtcgg aacagctgaa    540

tggggcggtt tcgggagttt gcaggatctt gagctgcctg ataatgaccg ccagataatc    600

accaccgttc attactataa cccatttcat ttcacgcatc agggggcaga ttgggttgga    660

gatgaagcgg atcagtggct tggaaccgaa tgggatggag cagatcatga aaaagctgaa    720

gttgacagcg attttgactc tgtggaacag tgggcccgaa atcatgaccg gccaatacac    780

gtgggagagt tcggagcttt cagcgccgca gatgatttgt cacgtgaaca gtggacggca    840

tacgtacgtg agtcttcgga gaaccggcag tttagctggg cgtattggga gtttgggtca    900

gggttcggtg cctatgatcc cggttccgga gaatggcgtg aatatttact ccgggcgtta    960

atccccgaca gtccggtgat tgattaa                                        987

<210> 344
<211> 328
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (27)...(306)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (17)...(20)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (139)...(148)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<400> 344
Met Val Glu Pro Ala Asp Gln Ser His Phe Ser Asp Ala Phe Gln Val 
1               5                   10                  15      


Asn Arg Thr Leu Gly Lys Gly Ile Asn Leu Gly Asn Thr Leu Glu Ala 
            20                  25                  30          


Pro Asn Glu Gly Glu Trp Gly Leu Thr Ile Arg Glu Glu Tyr Phe Asp 
        35                  40                  45              


Glu Val Lys Gln Ala Gly Phe Glu Ser Val Arg Ile Pro Ile Arg Trp 
    50                  55                  60                  


Asn Ala His Ala Leu Glu Gly Phe Pro Tyr Thr Ile Asp Glu Ser Phe 
65                  70                  75                  80  


Phe Asp Arg Val Asp Glu Val Ile Gly Trp Ala Phe Asp Arg Asp Leu 
                85                  90                  95      


Ala Val Met Ile Asn Ile His His Tyr Asn Glu Leu Met Glu Gln Pro 
            100                 105                 110         


Gln Asp His Arg Asp Arg Phe Leu Lys Leu Trp Glu Gln Ile Ala Ala 
        115                 120                 125             


His Tyr Lys Glu Tyr Pro Glu Glu Leu Val Phe Glu Ile Leu Asn Glu 
    130                 135                 140                 


Pro His Asp Asn Leu Thr Pro Ala Ile Trp Asn Ser Phe Leu Ala Asp 
145                 150                 155                 160 


Ala Leu Gly Ile Ile Arg Gln Thr Asn Pro Gly Arg Val Ile Ala Val 
                165                 170                 175     


Gly Thr Ala Glu Trp Gly Gly Phe Gly Ser Leu Gln Asp Leu Glu Leu 
            180                 185                 190         


Pro Asp Asn Asp Arg Gln Ile Ile Thr Thr Val His Tyr Tyr Asn Pro 
        195                 200                 205             


Phe His Phe Thr His Gln Gly Ala Asp Trp Val Gly Asp Glu Ala Asp 
    210                 215                 220                 


Gln Trp Leu Gly Thr Glu Trp Asp Gly Ala Asp His Glu Lys Ala Glu 
225                 230                 235                 240 


Val Asp Ser Asp Phe Asp Ser Val Glu Gln Trp Ala Arg Asn His Asp 
                245                 250                 255     


Arg Pro Ile His Val Gly Glu Phe Gly Ala Phe Ser Ala Ala Asp Asp 
            260                 265                 270         


Leu Ser Arg Glu Gln Trp Thr Ala Tyr Val Arg Glu Ser Ser Glu Asn 
        275                 280                 285             


Arg Gln Phe Ser Trp Ala Tyr Trp Glu Phe Gly Ser Gly Phe Gly Ala 
    290                 295                 300                 


Tyr Asp Pro Gly Ser Gly Glu Trp Arg Glu Tyr Leu Leu Arg Ala Leu 
305                 310                 315                 320 


Ile Pro Asp Ser Pro Val Ile Asp 
                325             


<210> 345
<211> 1350
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 345
atgctgtcct atacgagtcc gttcccaaag aactttgtct ggggtgtggc gacggcggcg     60

ccgcagatcg agggcgctgc gcgagaagac ggaaagggcg aatcgatatg ggatcgcttt    120

tgccgcgtgc ccggaaaggt ccacaatggc gatactctcg atgttgcgtg cgaccactac    180

caccggttcc gggaggattt cgcgctcatg cgagacttgg gcgtgcgcca ctaccggctt    240

tcgcttgcct ggccccgcat attcccggac ggcgacggcg cattgaacca gcgcggagtg    300

gatttctacc accggctctt tgaggccatg atcgagcacg ggattacgcc ttgggtgacg    360

ctctttcact gggatttgcc gcaggcgctc gaggaccgcg gcggctggtg tgagcgtctc    420

accgtcgatg cattcgggcg ctacgctgac accgtggtga aggcgtttgg cgatcgcgtg    480

aagaattgga tcaccctgaa cgaaatccgc tgcttcacgt tgctcgctta cgatctctgc    540

atcaaggccc cgggccgcaa ggtctcgcgg gcgcagctca accagaccta tcatcacgcg    600

ctgatctgcc atgggcatgg cgtccgggcg gtccgcgaac acggcgggcg aggcgctcgc    660

gtcgggctta ccgacaacag cgacgtatgc gtgcccgtca ccgagaccgc gcccgacatc    720

attgcggcca gatcctggta tgcgtcgcga aatattcatc tgctcgatcc gatctatcgc    780

ggcgagtatg cgccggaata cctcgaacgc tgcggtgcgg acgcgcccca ggtggccgag    840

gacgatttcg cgctgatttc aatgccgacg gattttctcg ggctgaatgt atatacggcg    900

acctttgtgc gtgccgacgc ggagggcagg ccggaggaga ttaaactgcc gcggaattac    960

ccgcgcgcgg atagcgcgtg gttgaatatt gtgccccagt cgatgtactg ggccacacgg   1020

ctggcgcggg aaacctacgg cgtgagatca atctacatca ccgaaaacgg ctgcggctac   1080

gacgacgagc ccgtcgacgg cggcgaggtg ctcgacctgc atcgacgcga ttttctgcgc   1140

aaccaccttc gggaattgca tcgcgccata ggcgacggcg tgcccgttga cgggtatttt   1200

ctctggtcct tcatggacaa ctacgagtgg gaggacgggt atgcgcggcg gttcggcatc   1260

gttcacgtcg acttcgaaag ccagaaacgg actccaaaac tctcggcgcg ctattacgcg   1320

caggtaatga aagaaaaccg gatcctgtga                                    1350

<210> 346
<211> 449
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (4)...(448)
<223> Glycosyl hydrolase family 1

<400> 346
Met Leu Ser Tyr Thr Ser Pro Phe Pro Lys Asn Phe Val Trp Gly Val 
1               5                   10                  15      


Ala Thr Ala Ala Pro Gln Ile Glu Gly Ala Ala Arg Glu Asp Gly Lys 
            20                  25                  30          


Gly Glu Ser Ile Trp Asp Arg Phe Cys Arg Val Pro Gly Lys Val His 
        35                  40                  45              


Asn Gly Asp Thr Leu Asp Val Ala Cys Asp His Tyr His Arg Phe Arg 
    50                  55                  60                  


Glu Asp Phe Ala Leu Met Arg Asp Leu Gly Val Arg His Tyr Arg Leu 
65                  70                  75                  80  


Ser Leu Ala Trp Pro Arg Ile Phe Pro Asp Gly Asp Gly Ala Leu Asn 
                85                  90                  95      


Gln Arg Gly Val Asp Phe Tyr His Arg Leu Phe Glu Ala Met Ile Glu 
            100                 105                 110         


His Gly Ile Thr Pro Trp Val Thr Leu Phe His Trp Asp Leu Pro Gln 
        115                 120                 125             


Ala Leu Glu Asp Arg Gly Gly Trp Cys Glu Arg Leu Thr Val Asp Ala 
    130                 135                 140                 


Phe Gly Arg Tyr Ala Asp Thr Val Val Lys Ala Phe Gly Asp Arg Val 
145                 150                 155                 160 


Lys Asn Trp Ile Thr Leu Asn Glu Ile Arg Cys Phe Thr Leu Leu Ala 
                165                 170                 175     


Tyr Asp Leu Cys Ile Lys Ala Pro Gly Arg Lys Val Ser Arg Ala Gln 
            180                 185                 190         


Leu Asn Gln Thr Tyr His His Ala Leu Ile Cys His Gly His Gly Val 
        195                 200                 205             


Arg Ala Val Arg Glu His Gly Gly Arg Gly Ala Arg Val Gly Leu Thr 
    210                 215                 220                 


Asp Asn Ser Asp Val Cys Val Pro Val Thr Glu Thr Ala Pro Asp Ile 
225                 230                 235                 240 


Ile Ala Ala Arg Ser Trp Tyr Ala Ser Arg Asn Ile His Leu Leu Asp 
                245                 250                 255     


Pro Ile Tyr Arg Gly Glu Tyr Ala Pro Glu Tyr Leu Glu Arg Cys Gly 
            260                 265                 270         


Ala Asp Ala Pro Gln Val Ala Glu Asp Asp Phe Ala Leu Ile Ser Met 
        275                 280                 285             


Pro Thr Asp Phe Leu Gly Leu Asn Val Tyr Thr Ala Thr Phe Val Arg 
    290                 295                 300                 


Ala Asp Ala Glu Gly Arg Pro Glu Glu Ile Lys Leu Pro Arg Asn Tyr 
305                 310                 315                 320 


Pro Arg Ala Asp Ser Ala Trp Leu Asn Ile Val Pro Gln Ser Met Tyr 
                325                 330                 335     


Trp Ala Thr Arg Leu Ala Arg Glu Thr Tyr Gly Val Arg Ser Ile Tyr 
            340                 345                 350         


Ile Thr Glu Asn Gly Cys Gly Tyr Asp Asp Glu Pro Val Asp Gly Gly 
        355                 360                 365             


Glu Val Leu Asp Leu His Arg Arg Asp Phe Leu Arg Asn His Leu Arg 
    370                 375                 380                 


Glu Leu His Arg Ala Ile Gly Asp Gly Val Pro Val Asp Gly Tyr Phe 
385                 390                 395                 400 


Leu Trp Ser Phe Met Asp Asn Tyr Glu Trp Glu Asp Gly Tyr Ala Arg 
                405                 410                 415     


Arg Phe Gly Ile Val His Val Asp Phe Glu Ser Gln Lys Arg Thr Pro 
            420                 425                 430         


Lys Leu Ser Ala Arg Tyr Tyr Ala Gln Val Met Lys Glu Asn Arg Ile 
        435                 440                 445             


Leu 
    


<210> 347
<211> 1188
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 347
atgaccatca ccttccccga cgggttctgg tgggggacgg cgacggccgc ccaccaggtg     60

gagggcggca actggaacac cgactggtgg gcctacgagc acgccccggg cacccgctgc    120

gcggagccgt ccggcgatgc gtgcgaccac tggcaccgct acccggagga catcgccctc    180

ctcgccgcgc tcgggttcag tgcctaccgc ttctcggtgg aatgggctcg catcgagccc    240

gaggaagggc atttctcccg cgccaccctc gaccactacc ggcgcatgat cgcctgctgc    300

cgcgaccacg ggctggcccc ggtggtgacc ttccaccact tcaccacccc ccgctgggcc    360

gcggccgggg gctgctggtc cgacccggtc accgccgagc gcttcgcccg ttactgcgag    420

cgcaccgtgg ccgccctcgg cgacgagatc gcgatggcct gcacgatcaa cgagccgaac    480

atcgtggcca ccctcgggta cttcctcggc gagttcccgc cggccgtcgc cgaccccgac    540

cgctaccggc aggcgaacga cacgctgatc cgcgcccatc gcctcgccta cgaggcgctg    600

aaggccgggc ccggcgagtt ccccgtcggc ctcaccctgt cgatggccga gttcgtcgcc    660

gagcccggcg gcgaggccca cctcgcccag gtccggcaca cgatggagga catcttcctg    720

gaggccgccc ggggcgacga cttcatcggg gtgcagacct acagccgcat gcgcttcggt    780

cccgactcgc cgatcccgct cgggccggcc gagggcgtcg aggtcgtcca gatggggtac    840

gagtactggc cgtgggcgct cgaggcgacg atccggcgcg ccgccgaggt caccggcacg    900

gcggtccacg tcaccgagaa cggcatcggg accgccgacg acacgcagcg ggtcgcctac    960

gtcaccgagg ccctccgggg gctgcggcgc tgcctcgacg acggcatcga cgtccgcagc   1020

tacttctact ggacgctgct cgacaacttc gagtggacgc gcggctacgt gccgacgttc   1080

gggctcgtcg ccgtcgaccg caccacccag cgccggtcgg tgaagccgag cgcggtgtgg   1140

ctcggcgagg tcgcccgcac gaaccgcctc gagctcccgg accgctga                1188

<210> 348
<211> 395
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(390)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (9)...(23)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (188)...(191)
<223> N-glycosylation site. Prosite id = PS00001

<400> 348
Met Thr Ile Thr Phe Pro Asp Gly Phe Trp Trp Gly Thr Ala Thr Ala 
1               5                   10                  15      


Ala His Gln Val Glu Gly Gly Asn Trp Asn Thr Asp Trp Trp Ala Tyr 
            20                  25                  30          


Glu His Ala Pro Gly Thr Arg Cys Ala Glu Pro Ser Gly Asp Ala Cys 
        35                  40                  45              


Asp His Trp His Arg Tyr Pro Glu Asp Ile Ala Leu Leu Ala Ala Leu 
    50                  55                  60                  


Gly Phe Ser Ala Tyr Arg Phe Ser Val Glu Trp Ala Arg Ile Glu Pro 
65                  70                  75                  80  


Glu Glu Gly His Phe Ser Arg Ala Thr Leu Asp His Tyr Arg Arg Met 
                85                  90                  95      


Ile Ala Cys Cys Arg Asp His Gly Leu Ala Pro Val Val Thr Phe His 
            100                 105                 110         


His Phe Thr Thr Pro Arg Trp Ala Ala Ala Gly Gly Cys Trp Ser Asp 
        115                 120                 125             


Pro Val Thr Ala Glu Arg Phe Ala Arg Tyr Cys Glu Arg Thr Val Ala 
    130                 135                 140                 


Ala Leu Gly Asp Glu Ile Ala Met Ala Cys Thr Ile Asn Glu Pro Asn 
145                 150                 155                 160 


Ile Val Ala Thr Leu Gly Tyr Phe Leu Gly Glu Phe Pro Pro Ala Val 
                165                 170                 175     


Ala Asp Pro Asp Arg Tyr Arg Gln Ala Asn Asp Thr Leu Ile Arg Ala 
            180                 185                 190         


His Arg Leu Ala Tyr Glu Ala Leu Lys Ala Gly Pro Gly Glu Phe Pro 
        195                 200                 205             


Val Gly Leu Thr Leu Ser Met Ala Glu Phe Val Ala Glu Pro Gly Gly 
    210                 215                 220                 


Glu Ala His Leu Ala Gln Val Arg His Thr Met Glu Asp Ile Phe Leu 
225                 230                 235                 240 


Glu Ala Ala Arg Gly Asp Asp Phe Ile Gly Val Gln Thr Tyr Ser Arg 
                245                 250                 255     


Met Arg Phe Gly Pro Asp Ser Pro Ile Pro Leu Gly Pro Ala Glu Gly 
            260                 265                 270         


Val Glu Val Val Gln Met Gly Tyr Glu Tyr Trp Pro Trp Ala Leu Glu 
        275                 280                 285             


Ala Thr Ile Arg Arg Ala Ala Glu Val Thr Gly Thr Ala Val His Val 
    290                 295                 300                 


Thr Glu Asn Gly Ile Gly Thr Ala Asp Asp Thr Gln Arg Val Ala Tyr 
305                 310                 315                 320 


Val Thr Glu Ala Leu Arg Gly Leu Arg Arg Cys Leu Asp Asp Gly Ile 
                325                 330                 335     


Asp Val Arg Ser Tyr Phe Tyr Trp Thr Leu Leu Asp Asn Phe Glu Trp 
            340                 345                 350         


Thr Arg Gly Tyr Val Pro Thr Phe Gly Leu Val Ala Val Asp Arg Thr 
        355                 360                 365             


Thr Gln Arg Arg Ser Val Lys Pro Ser Ala Val Trp Leu Gly Glu Val 
    370                 375                 380                 


Ala Arg Thr Asn Arg Leu Glu Leu Pro Asp Arg 
385                 390                 395 


<210> 349
<211> 1323
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 349
atgtcaacct ataaatttcc gcacaacttt ttttggggag ccgcaaccgc gtcttatcag     60

atcgaaggcg catggaacga ggatggcaaa ggcgaatcca tttgggatcg cttcagccat    120

acgcccggaa aggtcaccaa tgccgatacc ggtgacatcg cctgtgacca ctatcaccgt    180

tgggaggaag atatcgccct tatgcgccaa cttgggttga aggcgtaccg cttttccact    240

tcatggcccc gtgtgatccc ggcgggccgc agacgggtga atgtcaaagg gctggatttc    300

tacgatcgcc tggtggatgg tctgtgcgcc gcgaacatcg aaccgttcct caccctgtat    360

cactgggacc tgccgcaggc tcttcaagac gaaggcggct gggataatcg caacaccgcc    420

catgcctttg ccgattatgc cgcattgatg gtgaaacgac ttggcgaccg tatccgctat    480

tggacgacgt tcaacgaacc cagcgttgtg gcgttcaatg gtcattactc aggctcgcac    540

gccccgggca ttcaagatgc ccgtgttacc cgccaggtgg tgcatcattt gctggtggcg    600

catgggttgg ctgtgcaggc gatccgcggc gcaaactcca aagtggatgt gggcatcgtg    660

cttaatttat ggcccgccga acccgattcg gactcccccg aagatgccgc cgccgccgaa    720

gccgcctgga accggcacga gaccctgttc cttgacccca tctttaaggc gcattatccc    780

gtatctgccc ttgatgcgat tggggaggat atgccccgca tccacgacgg cgatctggcg    840

ttgatctctc aggaattgga ttttgtcggc atcaactatt actcccgcca tgtggtcagt    900

gccacaaaag aaataggcag gcttcccgaa tcggaataca ctgaaatggg ctgggaagta    960

tgcgcccccg cactccgccg cctgctggtc aagatccata acgattaccg tttgccgccc   1020

atctatatca ccgaaaacgg atcggcattc aaggacgaag ttaacgcaga cggaaaggtt   1080

catgacccgc ggcggttgga ttacctgaaa caacacctga ttcaactttg ccttgccatg   1140

caggacggcg tggatgtgcg cggctacatg gcttggtccc tgctggataa tttcgagtgg   1200

ggtcacggct tttccaagcg ctttggcttg gtccatgtgg attacgagag ccagaagcgg   1260

attattaaag actcgggtga atggtatgca agtgtgatac ggaagaacga ggttgttgaa   1320

taa                                                                 1323

<210> 350
<211> 440
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (2)...(438)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (351)...(354)
<223> N-glycosylation site. Prosite id = PS00001

<400> 350
Met Ser Thr Tyr Lys Phe Pro His Asn Phe Phe Trp Gly Ala Ala Thr 
1               5                   10                  15      


Ala Ser Tyr Gln Ile Glu Gly Ala Trp Asn Glu Asp Gly Lys Gly Glu 
            20                  25                  30          


Ser Ile Trp Asp Arg Phe Ser His Thr Pro Gly Lys Val Thr Asn Ala 
        35                  40                  45              


Asp Thr Gly Asp Ile Ala Cys Asp His Tyr His Arg Trp Glu Glu Asp 
    50                  55                  60                  


Ile Ala Leu Met Arg Gln Leu Gly Leu Lys Ala Tyr Arg Phe Ser Thr 
65                  70                  75                  80  


Ser Trp Pro Arg Val Ile Pro Ala Gly Arg Arg Arg Val Asn Val Lys 
                85                  90                  95      


Gly Leu Asp Phe Tyr Asp Arg Leu Val Asp Gly Leu Cys Ala Ala Asn 
            100                 105                 110         


Ile Glu Pro Phe Leu Thr Leu Tyr His Trp Asp Leu Pro Gln Ala Leu 
        115                 120                 125             


Gln Asp Glu Gly Gly Trp Asp Asn Arg Asn Thr Ala His Ala Phe Ala 
    130                 135                 140                 


Asp Tyr Ala Ala Leu Met Val Lys Arg Leu Gly Asp Arg Ile Arg Tyr 
145                 150                 155                 160 


Trp Thr Thr Phe Asn Glu Pro Ser Val Val Ala Phe Asn Gly His Tyr 
                165                 170                 175     


Ser Gly Ser His Ala Pro Gly Ile Gln Asp Ala Arg Val Thr Arg Gln 
            180                 185                 190         


Val Val His His Leu Leu Val Ala His Gly Leu Ala Val Gln Ala Ile 
        195                 200                 205             


Arg Gly Ala Asn Ser Lys Val Asp Val Gly Ile Val Leu Asn Leu Trp 
    210                 215                 220                 


Pro Ala Glu Pro Asp Ser Asp Ser Pro Glu Asp Ala Ala Ala Ala Glu 
225                 230                 235                 240 


Ala Ala Trp Asn Arg His Glu Thr Leu Phe Leu Asp Pro Ile Phe Lys 
                245                 250                 255     


Ala His Tyr Pro Val Ser Ala Leu Asp Ala Ile Gly Glu Asp Met Pro 
            260                 265                 270         


Arg Ile His Asp Gly Asp Leu Ala Leu Ile Ser Gln Glu Leu Asp Phe 
        275                 280                 285             


Val Gly Ile Asn Tyr Tyr Ser Arg His Val Val Ser Ala Thr Lys Glu 
    290                 295                 300                 


Ile Gly Arg Leu Pro Glu Ser Glu Tyr Thr Glu Met Gly Trp Glu Val 
305                 310                 315                 320 


Cys Ala Pro Ala Leu Arg Arg Leu Leu Val Lys Ile His Asn Asp Tyr 
                325                 330                 335     


Arg Leu Pro Pro Ile Tyr Ile Thr Glu Asn Gly Ser Ala Phe Lys Asp 
            340                 345                 350         


Glu Val Asn Ala Asp Gly Lys Val His Asp Pro Arg Arg Leu Asp Tyr 
        355                 360                 365             


Leu Lys Gln His Leu Ile Gln Leu Cys Leu Ala Met Gln Asp Gly Val 
    370                 375                 380                 


Asp Val Arg Gly Tyr Met Ala Trp Ser Leu Leu Asp Asn Phe Glu Trp 
385                 390                 395                 400 


Gly His Gly Phe Ser Lys Arg Phe Gly Leu Val His Val Asp Tyr Glu 
                405                 410                 415     


Ser Gln Lys Arg Ile Ile Lys Asp Ser Gly Glu Trp Tyr Ala Ser Val 
            420                 425                 430         


Ile Arg Lys Asn Glu Val Val Glu 
        435                 440 


<210> 351
<211> 1389
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 351
atgagcgctc cgagtcccgc ccgccccgtg tcctttcctc cccgcttcgt gtggggagcc     60

gcggccgcat cctatcaaat cgagggcgcc gtccgggagg acggcaaggg cccttcggtg    120

tgggacatgt tctgcgagaa gccgggagcc gtcttcgagg ggcacgacgg ggcggtggct    180

tgcgatcact accaccgtta ccgggaagac gtggccctga tgcggcagat tgggctccag    240

gcttaccgcc tgagcgtgtg ctggcccagg gtgctgcccg aggggaccgg gcagcccaac    300

gagaaggggc tcgacttcta ctcccggctc gtcgacgcct tgctcgaggc ggggatcacg    360

ccttgggtca ccctttttca ctgggactac ccactagccc tatatcaccg gggaggctgg    420

ctcaatcggg atagctcaga ctggttcggc gagtacgcgg gtctgattgc ggagcgcctc    480

tccgatcggg tgagccactt cttcacccag aacgagcccc aggtgtacat cggcttcggg    540

cacctcgagg ggaaacacgc gccgggcgat acccttcccc tgtcgcagat gctgctggcc    600

ggtcaccaca gcctgctcgc ccatggaaag gccgtgcagg cgctgcgcgc ccacggcaag    660

cagcagctgc gggttggata cgctccggtg gggatgccgc tgcatccggt cagcgagtcc    720

gccgaagacg tggcggctgc acgcaccgcc actttccgcg tccgagagaa gaattcctgg    780

aacaacgctt ggtggatgga cccggtgtac ctcggtgagt accccgccca agggctcgag    840

ttctacgggc gagacgtccc cgcgatccgg tccggagaca tggaactcat ccggcaaccc    900

ttggactttt tcggcgtcaa catctaccag agcacgcccg tgcgcgccgc gggggcgccc    960

caggggttcg aggtcgtccg gcatccgacg ggccacccca tcaccgcgtt caactggccg   1020

gttacgccac aggccttgta ttgggggccg cggttcttct acgagcgcta tggcaagccc   1080

atcgtcatta cggaaaacgg gctttcctgc cgagacgtga tcgcccttga cggcaaggtg   1140

cacgatccgt cccgcatcga cttcaccacg cgctacctgc gcgagctcca ccgcgccatc   1200

gccgaaggca acgaggtgga gggctacttc cactggtcca tcatggacaa cttcgaatgg   1260

gctgccggat accgagaacg cttcgggctc gttcacgtgg attacgagac cctggtgagg   1320

acacccaagg actctgcggc gtggtaccgc caggtcatcc agagcaacgg ggccgtgctg   1380

ttcgattga                                                           1389

<210> 352
<211> 462
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (8)...(458)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (16)...(30)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (366)...(374)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 352
Met Ser Ala Pro Ser Pro Ala Arg Pro Val Ser Phe Pro Pro Arg Phe 
1               5                   10                  15      


Val Trp Gly Ala Ala Ala Ala Ser Tyr Gln Ile Glu Gly Ala Val Arg 
            20                  25                  30          


Glu Asp Gly Lys Gly Pro Ser Val Trp Asp Met Phe Cys Glu Lys Pro 
        35                  40                  45              


Gly Ala Val Phe Glu Gly His Asp Gly Ala Val Ala Cys Asp His Tyr 
    50                  55                  60                  


His Arg Tyr Arg Glu Asp Val Ala Leu Met Arg Gln Ile Gly Leu Gln 
65                  70                  75                  80  


Ala Tyr Arg Leu Ser Val Cys Trp Pro Arg Val Leu Pro Glu Gly Thr 
                85                  90                  95      


Gly Gln Pro Asn Glu Lys Gly Leu Asp Phe Tyr Ser Arg Leu Val Asp 
            100                 105                 110         


Ala Leu Leu Glu Ala Gly Ile Thr Pro Trp Val Thr Leu Phe His Trp 
        115                 120                 125             


Asp Tyr Pro Leu Ala Leu Tyr His Arg Gly Gly Trp Leu Asn Arg Asp 
    130                 135                 140                 


Ser Ser Asp Trp Phe Gly Glu Tyr Ala Gly Leu Ile Ala Glu Arg Leu 
145                 150                 155                 160 


Ser Asp Arg Val Ser His Phe Phe Thr Gln Asn Glu Pro Gln Val Tyr 
                165                 170                 175     


Ile Gly Phe Gly His Leu Glu Gly Lys His Ala Pro Gly Asp Thr Leu 
            180                 185                 190         


Pro Leu Ser Gln Met Leu Leu Ala Gly His His Ser Leu Leu Ala His 
        195                 200                 205             


Gly Lys Ala Val Gln Ala Leu Arg Ala His Gly Lys Gln Gln Leu Arg 
    210                 215                 220                 


Val Gly Tyr Ala Pro Val Gly Met Pro Leu His Pro Val Ser Glu Ser 
225                 230                 235                 240 


Ala Glu Asp Val Ala Ala Ala Arg Thr Ala Thr Phe Arg Val Arg Glu 
                245                 250                 255     


Lys Asn Ser Trp Asn Asn Ala Trp Trp Met Asp Pro Val Tyr Leu Gly 
            260                 265                 270         


Glu Tyr Pro Ala Gln Gly Leu Glu Phe Tyr Gly Arg Asp Val Pro Ala 
        275                 280                 285             


Ile Arg Ser Gly Asp Met Glu Leu Ile Arg Gln Pro Leu Asp Phe Phe 
    290                 295                 300                 


Gly Val Asn Ile Tyr Gln Ser Thr Pro Val Arg Ala Ala Gly Ala Pro 
305                 310                 315                 320 


Gln Gly Phe Glu Val Val Arg His Pro Thr Gly His Pro Ile Thr Ala 
                325                 330                 335     


Phe Asn Trp Pro Val Thr Pro Gln Ala Leu Tyr Trp Gly Pro Arg Phe 
            340                 345                 350         


Phe Tyr Glu Arg Tyr Gly Lys Pro Ile Val Ile Thr Glu Asn Gly Leu 
        355                 360                 365             


Ser Cys Arg Asp Val Ile Ala Leu Asp Gly Lys Val His Asp Pro Ser 
    370                 375                 380                 


Arg Ile Asp Phe Thr Thr Arg Tyr Leu Arg Glu Leu His Arg Ala Ile 
385                 390                 395                 400 


Ala Glu Gly Asn Glu Val Glu Gly Tyr Phe His Trp Ser Ile Met Asp 
                405                 410                 415     


Asn Phe Glu Trp Ala Ala Gly Tyr Arg Glu Arg Phe Gly Leu Val His 
            420                 425                 430         


Val Asp Tyr Glu Thr Leu Val Arg Thr Pro Lys Asp Ser Ala Ala Trp 
        435                 440                 445             


Tyr Arg Gln Val Ile Gln Ser Asn Gly Ala Val Leu Phe Asp 
    450                 455                 460         


<210> 353
<211> 1098
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 353
atgactcgga ggtctatcgt gcgttcttct tccaacaagt ggcttgtcct tgccggtgcg     60

gcgctgctcg cctgcaccgc cctcgggtgc aagaaaaaag gcgagagcgg tgacgtcgcc    120

tcggccccgg ggcaggccca ggcgggcggc aagcagccgt ttcccgacga tgcgccgatc    180

accgaaccgc ccgctccgcc ccctcgtagc ggcaatcctc tggtgggcgc caagctcttc    240

gtcgacccgg aatctttggc catgttgcag gcgaacaagc tgcggcgcac cgacccggag    300

aaggcggcga ttttggatcg catcgcccag cagccccagg ctttgtggat gggcgagtgg    360

aacacgaaca tcttccgcgc ggtcgagcat ttcgtggctc gcgccaaggc ggagggcgcc    420

gtgcccgtca tgatcgccta caacatcccc caccgcgact gcgggcagta ctctcagggt    480

gggctttcct ccaaggaggc ttaccagcgc tggattcgga acgtcgccgc ggggattggc    540

agcgatgcag cggtcgtcgt gctcgagccc gacgcgctcg gccacttcca ggagtgtttg    600

accgaggagc agagcgccga gcgcatgttc ctgctcagcg acgccgtcaa ggtgctgcgc    660

caaaatccga agacggccgt gtacctggat gccgggcacg cgcgctgggt gccggtggag    720

gagatggccg agcgcctcaa gctcgcgggc atcgagcacg cccatggctt ttcgctcaac    780

acctcgaact acgtgggcac cgaggagaac gccgcttacg gccacaagct cgtcgaggcc    840

ctgggtggga acgtgcgctt cgtcatcgac acgagccgca atggggcggg cccctacgag    900

gaggccaaga acgccgagga gagctggtgc aacccgcccg gtcgcaagat cggcaagccg    960

ccgaccaccg agacggggga tcccctcatc gacggattcc tttggctgaa gcgcccgggc   1020

gagtcggacg gtcagtgcaa cggcgggccc aaggccggtg tgttctggct ggagcaggct   1080

ctccagcagg cccagtaa                                                 1098

<210> 354
<211> 365
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (81)...(358)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (187)...(196)
<223> Glycosyl hydrolases family 6 signature 2. Prosite id = PS00656

<220> 
<221> SITE
<222> (263)...(266)
<223> N-glycosylation site. Prosite id = PS00001

<400> 354
Met Thr Arg Arg Ser Ile Val Arg Ser Ser Ser Asn Lys Trp Leu Val 
1               5                   10                  15      


Leu Ala Gly Ala Ala Leu Leu Ala Cys Thr Ala Leu Gly Cys Lys Lys 
            20                  25                  30          


Lys Gly Glu Ser Gly Asp Val Ala Ser Ala Pro Gly Gln Ala Gln Ala 
        35                  40                  45              


Gly Gly Lys Gln Pro Phe Pro Asp Asp Ala Pro Ile Thr Glu Pro Pro 
    50                  55                  60                  


Ala Pro Pro Pro Arg Ser Gly Asn Pro Leu Val Gly Ala Lys Leu Phe 
65                  70                  75                  80  


Val Asp Pro Glu Ser Leu Ala Met Leu Gln Ala Asn Lys Leu Arg Arg 
                85                  90                  95      


Thr Asp Pro Glu Lys Ala Ala Ile Leu Asp Arg Ile Ala Gln Gln Pro 
            100                 105                 110         


Gln Ala Leu Trp Met Gly Glu Trp Asn Thr Asn Ile Phe Arg Ala Val 
        115                 120                 125             


Glu His Phe Val Ala Arg Ala Lys Ala Glu Gly Ala Val Pro Val Met 
    130                 135                 140                 


Ile Ala Tyr Asn Ile Pro His Arg Asp Cys Gly Gln Tyr Ser Gln Gly 
145                 150                 155                 160 


Gly Leu Ser Ser Lys Glu Ala Tyr Gln Arg Trp Ile Arg Asn Val Ala 
                165                 170                 175     


Ala Gly Ile Gly Ser Asp Ala Ala Val Val Val Leu Glu Pro Asp Ala 
            180                 185                 190         


Leu Gly His Phe Gln Glu Cys Leu Thr Glu Glu Gln Ser Ala Glu Arg 
        195                 200                 205             


Met Phe Leu Leu Ser Asp Ala Val Lys Val Leu Arg Gln Asn Pro Lys 
    210                 215                 220                 


Thr Ala Val Tyr Leu Asp Ala Gly His Ala Arg Trp Val Pro Val Glu 
225                 230                 235                 240 


Glu Met Ala Glu Arg Leu Lys Leu Ala Gly Ile Glu His Ala His Gly 
                245                 250                 255     


Phe Ser Leu Asn Thr Ser Asn Tyr Val Gly Thr Glu Glu Asn Ala Ala 
            260                 265                 270         


Tyr Gly His Lys Leu Val Glu Ala Leu Gly Gly Asn Val Arg Phe Val 
        275                 280                 285             


Ile Asp Thr Ser Arg Asn Gly Ala Gly Pro Tyr Glu Glu Ala Lys Asn 
    290                 295                 300                 


Ala Glu Glu Ser Trp Cys Asn Pro Pro Gly Arg Lys Ile Gly Lys Pro 
305                 310                 315                 320 


Pro Thr Thr Glu Thr Gly Asp Pro Leu Ile Asp Gly Phe Leu Trp Leu 
                325                 330                 335     


Lys Arg Pro Gly Glu Ser Asp Gly Gln Cys Asn Gly Gly Pro Lys Ala 
            340                 345                 350         


Gly Val Phe Trp Leu Glu Gln Ala Leu Gln Gln Ala Gln 
        355                 360                 365 


<210> 355
<211> 1347
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 355
atgactgacc atcgttttcc aaaaggattc atctggggaa ccgctacggc gtctttccag     60

attgaaggcg ccacccgcga agatggccgg ggcgaatcca tctgggaccg cttctgcgcc    120

acgccgggga aaattgtcac gggcgaaacc ggcgatcctg cctgcgactc ctatcatcgt    180

taccctgaag acatcgccct gatgaaggct atgtcgctca atggttaccg cttttcaatc    240

gcctggcctc gcgtcattcc tgacggagac ggtaaagtct gtcaggccgg gctcgactac    300

tacgatcgtg tggtagatgc tctcctggcg gagaatatcc aaccttttat caccctgtac    360

cactgggacc tgccccaggc attacaggat cggggtggct ggggcaaccg tgccacggtt    420

gaggcgttca ctcgctacgt agatattgtg gtttctcgcc tgggtgaccg cgtaaagtac    480

tggatgacac acaacgaacc ctggtgtgta tccattttga gccatgagct tggtgaacat    540

gcccccgggt tgaaggaccg aaaactggcc ctccaggtgg cgcaccatgt cctcgtttct    600

cacggcctgg ccgtgcccat catccgccag cgttgtaaag aggcgcaggt tggcatcgtg    660

ttgaattttt cacctgctta cccggccacc gatagcctgg ccgaccagat ggccacccgt    720

cagcaccacg cccggtttaa cctctggttc ctcgatccca tcgccgggcg cggctacccg    780

caggatgcct gggaagggta cggagccgat gttcccgcca tgaggcctga tgacatgcag    840

atcatcgccg cccccatcga cttcctgggc gtcaatttct acagtcgggc ggtctgccac    900

gatccggccg ggggcgaagg ttcccgggtg ctcaatgtgc gcagtaaaac cgaggccacc    960

gatcgagact gggagattta ccctcaggcg ctctacgatt tactcatctg gatccacaat   1020

ggataccagt tcagagatat ttacattacc gagaatggcg cctcatacaa cgatgtggtc   1080

tccccggatg ggaaagtgca cgatcctaaa cgtctggact atctgaaacg ccatctggcc   1140

atggctctgc gggccatcga agcgggcgtt ccactgcgtg gttatttctg ctggagcttg   1200

atggacaact tcgaatgggc catgggcacc agcagccgat tcgggttggc ctacaccgac   1260

ttcactaccc agaagcgtat tctcaaagac agtgggctct ggtttggcga agtggcacgg   1320

gcaaacgcct taatcgacct tccctga                                       1347

<210> 356
<211> 448
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (2)...(444)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (352)...(360)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 356
Met Thr Asp His Arg Phe Pro Lys Gly Phe Ile Trp Gly Thr Ala Thr 
1               5                   10                  15      


Ala Ser Phe Gln Ile Glu Gly Ala Thr Arg Glu Asp Gly Arg Gly Glu 
            20                  25                  30          


Ser Ile Trp Asp Arg Phe Cys Ala Thr Pro Gly Lys Ile Val Thr Gly 
        35                  40                  45              


Glu Thr Gly Asp Pro Ala Cys Asp Ser Tyr His Arg Tyr Pro Glu Asp 
    50                  55                  60                  


Ile Ala Leu Met Lys Ala Met Ser Leu Asn Gly Tyr Arg Phe Ser Ile 
65                  70                  75                  80  


Ala Trp Pro Arg Val Ile Pro Asp Gly Asp Gly Lys Val Cys Gln Ala 
                85                  90                  95      


Gly Leu Asp Tyr Tyr Asp Arg Val Val Asp Ala Leu Leu Ala Glu Asn 
            100                 105                 110         


Ile Gln Pro Phe Ile Thr Leu Tyr His Trp Asp Leu Pro Gln Ala Leu 
        115                 120                 125             


Gln Asp Arg Gly Gly Trp Gly Asn Arg Ala Thr Val Glu Ala Phe Thr 
    130                 135                 140                 


Arg Tyr Val Asp Ile Val Val Ser Arg Leu Gly Asp Arg Val Lys Tyr 
145                 150                 155                 160 


Trp Met Thr His Asn Glu Pro Trp Cys Val Ser Ile Leu Ser His Glu 
                165                 170                 175     


Leu Gly Glu His Ala Pro Gly Leu Lys Asp Arg Lys Leu Ala Leu Gln 
            180                 185                 190         


Val Ala His His Val Leu Val Ser His Gly Leu Ala Val Pro Ile Ile 
        195                 200                 205             


Arg Gln Arg Cys Lys Glu Ala Gln Val Gly Ile Val Leu Asn Phe Ser 
    210                 215                 220                 


Pro Ala Tyr Pro Ala Thr Asp Ser Leu Ala Asp Gln Met Ala Thr Arg 
225                 230                 235                 240 


Gln His His Ala Arg Phe Asn Leu Trp Phe Leu Asp Pro Ile Ala Gly 
                245                 250                 255     


Arg Gly Tyr Pro Gln Asp Ala Trp Glu Gly Tyr Gly Ala Asp Val Pro 
            260                 265                 270         


Ala Met Arg Pro Asp Asp Met Gln Ile Ile Ala Ala Pro Ile Asp Phe 
        275                 280                 285             


Leu Gly Val Asn Phe Tyr Ser Arg Ala Val Cys His Asp Pro Ala Gly 
    290                 295                 300                 


Gly Glu Gly Ser Arg Val Leu Asn Val Arg Ser Lys Thr Glu Ala Thr 
305                 310                 315                 320 


Asp Arg Asp Trp Glu Ile Tyr Pro Gln Ala Leu Tyr Asp Leu Leu Ile 
                325                 330                 335     


Trp Ile His Asn Gly Tyr Gln Phe Arg Asp Ile Tyr Ile Thr Glu Asn 
            340                 345                 350         


Gly Ala Ser Tyr Asn Asp Val Val Ser Pro Asp Gly Lys Val His Asp 
        355                 360                 365             


Pro Lys Arg Leu Asp Tyr Leu Lys Arg His Leu Ala Met Ala Leu Arg 
    370                 375                 380                 


Ala Ile Glu Ala Gly Val Pro Leu Arg Gly Tyr Phe Cys Trp Ser Leu 
385                 390                 395                 400 


Met Asp Asn Phe Glu Trp Ala Met Gly Thr Ser Ser Arg Phe Gly Leu 
                405                 410                 415     


Ala Tyr Thr Asp Phe Thr Thr Gln Lys Arg Ile Leu Lys Asp Ser Gly 
            420                 425                 430         


Leu Trp Phe Gly Glu Val Ala Arg Ala Asn Ala Leu Ile Asp Leu Pro 
        435                 440                 445             


<210> 357
<211> 1404
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 357
atgaatcatt ccctttcatt tccgccatcc tttgtatggg gcgcggcaac cgcaagctac     60

caactggaag gatcaaccca aggcgtggac ggctgcgccg agtccgtctg ggatatgcac    120

tgccgaagat ccggcgcgat caaggacggc tcgaacggat tcgtcgcctg cgatcactac    180

catcgctatc gcgaggatgt ggcgctcatg aacgagcttg gcttgaatgc ctatcgattc    240

tcaatcatgt ggccccgcgt catgcccgaa ggcaccggcg cggtgaacga gaagggcatg    300

gatttctacg atcggttggt tgatgaactg ctcgccgccg gcatcacacc ttgggttact    360

ttgttccact gggactttcc cctagccttg ttccaacgcg gtggctggct gaatgcggat    420

tccccgcaat ggtttgagga ttacactcgg gaagtggtta aacgcttgtc ggatcgtgtg    480

catcactggc taacgctcaa cgaaccggcg tgcttcattg agtttggcca ccgtaccggc    540

atgcatgcac ccggcttgca actggcggac aaggaagcct gccgggtctg gcaccatgcc    600

atgctggccc acggtcgcgc cgttcgcgct attcgccagg aatccgtgca tccatcaccc    660

caggtcggct acgcgccggt cttccgcact accatcccgg acactgaaga tcctgccgac    720

atcgaagcgg cccggacctc gatgtttgct catcaggccg gcaacctgtt cgatacgcgg    780

tggaacctcg acccctgctt tcggggcgcg tatccggaga tcatgatgca gtattggggc    840

gatgccgcgc cgcgcatcca ggacggcgac atggagttga tccgtcagga actcgatttt    900

ctcggcctga atatttacca gtccgagcgc attcgggccg gtgcggatgg cgcacccgag    960

gtggtgccat accctgcgga ttatccgcgc aaccagctcg gttggcccat cacgccggag   1020

gccctgcgct gggcgaccct ctttctcttt gaggagtacg ggaaacccct gatcatcaca   1080

gaaaacggaa tcaccctcga cgacaagccc aatgcagacg gcgaggtgaa tgatgtccag   1140

cggatcgctt ttctgaatga ctatcttagc ggtctccagc gcagcgtgga cgacggcatc   1200

cctgtactgg gctatttcca ctggtcgctg tgcgacaact ttgagtgggc agaaggctat   1260

gtccctcgct tcggcctgat ccatgtggac tatgccagtc aacgcagaac catcaaggcc   1320

tcaggacggt tttaccgcga catcattcgg ggccagacag ccacgccctg catcgcccaa   1380

tccagtcagc cggaaacaac ctaa                                          1404

<210> 358
<211> 467
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(454)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (11)...(25)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<400> 358
Met Asn His Ser Leu Ser Phe Pro Pro Ser Phe Val Trp Gly Ala Ala 
1               5                   10                  15      


Thr Ala Ser Tyr Gln Leu Glu Gly Ser Thr Gln Gly Val Asp Gly Cys 
            20                  25                  30          


Ala Glu Ser Val Trp Asp Met His Cys Arg Arg Ser Gly Ala Ile Lys 
        35                  40                  45              


Asp Gly Ser Asn Gly Phe Val Ala Cys Asp His Tyr His Arg Tyr Arg 
    50                  55                  60                  


Glu Asp Val Ala Leu Met Asn Glu Leu Gly Leu Asn Ala Tyr Arg Phe 
65                  70                  75                  80  


Ser Ile Met Trp Pro Arg Val Met Pro Glu Gly Thr Gly Ala Val Asn 
                85                  90                  95      


Glu Lys Gly Met Asp Phe Tyr Asp Arg Leu Val Asp Glu Leu Leu Ala 
            100                 105                 110         


Ala Gly Ile Thr Pro Trp Val Thr Leu Phe His Trp Asp Phe Pro Leu 
        115                 120                 125             


Ala Leu Phe Gln Arg Gly Gly Trp Leu Asn Ala Asp Ser Pro Gln Trp 
    130                 135                 140                 


Phe Glu Asp Tyr Thr Arg Glu Val Val Lys Arg Leu Ser Asp Arg Val 
145                 150                 155                 160 


His His Trp Leu Thr Leu Asn Glu Pro Ala Cys Phe Ile Glu Phe Gly 
                165                 170                 175     


His Arg Thr Gly Met His Ala Pro Gly Leu Gln Leu Ala Asp Lys Glu 
            180                 185                 190         


Ala Cys Arg Val Trp His His Ala Met Leu Ala His Gly Arg Ala Val 
        195                 200                 205             


Arg Ala Ile Arg Gln Glu Ser Val His Pro Ser Pro Gln Val Gly Tyr 
    210                 215                 220                 


Ala Pro Val Phe Arg Thr Thr Ile Pro Asp Thr Glu Asp Pro Ala Asp 
225                 230                 235                 240 


Ile Glu Ala Ala Arg Thr Ser Met Phe Ala His Gln Ala Gly Asn Leu 
                245                 250                 255     


Phe Asp Thr Arg Trp Asn Leu Asp Pro Cys Phe Arg Gly Ala Tyr Pro 
            260                 265                 270         


Glu Ile Met Met Gln Tyr Trp Gly Asp Ala Ala Pro Arg Ile Gln Asp 
        275                 280                 285             


Gly Asp Met Glu Leu Ile Arg Gln Glu Leu Asp Phe Leu Gly Leu Asn 
    290                 295                 300                 


Ile Tyr Gln Ser Glu Arg Ile Arg Ala Gly Ala Asp Gly Ala Pro Glu 
305                 310                 315                 320 


Val Val Pro Tyr Pro Ala Asp Tyr Pro Arg Asn Gln Leu Gly Trp Pro 
                325                 330                 335     


Ile Thr Pro Glu Ala Leu Arg Trp Ala Thr Leu Phe Leu Phe Glu Glu 
            340                 345                 350         


Tyr Gly Lys Pro Leu Ile Ile Thr Glu Asn Gly Ile Thr Leu Asp Asp 
        355                 360                 365             


Lys Pro Asn Ala Asp Gly Glu Val Asn Asp Val Gln Arg Ile Ala Phe 
    370                 375                 380                 


Leu Asn Asp Tyr Leu Ser Gly Leu Gln Arg Ser Val Asp Asp Gly Ile 
385                 390                 395                 400 


Pro Val Leu Gly Tyr Phe His Trp Ser Leu Cys Asp Asn Phe Glu Trp 
                405                 410                 415     


Ala Glu Gly Tyr Val Pro Arg Phe Gly Leu Ile His Val Asp Tyr Ala 
            420                 425                 430         


Ser Gln Arg Arg Thr Ile Lys Ala Ser Gly Arg Phe Tyr Arg Asp Ile 
        435                 440                 445             


Ile Arg Gly Gln Thr Ala Thr Pro Cys Ile Ala Gln Ser Ser Gln Pro 
    450                 455                 460                 


Glu Thr Thr 
465         


<210> 359
<211> 1101
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 359
atgagaaatc atctgaatgt acccttttac tttatcttct tttttttaat agcgtcaata     60

tttacagtct gttcatcatc aactgcttct gataacaatg agcatccacc gccagtggaa    120

gtcgcggatc aggacgcttt tcgtgatgct tttgaagtga atgaattact tggacgcggt    180

attaatctgg gtaatgccct tgaagcgccc aatgaaggcg aatggggaat ggtaatccag    240

gaagagtttc ttgatctgat acttgcagca ggttttgagt ctgtacgaat tccgattcgc    300

tggaatgccc atgccagtga aagtcaccct ttcaccattc aacgatcgtt ttttgatcgg    360

gttgatgaag tcatccaatg gtcgctggat cgtggccttt ctgtaatgat caatattcat    420

cactacaatg aactgatgca aaacccgcag cagcaccggc agcggttttt gcgactctgg    480

aaccagattg ctacacacta taaagattat ccggataatc tggtttttga aatccttaat    540

gaacctcatg ataatctgac tccttctatc tggaatagtt atttgaggga tgctattggc    600

atgattcgcc agacaaaccc acgcagggtt atcgctatcg gaacagcaaa ctggggtggt    660

ttcggagcat tatcacaact tgaaatcccc tcaaacgatc gccagatcat tgcaactgtt    720

cattattatg aacccttcag gttcacccat cagggggctg aatgggcagg accggaaaca    780

aacgattggc tggggacacg atgggatgga tcggatgagg aaaaatttga tattgaaagt    840

ggttttgatg ccgtacagtc ctgggcagtg acaaataacc ggcctgttca tctcggagaa    900

ttcggtgctt acagtactgc cgataatgaa tcacgcgaac gctggacaac ctttgttcgg    960

gaatccgctg agcaacgcaa tttcagctgg gcatactggg aatttgcagc cggttttggg   1020

atctatgacc gtaatcagtg gcaatggagg gattatctgt tgagggcttt gataccggat   1080

agcccggtcc tgttggagta a                                             1101

<210> 360
<211> 366
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (64)...(342)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (176)...(185)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (313)...(316)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (332)...(335)
<223> N-glycosylation site. Prosite id = PS00001

<400> 360
Met Arg Asn His Leu Asn Val Pro Phe Tyr Phe Ile Phe Phe Phe Leu 
1               5                   10                  15      


Ile Ala Ser Ile Phe Thr Val Cys Ser Ser Ser Thr Ala Ser Asp Asn 
            20                  25                  30          


Asn Glu His Pro Pro Pro Val Glu Val Ala Asp Gln Asp Ala Phe Arg 
        35                  40                  45              


Asp Ala Phe Glu Val Asn Glu Leu Leu Gly Arg Gly Ile Asn Leu Gly 
    50                  55                  60                  


Asn Ala Leu Glu Ala Pro Asn Glu Gly Glu Trp Gly Met Val Ile Gln 
65                  70                  75                  80  


Glu Glu Phe Leu Asp Leu Ile Leu Ala Ala Gly Phe Glu Ser Val Arg 
                85                  90                  95      


Ile Pro Ile Arg Trp Asn Ala His Ala Ser Glu Ser His Pro Phe Thr 
            100                 105                 110         


Ile Gln Arg Ser Phe Phe Asp Arg Val Asp Glu Val Ile Gln Trp Ser 
        115                 120                 125             


Leu Asp Arg Gly Leu Ser Val Met Ile Asn Ile His His Tyr Asn Glu 
    130                 135                 140                 


Leu Met Gln Asn Pro Gln Gln His Arg Gln Arg Phe Leu Arg Leu Trp 
145                 150                 155                 160 


Asn Gln Ile Ala Thr His Tyr Lys Asp Tyr Pro Asp Asn Leu Val Phe 
                165                 170                 175     


Glu Ile Leu Asn Glu Pro His Asp Asn Leu Thr Pro Ser Ile Trp Asn 
            180                 185                 190         


Ser Tyr Leu Arg Asp Ala Ile Gly Met Ile Arg Gln Thr Asn Pro Arg 
        195                 200                 205             


Arg Val Ile Ala Ile Gly Thr Ala Asn Trp Gly Gly Phe Gly Ala Leu 
    210                 215                 220                 


Ser Gln Leu Glu Ile Pro Ser Asn Asp Arg Gln Ile Ile Ala Thr Val 
225                 230                 235                 240 


His Tyr Tyr Glu Pro Phe Arg Phe Thr His Gln Gly Ala Glu Trp Ala 
                245                 250                 255     


Gly Pro Glu Thr Asn Asp Trp Leu Gly Thr Arg Trp Asp Gly Ser Asp 
            260                 265                 270         


Glu Glu Lys Phe Asp Ile Glu Ser Gly Phe Asp Ala Val Gln Ser Trp 
        275                 280                 285             


Ala Val Thr Asn Asn Arg Pro Val His Leu Gly Glu Phe Gly Ala Tyr 
    290                 295                 300                 


Ser Thr Ala Asp Asn Glu Ser Arg Glu Arg Trp Thr Thr Phe Val Arg 
305                 310                 315                 320 


Glu Ser Ala Glu Gln Arg Asn Phe Ser Trp Ala Tyr Trp Glu Phe Ala 
                325                 330                 335     


Ala Gly Phe Gly Ile Tyr Asp Arg Asn Gln Trp Gln Trp Arg Asp Tyr 
            340                 345                 350         


Leu Leu Arg Ala Leu Ile Pro Asp Ser Pro Val Leu Leu Glu 
        355                 360                 365     


<210> 361
<211> 1284
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 361
gtgaacaccg cgcatcgcat cgaattccct cggcaattta tcttcggttc cgccactgct     60

gctcaccaag tggagggcaa caacgttcac aatgattggt gggcccacga gcatgccacc    120

gacacgaatg ccgtggagcc gtcgggcctc gcctgcgacc actttcggcg ctttgccgac    180

gacttccgcc tcttacgcca actcggacag ccagcgcacc gcctgtcgct ggaatggagc    240

cgcatcgaac cggcacccgg tgaaatcgat cgttcggcat tgtcccacta ccgccgagtc    300

ctgggtactt tgcgagacct cggaatcgag ccatgggtca ccatccacca cttcacttgc    360

cctcgctggt tcgtggaaca gggagggttt acacgcatgg attcagcgcg ctctctcgtt    420

cgccataccg aacgcgtggc gagggagttc tccgacctag tcacaaactg gtgcaccata    480

aatgagccaa acgtcgtggc agaactcggt tatcgcttcg gatactttcc gccgcggttg    540

caggacgatg agctggcagc ggaagtgctc accaacttct ttcgcttaca cgctgaaatg    600

gcagaagttt tgcgcgctca cgcgcagaga tcggcgcaaa tcggtatcac ccttgcgatg    660

caagcacacg agccgctgcg catcgaaagc gaagcggacc gcgcactggc ggcgcggcgc    720

gacgccgaga ccaacggcgt catgctcaac gccttgcgaa ccggtgtatt cgcctacccg    780

ggacgggagc cggtggaaat ccctggactg aaaacgtcat cgaccttcgt gggggtccag    840

tactattcgc gggtccgcta cgacgccgag tcgcaaggtc cagcaatgcc cgacttcgag    900

cgcaccctca gccaaatggg atgggaggtg tatcctgagg ggttcggccc cttgctcgag    960

cgcgcagcag aaactggact cgaagtgatc gtcacagaga acgggatggc gcacgacgat   1020

gaccgtgtgc gcgtgcgttt tatcgccgac cacttgcggg tcgttcaccg ccttctggaa   1080

cgcggtgtgc gcatcggagg gtacttttac tggtcgacca tggacaactt cgaatggaac   1140

ttcgggtacg gaccgaagtt cggcctgatc gaagtggacc gttctaccct ggaacgcagg   1200

ccgcggcgaa gcgcgtattt cttccgtgac atgatccagc agcgagtgct cgacgacgac   1260

ctggtcgagc actggactcg ctga                                          1284

<210> 362
<211> 427
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (5)...(417)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (334)...(342)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 362
Met Asn Thr Ala His Arg Ile Glu Phe Pro Arg Gln Phe Ile Phe Gly 
1               5                   10                  15      


Ser Ala Thr Ala Ala His Gln Val Glu Gly Asn Asn Val His Asn Asp 
            20                  25                  30          


Trp Trp Ala His Glu His Ala Thr Asp Thr Asn Ala Val Glu Pro Ser 
        35                  40                  45              


Gly Leu Ala Cys Asp His Phe Arg Arg Phe Ala Asp Asp Phe Arg Leu 
    50                  55                  60                  


Leu Arg Gln Leu Gly Gln Pro Ala His Arg Leu Ser Leu Glu Trp Ser 
65                  70                  75                  80  


Arg Ile Glu Pro Ala Pro Gly Glu Ile Asp Arg Ser Ala Leu Ser His 
                85                  90                  95      


Tyr Arg Arg Val Leu Gly Thr Leu Arg Asp Leu Gly Ile Glu Pro Trp 
            100                 105                 110         


Val Thr Ile His His Phe Thr Cys Pro Arg Trp Phe Val Glu Gln Gly 
        115                 120                 125             


Gly Phe Thr Arg Met Asp Ser Ala Arg Ser Leu Val Arg His Thr Glu 
    130                 135                 140                 


Arg Val Ala Arg Glu Phe Ser Asp Leu Val Thr Asn Trp Cys Thr Ile 
145                 150                 155                 160 


Asn Glu Pro Asn Val Val Ala Glu Leu Gly Tyr Arg Phe Gly Tyr Phe 
                165                 170                 175     


Pro Pro Arg Leu Gln Asp Asp Glu Leu Ala Ala Glu Val Leu Thr Asn 
            180                 185                 190         


Phe Phe Arg Leu His Ala Glu Met Ala Glu Val Leu Arg Ala His Ala 
        195                 200                 205             


Gln Arg Ser Ala Gln Ile Gly Ile Thr Leu Ala Met Gln Ala His Glu 
    210                 215                 220                 


Pro Leu Arg Ile Glu Ser Glu Ala Asp Arg Ala Leu Ala Ala Arg Arg 
225                 230                 235                 240 


Asp Ala Glu Thr Asn Gly Val Met Leu Asn Ala Leu Arg Thr Gly Val 
                245                 250                 255     


Phe Ala Tyr Pro Gly Arg Glu Pro Val Glu Ile Pro Gly Leu Lys Thr 
            260                 265                 270         


Ser Ser Thr Phe Val Gly Val Gln Tyr Tyr Ser Arg Val Arg Tyr Asp 
        275                 280                 285             


Ala Glu Ser Gln Gly Pro Ala Met Pro Asp Phe Glu Arg Thr Leu Ser 
    290                 295                 300                 


Gln Met Gly Trp Glu Val Tyr Pro Glu Gly Phe Gly Pro Leu Leu Glu 
305                 310                 315                 320 


Arg Ala Ala Glu Thr Gly Leu Glu Val Ile Val Thr Glu Asn Gly Met 
                325                 330                 335     


Ala His Asp Asp Asp Arg Val Arg Val Arg Phe Ile Ala Asp His Leu 
            340                 345                 350         


Arg Val Val His Arg Leu Leu Glu Arg Gly Val Arg Ile Gly Gly Tyr 
        355                 360                 365             


Phe Tyr Trp Ser Thr Met Asp Asn Phe Glu Trp Asn Phe Gly Tyr Gly 
    370                 375                 380                 


Pro Lys Phe Gly Leu Ile Glu Val Asp Arg Ser Thr Leu Glu Arg Arg 
385                 390                 395                 400 


Pro Arg Arg Ser Ala Tyr Phe Phe Arg Asp Met Ile Gln Gln Arg Val 
                405                 410                 415     


Leu Asp Asp Asp Leu Val Glu His Trp Thr Arg 
            420                 425         


<210> 363
<211> 1386
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 363
atgtcgtttc cgagaaattt cctgtgggga tcagccacct cctcctacca aatcgaaggc     60

gcctggcaag aagacggcaa aggcccaaat atctgggacg tgttttcaca caccccgggg    120

aaagtcgcca atggcgacac cggtgatatc gccatcgacc actaccaccg ataccgagac    180

gacgttgccc tgatggctga gcttggactt caggcatacc gtttctcgtt ctcctgggcc    240

agaataatgc cggaaggagc aggccccatc gagcaacggg gtctggactt ctacgaccgc    300

ctcattgatg cactgctgga gaaaaacatc caacccatgg ccaccctcta ccactgggat    360

ttaccagccg cactgcaaga cagagggggg tggactaacc gcgacagcgc gtcctggttt    420

gctgactact cagccgttgt tcacgacgct ttttctgacc gggtgggaat gtgggcaacg    480

ttgaacgagc cgtgggtgtc tgcatttttg ggccacggaa ctggcatcca cgcacctggc    540

atcacaagcc cccacgcggc gttcgccgcg gggcatcacc tgcttctggg gcatggcaag    600

gccatccaag cgatgcgcgc tcaatcgtct agcacccaac tgggaattgt tttgaacctc    660

gcccccgtgt atctcgaagg tgacacccct gctgaccacc cggctcacac ctccgtggca    720

ctacacgatg ccattttgaa tgggttgtgg acagagccgc ttctgcgctc cagatacccc    780

gacctgcttc ttcaactagg cgacatggtg acaaaaaaca tccacgacgg tgacctcgcc    840

atcatggccg agccgattga ctggatgggc atcaactact accaggacat tagatttgtg    900

gccactgatg ttgcccccac ggctaacccg atggcccctc cgggtaacga cctgccgggc    960

accgtcgggg tggagcctgc gccagcaatc ggaaacatca ccagctttgg ctggtccacc   1020

acccccgacg gactgcgagt actgttggtg ggcctggatg aggaatacga caacctcccg   1080

ccgatattca ttaccgaaaa cgggtgtgct tacgattacc ccgtcgagga cggtgtcgtc   1140

aacgacaccc ttcgtgtcac atacatgcga gaacacctca ccgcgttgtc gcaggccatt   1200

gaggcgggtg tgaatgtccg gggctatatg cactggtctc tgttcgacaa cttcgagtgg   1260

gccgaagggt atcgccaacg ctttggcatg gtgcacgtcg actttgagac cttggagcgg   1320

actcccaaag cctcagctca ctactattca cgtgtcatca caaataacgc cctctctgac   1380

gactga                                                              1386

<210> 364
<211> 461
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(458)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (7)...(21)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (337)...(340)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (386)...(389)
<223> N-glycosylation site. Prosite id = PS00001

<400> 364
Met Ser Phe Pro Arg Asn Phe Leu Trp Gly Ser Ala Thr Ser Ser Tyr 
1               5                   10                  15      


Gln Ile Glu Gly Ala Trp Gln Glu Asp Gly Lys Gly Pro Asn Ile Trp 
            20                  25                  30          


Asp Val Phe Ser His Thr Pro Gly Lys Val Ala Asn Gly Asp Thr Gly 
        35                  40                  45              


Asp Ile Ala Ile Asp His Tyr His Arg Tyr Arg Asp Asp Val Ala Leu 
    50                  55                  60                  


Met Ala Glu Leu Gly Leu Gln Ala Tyr Arg Phe Ser Phe Ser Trp Ala 
65                  70                  75                  80  


Arg Ile Met Pro Glu Gly Ala Gly Pro Ile Glu Gln Arg Gly Leu Asp 
                85                  90                  95      


Phe Tyr Asp Arg Leu Ile Asp Ala Leu Leu Glu Lys Asn Ile Gln Pro 
            100                 105                 110         


Met Ala Thr Leu Tyr His Trp Asp Leu Pro Ala Ala Leu Gln Asp Arg 
        115                 120                 125             


Gly Gly Trp Thr Asn Arg Asp Ser Ala Ser Trp Phe Ala Asp Tyr Ser 
    130                 135                 140                 


Ala Val Val His Asp Ala Phe Ser Asp Arg Val Gly Met Trp Ala Thr 
145                 150                 155                 160 


Leu Asn Glu Pro Trp Val Ser Ala Phe Leu Gly His Gly Thr Gly Ile 
                165                 170                 175     


His Ala Pro Gly Ile Thr Ser Pro His Ala Ala Phe Ala Ala Gly His 
            180                 185                 190         


His Leu Leu Leu Gly His Gly Lys Ala Ile Gln Ala Met Arg Ala Gln 
        195                 200                 205             


Ser Ser Ser Thr Gln Leu Gly Ile Val Leu Asn Leu Ala Pro Val Tyr 
    210                 215                 220                 


Leu Glu Gly Asp Thr Pro Ala Asp His Pro Ala His Thr Ser Val Ala 
225                 230                 235                 240 


Leu His Asp Ala Ile Leu Asn Gly Leu Trp Thr Glu Pro Leu Leu Arg 
                245                 250                 255     


Ser Arg Tyr Pro Asp Leu Leu Leu Gln Leu Gly Asp Met Val Thr Lys 
            260                 265                 270         


Asn Ile His Asp Gly Asp Leu Ala Ile Met Ala Glu Pro Ile Asp Trp 
        275                 280                 285             


Met Gly Ile Asn Tyr Tyr Gln Asp Ile Arg Phe Val Ala Thr Asp Val 
    290                 295                 300                 


Ala Pro Thr Ala Asn Pro Met Ala Pro Pro Gly Asn Asp Leu Pro Gly 
305                 310                 315                 320 


Thr Val Gly Val Glu Pro Ala Pro Ala Ile Gly Asn Ile Thr Ser Phe 
                325                 330                 335     


Gly Trp Ser Thr Thr Pro Asp Gly Leu Arg Val Leu Leu Val Gly Leu 
            340                 345                 350         


Asp Glu Glu Tyr Asp Asn Leu Pro Pro Ile Phe Ile Thr Glu Asn Gly 
        355                 360                 365             


Cys Ala Tyr Asp Tyr Pro Val Glu Asp Gly Val Val Asn Asp Thr Leu 
    370                 375                 380                 


Arg Val Thr Tyr Met Arg Glu His Leu Thr Ala Leu Ser Gln Ala Ile 
385                 390                 395                 400 


Glu Ala Gly Val Asn Val Arg Gly Tyr Met His Trp Ser Leu Phe Asp 
                405                 410                 415     


Asn Phe Glu Trp Ala Glu Gly Tyr Arg Gln Arg Phe Gly Met Val His 
            420                 425                 430         


Val Asp Phe Glu Thr Leu Glu Arg Thr Pro Lys Ala Ser Ala His Tyr 
        435                 440                 445             


Tyr Ser Arg Val Ile Thr Asn Asn Ala Leu Ser Asp Asp 
    450                 455                 460     


<210> 365
<211> 1266
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 365
gtggaaaagg aaggctttct ctttggcgcg gccaccagcg cctaccagat cgaaggggcc     60

acgggggagg atgggcgagg gccttccatc tgggacgttt tctgccaacg ccccggggcc    120

atccgggatg ggagctcggg cgaacccgcc tgcgaccatt accgccgttg gcgcgaggac    180

ctgcagtgga tgcgttggct gggcctaaag gcctaccgtt tctccgtggc ctggccccgc    240

atcctccccg ggggaaaggg gcgcatcaac ccgaagggcc tcgccttcta cgaccgcctc    300

gtggacgccc tcctggaagc ggggatcacc cccttcctca ccctctacca ctgggacctc    360

ccctgggcct tggaggaacg gggcgggtgg cggagccggg agaccgccta cgccttcgcc    420

gagtacaccg ccttggtggc ccgggccctg gccgaccgcg tcccctactt cgccaccctc    480

aacgagccct ggtgcagcgc cttcctgggc cacttcacgg gcgagcatgc ccccgggctc    540

cggaacctcg aggccgccct ccgcgccgcc caccacctcc tcctgggcca cggcctggcc    600

gtggaggcct tgcgggccgc cggggccaag cgggtgggca tcgtcctgaa cttcacctgg    660

gtggaggggg aggacgagga agcggtggag cgggcggacc gctaccacaa ccgtttcttc    720

ctggaccccc tcttgggccg gggttacccc gaaagcccct tcgccaaccc acccagcgtc    780

cccatctatc ccaaggacct ggagcgcatg gcaaggcccc tggacttcct cggggtgaac    840

tactacaccc gggcccgggt ggcccggggg gaaggcctct tgccggtgcg ctacctgccc    900

ccggaaaggc ccaccacggc catgggctgg gaggtctacc ccgaggggct ctaccgcctc    960

ctgaggcgcc tggcccggga aaccccttgg cccctcttcg tcacggaaaa cggggccgcc   1020

tacccggacc tttggcgagg ggaagaggtg gtggaggacc ccgagcgggt ggcttacctg   1080

gaaagccacc tggaagcggt gggtagggcg cggagcgaag gcgtagacgt cagggggtac   1140

ttcgcctgga gccttctgga caactttgag tgggcccacg gctacaccaa gcgcttcggg   1200

ctcctctacg tggactatcc cacgggacgg cgcatcccca agcggagcgc cctctggtac   1260

cgggag                                                              1266

<210> 366
<211> 422
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(422)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (6)...(20)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (220)...(223)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (337)...(345)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 366
Met Glu Lys Glu Gly Phe Leu Phe Gly Ala Ala Thr Ser Ala Tyr Gln 
1               5                   10                  15      


Ile Glu Gly Ala Thr Gly Glu Asp Gly Arg Gly Pro Ser Ile Trp Asp 
            20                  25                  30          


Val Phe Cys Gln Arg Pro Gly Ala Ile Arg Asp Gly Ser Ser Gly Glu 
        35                  40                  45              


Pro Ala Cys Asp His Tyr Arg Arg Trp Arg Glu Asp Leu Gln Trp Met 
    50                  55                  60                  


Arg Trp Leu Gly Leu Lys Ala Tyr Arg Phe Ser Val Ala Trp Pro Arg 
65                  70                  75                  80  


Ile Leu Pro Gly Gly Lys Gly Arg Ile Asn Pro Lys Gly Leu Ala Phe 
                85                  90                  95      


Tyr Asp Arg Leu Val Asp Ala Leu Leu Glu Ala Gly Ile Thr Pro Phe 
            100                 105                 110         


Leu Thr Leu Tyr His Trp Asp Leu Pro Trp Ala Leu Glu Glu Arg Gly 
        115                 120                 125             


Gly Trp Arg Ser Arg Glu Thr Ala Tyr Ala Phe Ala Glu Tyr Thr Ala 
    130                 135                 140                 


Leu Val Ala Arg Ala Leu Ala Asp Arg Val Pro Tyr Phe Ala Thr Leu 
145                 150                 155                 160 


Asn Glu Pro Trp Cys Ser Ala Phe Leu Gly His Phe Thr Gly Glu His 
                165                 170                 175     


Ala Pro Gly Leu Arg Asn Leu Glu Ala Ala Leu Arg Ala Ala His His 
            180                 185                 190         


Leu Leu Leu Gly His Gly Leu Ala Val Glu Ala Leu Arg Ala Ala Gly 
        195                 200                 205             


Ala Lys Arg Val Gly Ile Val Leu Asn Phe Thr Trp Val Glu Gly Glu 
    210                 215                 220                 


Asp Glu Glu Ala Val Glu Arg Ala Asp Arg Tyr His Asn Arg Phe Phe 
225                 230                 235                 240 


Leu Asp Pro Leu Leu Gly Arg Gly Tyr Pro Glu Ser Pro Phe Ala Asn 
                245                 250                 255     


Pro Pro Ser Val Pro Ile Tyr Pro Lys Asp Leu Glu Arg Met Ala Arg 
            260                 265                 270         


Pro Leu Asp Phe Leu Gly Val Asn Tyr Tyr Thr Arg Ala Arg Val Ala 
        275                 280                 285             


Arg Gly Glu Gly Leu Leu Pro Val Arg Tyr Leu Pro Pro Glu Arg Pro 
    290                 295                 300                 


Thr Thr Ala Met Gly Trp Glu Val Tyr Pro Glu Gly Leu Tyr Arg Leu 
305                 310                 315                 320 


Leu Arg Arg Leu Ala Arg Glu Thr Pro Trp Pro Leu Phe Val Thr Glu 
                325                 330                 335     


Asn Gly Ala Ala Tyr Pro Asp Leu Trp Arg Gly Glu Glu Val Val Glu 
            340                 345                 350         


Asp Pro Glu Arg Val Ala Tyr Leu Glu Ser His Leu Glu Ala Val Gly 
        355                 360                 365             


Arg Ala Arg Ser Glu Gly Val Asp Val Arg Gly Tyr Phe Ala Trp Ser 
    370                 375                 380                 


Leu Leu Asp Asn Phe Glu Trp Ala His Gly Tyr Thr Lys Arg Phe Gly 
385                 390                 395                 400 


Leu Leu Tyr Val Asp Tyr Pro Thr Gly Arg Arg Ile Pro Lys Arg Ser 
                405                 410                 415     


Ala Leu Trp Tyr Arg Glu 
            420         


<210> 367
<211> 1374
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 367
atgaccacca tgcataacga cgataccgac accagctttc ccgccacctt cacctggggc     60

gtggccacca gcgcctacca gatcgaaggc gccgccgcca tcggcggccg cggcccgtcc    120

atctgggata ccttcagcca cacggaaggc aagatcatcg acggcagcaa tggcgacgtg    180

gcctgcgacc actaccaccg ctatgccgag gacgtggagc tgatcgccag cctgggcgtg    240

aacgcctacc gcttttccat gtcctggtcg cgcgtccagc ccacgggttc cggcgcctgg    300

aacgaagcag gctttgattt ctatgcccgc ctgctcgacg ccctggccgc caagggactc    360

gacgcgcacc tgaccctgta ccactgggac ctgccgcaag ccttgcagga cgagggcggc    420

tggctcaatc gcgccacctg ctaccacttc gccgcgtatg ccgccgaggt ggcgcgccgc    480

ttcggccaca aggtcgccag catcgccacg cacaatgagc cgtggtgcac tgccgtgctg    540

ggccacggca ccggccagtt cgcgcccggc atggccgatc cggccgccgc cgtgcaggtg    600

tcgcaccacc tgctgctgtc gcacggcctg gccatgcagg cgatgcgcac ggtgaacgcg    660

ccggcaaagc tgggcatcgt gctcaatcag tggacggcca cgccagccac cgacagcgcg    720

caggaccgcg agctggccga actcgaatat gcgcgctcgg tgcagtggta tatggacgcc    780

atcttcaagg gccgctaccc ggccctggct ctgaaacaca tcgacgcaca agctttatcc    840

atctttgaaa acgatttcat agatatcaag caacccatcg atttcctcgg cgtgaactac    900

tacacgcgcg ccttcatgag cgccgagacg ccgccgcgca agcctgaatg caagctcggc    960

gtcaacgaca tgggctggga aacctatccg cagggcttga cggaactgct cgtcggcctg   1020

caccgcgagt accgcctgcc gcccgtctac atcacggaaa acggcatggc cgtggccgac   1080

aagcccgtcg acggcaaggt gcacgacgaa ccgcgcatcg agtacgtgcg gctgcacctg   1140

gacgccttgc gcgccgtggt cgcgcagggc atcgacgtgc gcggctattt ctactggagc   1200

ctgatggaca acttcgagtg gaactcaggc tacgccaagc gcttcggcat gctgtatgtc   1260

gactacgcca cgcagcagcg cagcttcaag gacagcgccc tgtggtaccg cgacttcatc   1320

gccgcgcagc gggccgcgca cgctgatgcg ccggcgctgg cggaggggaa ctga         1374

<210> 368
<211> 457
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (9)...(445)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (17)...(31)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (354)...(362)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 368
Met Thr Thr Met His Asn Asp Asp Thr Asp Thr Ser Phe Pro Ala Thr 
1               5                   10                  15      


Phe Thr Trp Gly Val Ala Thr Ser Ala Tyr Gln Ile Glu Gly Ala Ala 
            20                  25                  30          


Ala Ile Gly Gly Arg Gly Pro Ser Ile Trp Asp Thr Phe Ser His Thr 
        35                  40                  45              


Glu Gly Lys Ile Ile Asp Gly Ser Asn Gly Asp Val Ala Cys Asp His 
    50                  55                  60                  


Tyr His Arg Tyr Ala Glu Asp Val Glu Leu Ile Ala Ser Leu Gly Val 
65                  70                  75                  80  


Asn Ala Tyr Arg Phe Ser Met Ser Trp Ser Arg Val Gln Pro Thr Gly 
                85                  90                  95      


Ser Gly Ala Trp Asn Glu Ala Gly Phe Asp Phe Tyr Ala Arg Leu Leu 
            100                 105                 110         


Asp Ala Leu Ala Ala Lys Gly Leu Asp Ala His Leu Thr Leu Tyr His 
        115                 120                 125             


Trp Asp Leu Pro Gln Ala Leu Gln Asp Glu Gly Gly Trp Leu Asn Arg 
    130                 135                 140                 


Ala Thr Cys Tyr His Phe Ala Ala Tyr Ala Ala Glu Val Ala Arg Arg 
145                 150                 155                 160 


Phe Gly His Lys Val Ala Ser Ile Ala Thr His Asn Glu Pro Trp Cys 
                165                 170                 175     


Thr Ala Val Leu Gly His Gly Thr Gly Gln Phe Ala Pro Gly Met Ala 
            180                 185                 190         


Asp Pro Ala Ala Ala Val Gln Val Ser His His Leu Leu Leu Ser His 
        195                 200                 205             


Gly Leu Ala Met Gln Ala Met Arg Thr Val Asn Ala Pro Ala Lys Leu 
    210                 215                 220                 


Gly Ile Val Leu Asn Gln Trp Thr Ala Thr Pro Ala Thr Asp Ser Ala 
225                 230                 235                 240 


Gln Asp Arg Glu Leu Ala Glu Leu Glu Tyr Ala Arg Ser Val Gln Trp 
                245                 250                 255     


Tyr Met Asp Ala Ile Phe Lys Gly Arg Tyr Pro Ala Leu Ala Leu Lys 
            260                 265                 270         


His Ile Asp Ala Gln Ala Leu Ser Ile Phe Glu Asn Asp Phe Ile Asp 
        275                 280                 285             


Ile Lys Gln Pro Ile Asp Phe Leu Gly Val Asn Tyr Tyr Thr Arg Ala 
    290                 295                 300                 


Phe Met Ser Ala Glu Thr Pro Pro Arg Lys Pro Glu Cys Lys Leu Gly 
305                 310                 315                 320 


Val Asn Asp Met Gly Trp Glu Thr Tyr Pro Gln Gly Leu Thr Glu Leu 
                325                 330                 335     


Leu Val Gly Leu His Arg Glu Tyr Arg Leu Pro Pro Val Tyr Ile Thr 
            340                 345                 350         


Glu Asn Gly Met Ala Val Ala Asp Lys Pro Val Asp Gly Lys Val His 
        355                 360                 365             


Asp Glu Pro Arg Ile Glu Tyr Val Arg Leu His Leu Asp Ala Leu Arg 
    370                 375                 380                 


Ala Val Val Ala Gln Gly Ile Asp Val Arg Gly Tyr Phe Tyr Trp Ser 
385                 390                 395                 400 


Leu Met Asp Asn Phe Glu Trp Asn Ser Gly Tyr Ala Lys Arg Phe Gly 
                405                 410                 415     


Met Leu Tyr Val Asp Tyr Ala Thr Gln Gln Arg Ser Phe Lys Asp Ser 
            420                 425                 430         


Ala Leu Trp Tyr Arg Asp Phe Ile Ala Ala Gln Arg Ala Ala His Ala 
        435                 440                 445             


Asp Ala Pro Ala Leu Ala Glu Gly Asn 
    450                 455         


<210> 369
<211> 1620
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 369
atgtttaaaa attattttat acttaacaga cttgttgtag aattaaacaa agaagtaaaa     60

gactttgaag taattcgtgc attttcctac gaaaaagata aaattgcttt tattctcagg    120

aaaaagctag atgaattaag tattgaaata tcagttaatc cgggatttcc atatttaagt    180

ttgcgaaata gattttcaat tccacgaaaa aaccttgttg atttctttag ttcatactta    240

cccctaaaaa ttatttcatt tggaatttca gatagagata gagtcataaa aataaatctg    300

gataaagcat ccatttattt tacaatcaga ggaaaatata ctaatgtctt tttaatatcg    360

aatgaagctg aaatggagaa ctttaaaaaa accgatgaat cagttttaaa agacttcaaa    420

tatgaagctg agaatcagca ttttattcat gaatttaacg aattgcaatt tattgaaaag    480

tttgttgaac ctgagttagt tagaaagaaa tatccaatca ttggtaaaga aataattaat    540

gaagtaaaaa ttagagcaga aaatgctgat gtctccacat ctctgttatt aaaagtaatc    600

actgaagtaa atgaaaaaga tcctgctgtt tttattgatg agaatacaaa tgaagttaat    660

ttaggaattg aaacgtttaa tgtatttcca ttcaccaaaa aagaaatatt ttcaaacctg    720

attgaagcat ttaattattt tcttaatcag aaattttcca gagaaagtat ttttgataaa    780

aagaagataa ttgaaaagta tcttgagaat ggattaaata aagtctcagc taaattaaat    840

gatgttcaat ccagaattca aaggggtacg aaagaggaag agtataataa aatagctaat    900

cttcttttaa tcaatattaa taattttact aaaggtaaga atagtgttga actacaggat    960

atttataaag agaattcctt cgtttcaata aaattggata ctaaattatc tccaaagcaa   1020

agtgttgatt catattttga aaaagcaaga aacgaaaaaa taaaatttga aaaatccagg   1080

cagctctact ccgagcatca gaaaaaatat tctgaattac aaaggattaa aggaaaattt   1140

ttaaaagcta aaataacaga agaatatgat ttaattatga aagaattaaa tattaagcag   1200

gaagtaaaaa ccaaacctca ggatgattta aaggataaat tcagacatta tttagtccac   1260

aagaaataca atgtttatgt aggaaaagac agcaaaagta atgatctctt aactatgaaa   1320

tttgcaaaac agaatgatta ttggtttcat gccagagggc tttcagggtc tcacgtagtt   1380

cttaagaatg aaaatgctaa agaaggaata ccaaaaaata tattaaaagc tactgcatca   1440

ttggcagcct atcatagtaa agcaaaaacc gcaggaatgg cacctgtttc ttacacacaa   1500

aaaaaatatg taataaaaaa gaaaggtatg gaacccggca aagttgcctt gctgaaagaa   1560

gaggttttaa tcgttaagcc ggtaattcct gtagagtgcg agtatattag tgttgaatga   1620

<210> 370
<211> 539
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(406)
<223> Fibronectin-binding protein A N-terminus (FbpA)

<220> 
<221> DOMAIN
<222> (417)...(506)
<223> Domain of unknown function (DUF814)

<220> 
<221> SITE
<222> (312)...(315)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (329)...(332)
<223> N-glycosylation site. Prosite id = PS00001

<400> 370
Met Phe Lys Asn Tyr Phe Ile Leu Asn Arg Leu Val Val Glu Leu Asn 
1               5                   10                  15      


Lys Glu Val Lys Asp Phe Glu Val Ile Arg Ala Phe Ser Tyr Glu Lys 
            20                  25                  30          


Asp Lys Ile Ala Phe Ile Leu Arg Lys Lys Leu Asp Glu Leu Ser Ile 
        35                  40                  45              


Glu Ile Ser Val Asn Pro Gly Phe Pro Tyr Leu Ser Leu Arg Asn Arg 
    50                  55                  60                  


Phe Ser Ile Pro Arg Lys Asn Leu Val Asp Phe Phe Ser Ser Tyr Leu 
65                  70                  75                  80  


Pro Leu Lys Ile Ile Ser Phe Gly Ile Ser Asp Arg Asp Arg Val Ile 
                85                  90                  95      


Lys Ile Asn Leu Asp Lys Ala Ser Ile Tyr Phe Thr Ile Arg Gly Lys 
            100                 105                 110         


Tyr Thr Asn Val Phe Leu Ile Ser Asn Glu Ala Glu Met Glu Asn Phe 
        115                 120                 125             


Lys Lys Thr Asp Glu Ser Val Leu Lys Asp Phe Lys Tyr Glu Ala Glu 
    130                 135                 140                 


Asn Gln His Phe Ile His Glu Phe Asn Glu Leu Gln Phe Ile Glu Lys 
145                 150                 155                 160 


Phe Val Glu Pro Glu Leu Val Arg Lys Lys Tyr Pro Ile Ile Gly Lys 
                165                 170                 175     


Glu Ile Ile Asn Glu Val Lys Ile Arg Ala Glu Asn Ala Asp Val Ser 
            180                 185                 190         


Thr Ser Leu Leu Leu Lys Val Ile Thr Glu Val Asn Glu Lys Asp Pro 
        195                 200                 205             


Ala Val Phe Ile Asp Glu Asn Thr Asn Glu Val Asn Leu Gly Ile Glu 
    210                 215                 220                 


Thr Phe Asn Val Phe Pro Phe Thr Lys Lys Glu Ile Phe Ser Asn Leu 
225                 230                 235                 240 


Ile Glu Ala Phe Asn Tyr Phe Leu Asn Gln Lys Phe Ser Arg Glu Ser 
                245                 250                 255     


Ile Phe Asp Lys Lys Lys Ile Ile Glu Lys Tyr Leu Glu Asn Gly Leu 
            260                 265                 270         


Asn Lys Val Ser Ala Lys Leu Asn Asp Val Gln Ser Arg Ile Gln Arg 
        275                 280                 285             


Gly Thr Lys Glu Glu Glu Tyr Asn Lys Ile Ala Asn Leu Leu Leu Ile 
    290                 295                 300                 


Asn Ile Asn Asn Phe Thr Lys Gly Lys Asn Ser Val Glu Leu Gln Asp 
305                 310                 315                 320 


Ile Tyr Lys Glu Asn Ser Phe Val Ser Ile Lys Leu Asp Thr Lys Leu 
                325                 330                 335     


Ser Pro Lys Gln Ser Val Asp Ser Tyr Phe Glu Lys Ala Arg Asn Glu 
            340                 345                 350         


Lys Ile Lys Phe Glu Lys Ser Arg Gln Leu Tyr Ser Glu His Gln Lys 
        355                 360                 365             


Lys Tyr Ser Glu Leu Gln Arg Ile Lys Gly Lys Phe Leu Lys Ala Lys 
    370                 375                 380                 


Ile Thr Glu Glu Tyr Asp Leu Ile Met Lys Glu Leu Asn Ile Lys Gln 
385                 390                 395                 400 


Glu Val Lys Thr Lys Pro Gln Asp Asp Leu Lys Asp Lys Phe Arg His 
                405                 410                 415     


Tyr Leu Val His Lys Lys Tyr Asn Val Tyr Val Gly Lys Asp Ser Lys 
            420                 425                 430         


Ser Asn Asp Leu Leu Thr Met Lys Phe Ala Lys Gln Asn Asp Tyr Trp 
        435                 440                 445             


Phe His Ala Arg Gly Leu Ser Gly Ser His Val Val Leu Lys Asn Glu 
    450                 455                 460                 


Asn Ala Lys Glu Gly Ile Pro Lys Asn Ile Leu Lys Ala Thr Ala Ser 
465                 470                 475                 480 


Leu Ala Ala Tyr His Ser Lys Ala Lys Thr Ala Gly Met Ala Pro Val 
                485                 490                 495     


Ser Tyr Thr Gln Lys Lys Tyr Val Ile Lys Lys Lys Gly Met Glu Pro 
            500                 505                 510         


Gly Lys Val Ala Leu Leu Lys Glu Glu Val Leu Ile Val Lys Pro Val 
        515                 520                 525             


Ile Pro Val Glu Cys Glu Tyr Ile Ser Val Glu 
    530                 535                 


<210> 371
<211> 2265
<212> DNA
<213> Thermococcus AEPII1a

<400> 371
gtgagcagta agcaaaagac tgtggcaata tttgttttgt ttgttgcttt ggcgggagta     60

gccggaagca ttcctgcaag ctatgcagcg ccaagcacca gcacgtacac gacgcccacg    120

ggaatatact atgaagtcag aggagataca atctacatga taaacgttgc aacgggagag    180

gagaccccaa tacacctctt tggagtcaac tggttcggct ttgagacacc gaactacgtt    240

gttcacggcc tatggagtag gaactgggag gacatgctcc tccagatcaa gagccttggc    300

ttcaatgcga taaggcttcc cttctgtacc cagtcagtaa aaccggggac gatgccaacg    360

gcgattgact acgccaagaa cccagacctc cagggtcttg acagcgtcca gataatggag    420

aaaataatca agaaggctgg agacctgggc atattcgtgc tcctcgacta ccacagaata    480

ggatgcaact tcatagagcc cctatggtac accgacagct tctcggagca ggactacata    540

aacacctggg ttgaagtcgc ccagaggttc ggcaagtact tgaacgttat cggcgcggac    600

ctgaagaacg aaccccacag ctcaagcccc gcacctgccg cctacactga cggaagtggg    660

gccacgtggg gaatgggcaa caacgccacc gactggaacc tggcggctga gaggatagga    720

agggcaatcc tggaggttgc cccacactgg cttatattcg tcgagggaac ccagttcacc    780

acccccgaga tagacggtag ctacaagtgg ggccacaacg cctggtgggg cggaaacctc    840

atgggtgtta ggaagtaccc agtcaacctg cccaggaaca agctcgtcta cagcccccac    900

gtttacggtc cagacgttta cgaccagccc tactttgacc ccgctgaggg cttccccgac    960

aacctccccg acatatggta ccaccacttc ggctacgtaa agcttgatct cggttaccct   1020

gttgttatag gtgagttcgg aggcaagtac ggccatgggg gagacccgag ggatgtcact   1080

tggcagaaca agataataga ctggatgatc cagaacaaat tctgtgactt cttctactgg   1140

agctggaacc caaacagcgg tgacaccggt ggaattctga aggatgactg gacgacaata   1200

tgggaggaca agtacaacaa cctgaagagg ctcatggaca gctgttctgg aaacgccact   1260

gccccgtccg tccccacgac aactacaaca acaagcacac cgccaacgac cacaacgact   1320

acaacatcca ctccaacgac cactacccag accccgacca ccactactcc aactacgaca   1380

accaccacga ccacaactcc ttcaaataac gtcccatttg aaattgtgaa cgttctcccg   1440

actagctccc agtacgaggg aaccagcgtg gaggttgtat gtgatggaac ccagtgtgcc   1500

tccagcgttt ggggagctcc gaacctctgg ggagtcgtta aaatcggaaa cgccaccatg   1560

gaccccaacg tttggggctg ggaggacgtt tacaagactg caccccagga cattggaacc   1620

ggcagcacaa agatggagat aaggaacggg gtgctcaagg ttacaaacct ctggaacatc   1680

aacatgcatc cgaagtataa cacaatggca tacccggagg tcatatacgg cgccaagcct   1740

tggggcaacc agccaataaa cgctccgaac ttcgtgctcc cgataaaggt ctcccagctt   1800

ccgaggatac tcgttgacac aaagtacacg ctcgaaaaga gcttcccggg aaacaacttc   1860

gcctttgagg cctggctctt caaggatgcc aacaacatga gggcaccagg ccagggggac   1920

tacgagataa tggtacagct ctacatcgag ggcggctatc ctgcgggcta cgacaagggg   1980

ccagttctca ccgttgatgt tccgataatc gtcgatggaa ggcttgtaaa ccagactttt   2040

gagctctacg acgtcatagc ggatgccgga tggaggttct tcaccttcaa gccaactaag   2100

aactacaacg gctcagaggt tgtgttcgac tatcagagct gtcccgcaga aggtattcaa   2160

cgctcaagga gatgctggag agaactcaat gaccccttcc ttcttcaaaa actcctctat   2220

ctcaaggacg tcttcgggag acagctgaaa gacccttctc tctga                   2265

<210> 372
<211> 754
<212> PRT
<213> Thermococcus AEPII1a

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (54)...(390)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> DOMAIN
<222> (589)...(736)
<223> Glycosyl hydrolase family 12

<220> 
<221> SITE
<222> (200)...(209)
<223> Glycosyl hydrolases family 5 signature. Prosite id = PS00659

<220> 
<221> SITE
<222> (231)...(234)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (424)...(427)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (524)...(527)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (687)...(690)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (713)...(716)
<223> N-glycosylation site. Prosite id = PS00001

<400> 372
Met Ser Ser Lys Gln Lys Thr Val Ala Ile Phe Val Leu Phe Val Ala 
1               5                   10                  15      


Leu Ala Gly Val Ala Gly Ser Ile Pro Ala Ser Tyr Ala Ala Pro Ser 
            20                  25                  30          


Thr Ser Thr Tyr Thr Thr Pro Thr Gly Ile Tyr Tyr Glu Val Arg Gly 
        35                  40                  45              


Asp Thr Ile Tyr Met Ile Asn Val Ala Thr Gly Glu Glu Thr Pro Ile 
    50                  55                  60                  


His Leu Phe Gly Val Asn Trp Phe Gly Phe Glu Thr Pro Asn Tyr Val 
65                  70                  75                  80  


Val His Gly Leu Trp Ser Arg Asn Trp Glu Asp Met Leu Leu Gln Ile 
                85                  90                  95      


Lys Ser Leu Gly Phe Asn Ala Ile Arg Leu Pro Phe Cys Thr Gln Ser 
            100                 105                 110         


Val Lys Pro Gly Thr Met Pro Thr Ala Ile Asp Tyr Ala Lys Asn Pro 
        115                 120                 125             


Asp Leu Gln Gly Leu Asp Ser Val Gln Ile Met Glu Lys Ile Ile Lys 
    130                 135                 140                 


Lys Ala Gly Asp Leu Gly Ile Phe Val Leu Leu Asp Tyr His Arg Ile 
145                 150                 155                 160 


Gly Cys Asn Phe Ile Glu Pro Leu Trp Tyr Thr Asp Ser Phe Ser Glu 
                165                 170                 175     


Gln Asp Tyr Ile Asn Thr Trp Val Glu Val Ala Gln Arg Phe Gly Lys 
            180                 185                 190         


Tyr Leu Asn Val Ile Gly Ala Asp Leu Lys Asn Glu Pro His Ser Ser 
        195                 200                 205             


Ser Pro Ala Pro Ala Ala Tyr Thr Asp Gly Ser Gly Ala Thr Trp Gly 
    210                 215                 220                 


Met Gly Asn Asn Ala Thr Asp Trp Asn Leu Ala Ala Glu Arg Ile Gly 
225                 230                 235                 240 


Arg Ala Ile Leu Glu Val Ala Pro His Trp Leu Ile Phe Val Glu Gly 
                245                 250                 255     


Thr Gln Phe Thr Thr Pro Glu Ile Asp Gly Ser Tyr Lys Trp Gly His 
            260                 265                 270         


Asn Ala Trp Trp Gly Gly Asn Leu Met Gly Val Arg Lys Tyr Pro Val 
        275                 280                 285             


Asn Leu Pro Arg Asn Lys Leu Val Tyr Ser Pro His Val Tyr Gly Pro 
    290                 295                 300                 


Asp Val Tyr Asp Gln Pro Tyr Phe Asp Pro Ala Glu Gly Phe Pro Asp 
305                 310                 315                 320 


Asn Leu Pro Asp Ile Trp Tyr His His Phe Gly Tyr Val Lys Leu Asp 
                325                 330                 335     


Leu Gly Tyr Pro Val Val Ile Gly Glu Phe Gly Gly Lys Tyr Gly His 
            340                 345                 350         


Gly Gly Asp Pro Arg Asp Val Thr Trp Gln Asn Lys Ile Ile Asp Trp 
        355                 360                 365             


Met Ile Gln Asn Lys Phe Cys Asp Phe Phe Tyr Trp Ser Trp Asn Pro 
    370                 375                 380                 


Asn Ser Gly Asp Thr Gly Gly Ile Leu Lys Asp Asp Trp Thr Thr Ile 
385                 390                 395                 400 


Trp Glu Asp Lys Tyr Asn Asn Leu Lys Arg Leu Met Asp Ser Cys Ser 
                405                 410                 415     


Gly Asn Ala Thr Ala Pro Ser Val Pro Thr Thr Thr Thr Thr Thr Ser 
            420                 425                 430         


Thr Pro Pro Thr Thr Thr Thr Thr Thr Thr Ser Thr Pro Thr Thr Thr 
        435                 440                 445             


Thr Gln Thr Pro Thr Thr Thr Thr Pro Thr Thr Thr Thr Thr Thr Thr 
    450                 455                 460                 


Thr Thr Pro Ser Asn Asn Val Pro Phe Glu Ile Val Asn Val Leu Pro 
465                 470                 475                 480 


Thr Ser Ser Gln Tyr Glu Gly Thr Ser Val Glu Val Val Cys Asp Gly 
                485                 490                 495     


Thr Gln Cys Ala Ser Ser Val Trp Gly Ala Pro Asn Leu Trp Gly Val 
            500                 505                 510         


Val Lys Ile Gly Asn Ala Thr Met Asp Pro Asn Val Trp Gly Trp Glu 
        515                 520                 525             


Asp Val Tyr Lys Thr Ala Pro Gln Asp Ile Gly Thr Gly Ser Thr Lys 
    530                 535                 540                 


Met Glu Ile Arg Asn Gly Val Leu Lys Val Thr Asn Leu Trp Asn Ile 
545                 550                 555                 560 


Asn Met His Pro Lys Tyr Asn Thr Met Ala Tyr Pro Glu Val Ile Tyr 
                565                 570                 575     


Gly Ala Lys Pro Trp Gly Asn Gln Pro Ile Asn Ala Pro Asn Phe Val 
            580                 585                 590         


Leu Pro Ile Lys Val Ser Gln Leu Pro Arg Ile Leu Val Asp Thr Lys 
        595                 600                 605             


Tyr Thr Leu Glu Lys Ser Phe Pro Gly Asn Asn Phe Ala Phe Glu Ala 
    610                 615                 620                 


Trp Leu Phe Lys Asp Ala Asn Asn Met Arg Ala Pro Gly Gln Gly Asp 
625                 630                 635                 640 


Tyr Glu Ile Met Val Gln Leu Tyr Ile Glu Gly Gly Tyr Pro Ala Gly 
                645                 650                 655     


Tyr Asp Lys Gly Pro Val Leu Thr Val Asp Val Pro Ile Ile Val Asp 
            660                 665                 670         


Gly Arg Leu Val Asn Gln Thr Phe Glu Leu Tyr Asp Val Ile Ala Asp 
        675                 680                 685             


Ala Gly Trp Arg Phe Phe Thr Phe Lys Pro Thr Lys Asn Tyr Asn Gly 
    690                 695                 700                 


Ser Glu Val Val Phe Asp Tyr Gln Ser Cys Pro Ala Glu Gly Ile Gln 
705                 710                 715                 720 


Arg Ser Arg Arg Cys Trp Arg Glu Leu Asn Asp Pro Phe Leu Leu Gln 
                725                 730                 735     


Lys Leu Leu Tyr Leu Lys Asp Val Phe Gly Arg Gln Leu Lys Asp Pro 
            740                 745                 750         


Ser Leu 
        


<210> 373
<211> 2007
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 373
atgaattgca ctttgaaacc gatggcccgc gtggtcgccg gctgcgtcgc cacgcttgcc     60

ctggccgctt gcggcagcga taccggcagc gagggctaca cgcagcccgt gttcggcgcc    120

actacctaca cccagctcaa ggtcgacggt tataccttca aggatatgaa ccgcaacggc    180

aagatcgacc cgtacgaaga ctggcgcctg tcggccgagg agcgcgccga cgacctgctg    240

tcgcgcctga gcctcgatga aaaagcgggc ctgatgatgc acggcacggc gcccaccgtg    300

tccgaccctt ccggcatcgg cctgggcggc gcgtatgacc tggcggcctt gcaagacctg    360

atcgtgaagc agtatgtcaa cacgtttatt acgcgcatgg cgggcgatac ggccaatatg    420

gcggcccagt acaacaaggt gcaggccctg agcgaaacct cgcgccatgg cattcccgtc    480

tccatcagca cggacccgcg ccaccatttc cagtatacgg tgggcgccag cgcggctacc    540

tccggcttct cgcagtggcc ggaaacactg ggcctggccg ccattggcga tgatgccctg    600

gtgcgccgct tcggcgacat cgcgcgccag gaatacctgg ccgtcggcat cacgcaagcc    660

ctgtcgccgc aggcggacct ggctaccgag ccgcgctggt cgcgcatcaa cggcactttt    720

ggcgaagacg ccgacctggc caagcgcatg gtgcagcact atatcgaggg cttccaggac    780

ggcaatacgg gcttgcatga cggcagcgtg gtggctgtcg tcaagcactg ggtcggctac    840

ggcgccacga aagagggctt tgacggccac aattactatg gccgctacat gacctacccg    900

ggcaacaact tcgcttacca cgtgaagccg ttcgaagggg cgttcacggc caaggcggct    960

tccgtcatgc cgacgtacgc cctgccagac ggcaatatca ccatagccgg catcaccctg   1020

gaacaagtgg cggccggttt cagcaagacc atgttgaccg atctgctgcg cggcaaatat   1080

ggttttgagg gggtgatcct gtccgactgg ggcatcacgt cagactgcga cgccaactgc   1140

cgcaacggca cggcgccggg cgtcgcgcct tcgttcatcg gcttcggcac gccgtggggc   1200

atggaagacg ccaccaaggc cgaacgttac gtgaaggctg tcacggcggg gatggaccag   1260

tttggcggcg tgacggaagc gccgtacctg acgcaagccg tgcagcgggg ccagctgacg   1320

gaagcgcgca tcaacgcctc ggcgcggcgt atcctgatcc aaaaattcaa gcagggcctt   1380

ttcgagcatc catttgtcga tgcggccaag gcagccgcca cggtgggcaa ggcggaattc   1440

gtcgaggcgg gcctggacgc ccagcgccgt tcgctggtcc tgctggaaaa caaggacaag   1500

gtcttgccgc tggcggccag cgtcaagaag gtttacctgt acggcatcga cgcggccgtg   1560

gccaagcagt atggctacac ggtggtggcc acgccgcagg aagcggacgt ggccctgctg   1620

cgggtggcgg cgccctatga aaccctgcac ccgaattaca tcttcggcag catgcagcac   1680

gaaggccgcc tgaactatgc cgatggcgac gccgactacg aggcgatcaa gaacgcggcg   1740

aaattcgcgc cgaagacggt ggtgacggtg tacctggacc gaccggccat tctcggcaac   1800

gtgcaggaca aggccagtgc cattgtcgcc aactttggcg tgagcgacgg cgcgctgttc   1860

gatgtcctga cgggcaaggc caagccgcaa ggcaagctgc cgttcgagct gccctcgtcg   1920

atggccgaag tgcagatgca gaaatcggac gtgccgtatg acacggccca tccgctgtac   1980

aagtttggcg ccgggctggc gtactga                                       2007

<210> 374
<211> 668
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (150)...(381)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (492)...(668)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (240)...(243)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (337)...(340)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (360)...(377)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (387)...(390)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (451)...(454)
<223> N-glycosylation site. Prosite id = PS00001

<400> 374
Met Asn Cys Thr Leu Lys Pro Met Ala Arg Val Val Ala Gly Cys Val 
1               5                   10                  15      


Ala Thr Leu Ala Leu Ala Ala Cys Gly Ser Asp Thr Gly Ser Glu Gly 
            20                  25                  30          


Tyr Thr Gln Pro Val Phe Gly Ala Thr Thr Tyr Thr Gln Leu Lys Val 
        35                  40                  45              


Asp Gly Tyr Thr Phe Lys Asp Met Asn Arg Asn Gly Lys Ile Asp Pro 
    50                  55                  60                  


Tyr Glu Asp Trp Arg Leu Ser Ala Glu Glu Arg Ala Asp Asp Leu Leu 
65                  70                  75                  80  


Ser Arg Leu Ser Leu Asp Glu Lys Ala Gly Leu Met Met His Gly Thr 
                85                  90                  95      


Ala Pro Thr Val Ser Asp Pro Ser Gly Ile Gly Leu Gly Gly Ala Tyr 
            100                 105                 110         


Asp Leu Ala Ala Leu Gln Asp Leu Ile Val Lys Gln Tyr Val Asn Thr 
        115                 120                 125             


Phe Ile Thr Arg Met Ala Gly Asp Thr Ala Asn Met Ala Ala Gln Tyr 
    130                 135                 140                 


Asn Lys Val Gln Ala Leu Ser Glu Thr Ser Arg His Gly Ile Pro Val 
145                 150                 155                 160 


Ser Ile Ser Thr Asp Pro Arg His His Phe Gln Tyr Thr Val Gly Ala 
                165                 170                 175     


Ser Ala Ala Thr Ser Gly Phe Ser Gln Trp Pro Glu Thr Leu Gly Leu 
            180                 185                 190         


Ala Ala Ile Gly Asp Asp Ala Leu Val Arg Arg Phe Gly Asp Ile Ala 
        195                 200                 205             


Arg Gln Glu Tyr Leu Ala Val Gly Ile Thr Gln Ala Leu Ser Pro Gln 
    210                 215                 220                 


Ala Asp Leu Ala Thr Glu Pro Arg Trp Ser Arg Ile Asn Gly Thr Phe 
225                 230                 235                 240 


Gly Glu Asp Ala Asp Leu Ala Lys Arg Met Val Gln His Tyr Ile Glu 
                245                 250                 255     


Gly Phe Gln Asp Gly Asn Thr Gly Leu His Asp Gly Ser Val Val Ala 
            260                 265                 270         


Val Val Lys His Trp Val Gly Tyr Gly Ala Thr Lys Glu Gly Phe Asp 
        275                 280                 285             


Gly His Asn Tyr Tyr Gly Arg Tyr Met Thr Tyr Pro Gly Asn Asn Phe 
    290                 295                 300                 


Ala Tyr His Val Lys Pro Phe Glu Gly Ala Phe Thr Ala Lys Ala Ala 
305                 310                 315                 320 


Ser Val Met Pro Thr Tyr Ala Leu Pro Asp Gly Asn Ile Thr Ile Ala 
                325                 330                 335     


Gly Ile Thr Leu Glu Gln Val Ala Ala Gly Phe Ser Lys Thr Met Leu 
            340                 345                 350         


Thr Asp Leu Leu Arg Gly Lys Tyr Gly Phe Glu Gly Val Ile Leu Ser 
        355                 360                 365             


Asp Trp Gly Ile Thr Ser Asp Cys Asp Ala Asn Cys Arg Asn Gly Thr 
    370                 375                 380                 


Ala Pro Gly Val Ala Pro Ser Phe Ile Gly Phe Gly Thr Pro Trp Gly 
385                 390                 395                 400 


Met Glu Asp Ala Thr Lys Ala Glu Arg Tyr Val Lys Ala Val Thr Ala 
                405                 410                 415     


Gly Met Asp Gln Phe Gly Gly Val Thr Glu Ala Pro Tyr Leu Thr Gln 
            420                 425                 430         


Ala Val Gln Arg Gly Gln Leu Thr Glu Ala Arg Ile Asn Ala Ser Ala 
        435                 440                 445             


Arg Arg Ile Leu Ile Gln Lys Phe Lys Gln Gly Leu Phe Glu His Pro 
    450                 455                 460                 


Phe Val Asp Ala Ala Lys Ala Ala Ala Thr Val Gly Lys Ala Glu Phe 
465                 470                 475                 480 


Val Glu Ala Gly Leu Asp Ala Gln Arg Arg Ser Leu Val Leu Leu Glu 
                485                 490                 495     


Asn Lys Asp Lys Val Leu Pro Leu Ala Ala Ser Val Lys Lys Val Tyr 
            500                 505                 510         


Leu Tyr Gly Ile Asp Ala Ala Val Ala Lys Gln Tyr Gly Tyr Thr Val 
        515                 520                 525             


Val Ala Thr Pro Gln Glu Ala Asp Val Ala Leu Leu Arg Val Ala Ala 
    530                 535                 540                 


Pro Tyr Glu Thr Leu His Pro Asn Tyr Ile Phe Gly Ser Met Gln His 
545                 550                 555                 560 


Glu Gly Arg Leu Asn Tyr Ala Asp Gly Asp Ala Asp Tyr Glu Ala Ile 
                565                 570                 575     


Lys Asn Ala Ala Lys Phe Ala Pro Lys Thr Val Val Thr Val Tyr Leu 
            580                 585                 590         


Asp Arg Pro Ala Ile Leu Gly Asn Val Gln Asp Lys Ala Ser Ala Ile 
        595                 600                 605             


Val Ala Asn Phe Gly Val Ser Asp Gly Ala Leu Phe Asp Val Leu Thr 
    610                 615                 620                 


Gly Lys Ala Lys Pro Gln Gly Lys Leu Pro Phe Glu Leu Pro Ser Ser 
625                 630                 635                 640 


Met Ala Glu Val Gln Met Gln Lys Ser Asp Val Pro Tyr Asp Thr Ala 
                645                 650                 655     


His Pro Leu Tyr Lys Phe Gly Ala Gly Leu Ala Tyr 
            660                 665             


<210> 375
<211> 2244
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 375
atgtcgttat tcagaccaca ccctctgaaa accgctctgg caactgttct gctcggagcc     60

ctcaccggac aagcattggc cgcccccgca ccgggcgttg atgctttgac gttgaaacag    120

aaaattggcc agttgcagca aatgcacagt gaggccaaaa ccgttggaga aatccccgag    180

gaactcaaag ccgccatccg caatggccaa gtgggttcta tcctgaacat cggcagcccc    240

gaggtcgcca acgaactgca gcgcatcgcc atgacggaaa gcgccgccaa gatcccgctg    300

attttcgccc gggatgtaat ccacggttat cgcaccatca tgcctactcc cttggggcag    360

gcggcgagct ggaatgcgga cttggtggag aaaggcgctc gcgctgcggc ggtggaagcg    420

tccagcgtgg gtatccgctg gaccttcgcg ccgatgatcg acatctcccg cgactcccgc    480

tggggccgca ttaccgaatc ttttggcgaa gacccttacc tcacctccgt tctgaccgtc    540

gcatccgtgc gcggttatca gggcgattct ctgagcgatc ccactagtat tgccgccacc    600

gccaaacatt ttgtcggcta cggcgcggcc gagggcgggc gggactacaa cgtcacttac    660

atccccgagc cgctgctgca caacatctat ctgccgccat tcaaagccgc cgtggatgca    720

ggggttgcga ccatcatgtc cgcattcaat gaactcaacg gtgtgcctgc gtctgcccac    780

gattacgcaa tcaatacggt attgcgtggc gaatttggtt tcgacggcgt gatcgtcagc    840

gactgggatt cggtgctgga actgattgct cacggttacg cagcggataa acaacaagcc    900

gcattgcagg ctttgaaggc tggcctcgac attgaaatgg tctccaccac catgcaagag    960

aacttgccag cgctgctcga cagcggcaag gtgagcgagg aggaagtgga tgtaaaagtg   1020

acccggatca ttgacttaaa acgcaatctt ggtttgttcg agaagcccta tatccaaggt   1080

gatccttcga agtcaatact cactccagag cacaagagct tgagcgaaca gctggccctc   1140

gaaagtcttg tgctgctgaa aaatgacaag cagaccctgc cgttggcgaa gggaaagcgc   1200

gtcgctttga ttggcccgct cgcccacgcc gcgcatgacc agttgggttc atgggtgctg   1260

gatgcgcaga agcaggattc ggtgaccgtg ctggatgcct tccgcaatgt gcttagccag   1320

gacaagttgt tctactctca ggcgctttac agcagccggt cccgcgacag caaagatttc   1380

gcccaagcca tcgagcaggc gaataaggct gacgtgattg tttatgtggg cggtgaagaa   1440

gcggtgttga gcggcgaggc gcacagtcgt gcggataccc gtctgccggg tgcacaggaa   1500

caattgatcc gcgaactgaa gaagaccggc aagccattgg tggtggtgat tatggcgggc   1560

cggccgattg cgatgaatga cgttatcgac gaaattgatg cgctggtgat ggcatggcat   1620

ccgggcacca tgggcggccc ggcgattgtc cgtttgctgc aaggggatgc ggagcccgtg   1680

ggccgtctgc cggtcacttg gccgaaagtc accggccaga tgccgatcta ttacaaccat   1740

ccgagttctg gtcgcccggc gtccgaaggc aacttcaccc agatggatga ttttccgctg   1800

gaggctttcc aacactctac cggtcacaaa aatcactaca ttgatatagg ttttaaaccg   1860

caatttccgt ttggatttgg tctgagttat tccacggtga catacagcga tatcaaggtg   1920

gataaaaaag ccatcgcgat caacggcgaa ttaaaggtgt ccgccaaaat caccaatagc   1980

ggcaaacgcc cggtgacgga aacggcgcag ctttatattc gcgatttagt cgccagcagc   2040

gtacgaccgg tacgggagtt gaagagcttc cagcgcgtta ccctgaagcc gaaagaatcc   2100

aagcgagtta ccttcacgtt acgcgagagc gatttggcgt tttacaacca gaaattggag   2160

cacctggtgg agcctggtga attccatgtt tggattgcgc ctaatgcgga agagggcgta   2220

aaagcggaat ttgctgtgcg ttga                                          2244

<210> 376
<211> 747
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(27)

<220> 
<221> DOMAIN
<222> (95)...(314)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (383)...(632)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (220)...(223)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (271)...(288)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (600)...(603)
<223> N-glycosylation site. Prosite id = PS00001

<400> 376
Met Ser Leu Phe Arg Pro His Pro Leu Lys Thr Ala Leu Ala Thr Val 
1               5                   10                  15      


Leu Leu Gly Ala Leu Thr Gly Gln Ala Leu Ala Ala Pro Ala Pro Gly 
            20                  25                  30          


Val Asp Ala Leu Thr Leu Lys Gln Lys Ile Gly Gln Leu Gln Gln Met 
        35                  40                  45              


His Ser Glu Ala Lys Thr Val Gly Glu Ile Pro Glu Glu Leu Lys Ala 
    50                  55                  60                  


Ala Ile Arg Asn Gly Gln Val Gly Ser Ile Leu Asn Ile Gly Ser Pro 
65                  70                  75                  80  


Glu Val Ala Asn Glu Leu Gln Arg Ile Ala Met Thr Glu Ser Ala Ala 
                85                  90                  95      


Lys Ile Pro Leu Ile Phe Ala Arg Asp Val Ile His Gly Tyr Arg Thr 
            100                 105                 110         


Ile Met Pro Thr Pro Leu Gly Gln Ala Ala Ser Trp Asn Ala Asp Leu 
        115                 120                 125             


Val Glu Lys Gly Ala Arg Ala Ala Ala Val Glu Ala Ser Ser Val Gly 
    130                 135                 140                 


Ile Arg Trp Thr Phe Ala Pro Met Ile Asp Ile Ser Arg Asp Ser Arg 
145                 150                 155                 160 


Trp Gly Arg Ile Thr Glu Ser Phe Gly Glu Asp Pro Tyr Leu Thr Ser 
                165                 170                 175     


Val Leu Thr Val Ala Ser Val Arg Gly Tyr Gln Gly Asp Ser Leu Ser 
            180                 185                 190         


Asp Pro Thr Ser Ile Ala Ala Thr Ala Lys His Phe Val Gly Tyr Gly 
        195                 200                 205             


Ala Ala Glu Gly Gly Arg Asp Tyr Asn Val Thr Tyr Ile Pro Glu Pro 
    210                 215                 220                 


Leu Leu His Asn Ile Tyr Leu Pro Pro Phe Lys Ala Ala Val Asp Ala 
225                 230                 235                 240 


Gly Val Ala Thr Ile Met Ser Ala Phe Asn Glu Leu Asn Gly Val Pro 
                245                 250                 255     


Ala Ser Ala His Asp Tyr Ala Ile Asn Thr Val Leu Arg Gly Glu Phe 
            260                 265                 270         


Gly Phe Asp Gly Val Ile Val Ser Asp Trp Asp Ser Val Leu Glu Leu 
        275                 280                 285             


Ile Ala His Gly Tyr Ala Ala Asp Lys Gln Gln Ala Ala Leu Gln Ala 
    290                 295                 300                 


Leu Lys Ala Gly Leu Asp Ile Glu Met Val Ser Thr Thr Met Gln Glu 
305                 310                 315                 320 


Asn Leu Pro Ala Leu Leu Asp Ser Gly Lys Val Ser Glu Glu Glu Val 
                325                 330                 335     


Asp Val Lys Val Thr Arg Ile Ile Asp Leu Lys Arg Asn Leu Gly Leu 
            340                 345                 350         


Phe Glu Lys Pro Tyr Ile Gln Gly Asp Pro Ser Lys Ser Ile Leu Thr 
        355                 360                 365             


Pro Glu His Lys Ser Leu Ser Glu Gln Leu Ala Leu Glu Ser Leu Val 
    370                 375                 380                 


Leu Leu Lys Asn Asp Lys Gln Thr Leu Pro Leu Ala Lys Gly Lys Arg 
385                 390                 395                 400 


Val Ala Leu Ile Gly Pro Leu Ala His Ala Ala His Asp Gln Leu Gly 
                405                 410                 415     


Ser Trp Val Leu Asp Ala Gln Lys Gln Asp Ser Val Thr Val Leu Asp 
            420                 425                 430         


Ala Phe Arg Asn Val Leu Ser Gln Asp Lys Leu Phe Tyr Ser Gln Ala 
        435                 440                 445             


Leu Tyr Ser Ser Arg Ser Arg Asp Ser Lys Asp Phe Ala Gln Ala Ile 
    450                 455                 460                 


Glu Gln Ala Asn Lys Ala Asp Val Ile Val Tyr Val Gly Gly Glu Glu 
465                 470                 475                 480 


Ala Val Leu Ser Gly Glu Ala His Ser Arg Ala Asp Thr Arg Leu Pro 
                485                 490                 495     


Gly Ala Gln Glu Gln Leu Ile Arg Glu Leu Lys Lys Thr Gly Lys Pro 
            500                 505                 510         


Leu Val Val Val Ile Met Ala Gly Arg Pro Ile Ala Met Asn Asp Val 
        515                 520                 525             


Ile Asp Glu Ile Asp Ala Leu Val Met Ala Trp His Pro Gly Thr Met 
    530                 535                 540                 


Gly Gly Pro Ala Ile Val Arg Leu Leu Gln Gly Asp Ala Glu Pro Val 
545                 550                 555                 560 


Gly Arg Leu Pro Val Thr Trp Pro Lys Val Thr Gly Gln Met Pro Ile 
                565                 570                 575     


Tyr Tyr Asn His Pro Ser Ser Gly Arg Pro Ala Ser Glu Gly Asn Phe 
            580                 585                 590         


Thr Gln Met Asp Asp Phe Pro Leu Glu Ala Phe Gln His Ser Thr Gly 
        595                 600                 605             


His Lys Asn His Tyr Ile Asp Ile Gly Phe Lys Pro Gln Phe Pro Phe 
    610                 615                 620                 


Gly Phe Gly Leu Ser Tyr Ser Thr Val Thr Tyr Ser Asp Ile Lys Val 
625                 630                 635                 640 


Asp Lys Lys Ala Ile Ala Ile Asn Gly Glu Leu Lys Val Ser Ala Lys 
                645                 650                 655     


Ile Thr Asn Ser Gly Lys Arg Pro Val Thr Glu Thr Ala Gln Leu Tyr 
            660                 665                 670         


Ile Arg Asp Leu Val Ala Ser Ser Val Arg Pro Val Arg Glu Leu Lys 
        675                 680                 685             


Ser Phe Gln Arg Val Thr Leu Lys Pro Lys Glu Ser Lys Arg Val Thr 
    690                 695                 700                 


Phe Thr Leu Arg Glu Ser Asp Leu Ala Phe Tyr Asn Gln Lys Leu Glu 
705                 710                 715                 720 


His Leu Val Glu Pro Gly Glu Phe His Val Trp Ile Ala Pro Asn Ala 
                725                 730                 735     


Glu Glu Gly Val Lys Ala Glu Phe Ala Val Arg 
            740                 745         


<210> 377
<211> 2250
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 377
atgacggtgg aagaaaaagt caatatggtc gtcggtgggg gcatgttcgt gccgggcatg     60

cagatgccgg gtgccgcggc ccaggcctcc gatgcccaga agagggtttt aggcgcggcg    120

ggaaccagtt ttgccatccc gcgcctgggc attcccggga tcgtggtatg cgacggaccc    180

gcgggcatac acgcgttcaa cgcggggaag agcagggtat actatgccac cgcatggccg    240

atcggtactt tgctggcctc gagctgggac accgccctgg tgcggaaagt aggtgcagtc    300

tatggcgccg aagcaaaaga atacggaatc gatgtcatcc tcggtcccgg catgaatatc    360

caccgcaacc cacttggcgc gcggaacttt gaatactatt ctgaagaccc ggtgcttacc    420

gggattatcg cagctgccat ggtcaacggg attcagtcga acggcgttgg cacttccccc    480

aaacatttct ttgccaacaa ccaggagacc aaccgcaaca ccgtaaacac gatcatgagc    540

gaaagggcca tgcgtgaaat ttatctccgg ggctggcagc tcatgctgaa gcattccagt    600

ccatggacga tcatgagttc ctataacctc gtgaacggac cgtacacctc tgaaaatccc    660

gaactgctca acaccatcct gcgcaaggaa tggggcttcc agggttttgt catgaccgac    720

tggttcggtg gtaaggatgc cgtagcccag caaaaagcag gcaacgacct gctcatgccc    780

ggaacgccgc aacaaaagaa agccgtcatg gacgccctga aaagtgggca gctggacgag    840

caagtgctgg accagaacgt tgctgccatc ctgaatatcg tcctgaaatc tccgaactac    900

gccaattaca aatatagcga taaccctccc ctgaaggaca atgcgcagat ttccaggcag    960

gcggccgcgg agagcatggt cttgttgaag aatgaaggca aggcgctgcc gatcgcgccg   1020

ggcatgagcc tggcggtatt cggaaacagc gccgtggaac tggttgcggg cggcaccgga   1080

agtggtgacg tcaacaagat gtactccgtt cccctgtttg acggcctttt taaagcgggc   1140

tttgcgctga atacggacct gtaccgggca tatacccaat actatgcggc cgaacaggca   1200

aagcgtccga agcgaaacgc cttcgaggac atgtttgccc ccaaggcgcc gatcgcggaa   1260

atgagcatca ccccggaaga gatccgcaag gcggcaacgc attcgtccat cgccatcgtg   1320

gctattggtc gcaatgcggg cgaaggcaaa gacagggtac tgaaagatga ttacttcctg   1380

acggaacagg aattggccct cgtgaaaaat gtgtcggcag ccttccacgc gcagaacaag   1440

aaagtggtgg tggtattgaa tatcggcggt gtcatcgatg tatctgcatg gcgcgacgcg   1500

gtggatgcca tcctgctggg ctggcagccg ggtctcgaag gcggcaatgc ctttgccgac   1560

ctgatctccg gcaaggtaaa cccatcgggc aaactggcta caaccttccc cctccggtat   1620

gaagatgata tcacgtcgaa aaatttcccc ggccgggaaa tccccgggac cgagaagccc   1680

ggcatgatgg gcatgaaaac cgttgatgcc gaagtcgtgt atgaagaagg tgtatatgtg   1740

ggctatcgtt attatagcac attcggtgtg cagactgcct atccctttgg ttatggcctc   1800

tcctataccc agttcagctt tggaaacctg aacattggtg tgccggatgc cgatggcaat   1860

gtacaggtaa gcgttaccgt tactaatacc ggaacggtag caggcaggga agtggcgcag   1920

ctatatgtca gcgcgcctgc cgggaaaatc gacaagcctg tgctggagct gaaggcattt   1980

gacaaaacga aactgctgca gccgggagaa tcccaggagc tccgtttcac acttgcaccc   2040

gcagacctgg catccttcca taccgcgcag agcgcatgga tcgcggaagc cggagcatac   2100

acggtgaaat tcggcaacgc gcagcaggct gtactgtccg gcagtttcaa gctccccaaa   2160

cagctgatcg tggaaaaagt gaacaaggtg attgtgccga aagtggcgat caatgaactg   2220

aaacctcccg cggggaaatc gaagaaataa                                    2250

<210> 378
<211> 749
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (43)...(260)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (326)...(604)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (229)...(246)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (361)...(367)
<223> Tubulin subunits alpha, beta, and gamma signature. Prosite id = PS00227

<220> 
<221> SITE
<222> (477)...(480)
<223> N-glycosylation site. Prosite id = PS00001

<400> 378
Met Thr Val Glu Glu Lys Val Asn Met Val Val Gly Gly Gly Met Phe 
1               5                   10                  15      


Val Pro Gly Met Gln Met Pro Gly Ala Ala Ala Gln Ala Ser Asp Ala 
            20                  25                  30          


Gln Lys Arg Val Leu Gly Ala Ala Gly Thr Ser Phe Ala Ile Pro Arg 
        35                  40                  45              


Leu Gly Ile Pro Gly Ile Val Val Cys Asp Gly Pro Ala Gly Ile His 
    50                  55                  60                  


Ala Phe Asn Ala Gly Lys Ser Arg Val Tyr Tyr Ala Thr Ala Trp Pro 
65                  70                  75                  80  


Ile Gly Thr Leu Leu Ala Ser Ser Trp Asp Thr Ala Leu Val Arg Lys 
                85                  90                  95      


Val Gly Ala Val Tyr Gly Ala Glu Ala Lys Glu Tyr Gly Ile Asp Val 
            100                 105                 110         


Ile Leu Gly Pro Gly Met Asn Ile His Arg Asn Pro Leu Gly Ala Arg 
        115                 120                 125             


Asn Phe Glu Tyr Tyr Ser Glu Asp Pro Val Leu Thr Gly Ile Ile Ala 
    130                 135                 140                 


Ala Ala Met Val Asn Gly Ile Gln Ser Asn Gly Val Gly Thr Ser Pro 
145                 150                 155                 160 


Lys His Phe Phe Ala Asn Asn Gln Glu Thr Asn Arg Asn Thr Val Asn 
                165                 170                 175     


Thr Ile Met Ser Glu Arg Ala Met Arg Glu Ile Tyr Leu Arg Gly Trp 
            180                 185                 190         


Gln Leu Met Leu Lys His Ser Ser Pro Trp Thr Ile Met Ser Ser Tyr 
        195                 200                 205             


Asn Leu Val Asn Gly Pro Tyr Thr Ser Glu Asn Pro Glu Leu Leu Asn 
    210                 215                 220                 


Thr Ile Leu Arg Lys Glu Trp Gly Phe Gln Gly Phe Val Met Thr Asp 
225                 230                 235                 240 


Trp Phe Gly Gly Lys Asp Ala Val Ala Gln Gln Lys Ala Gly Asn Asp 
                245                 250                 255     


Leu Leu Met Pro Gly Thr Pro Gln Gln Lys Lys Ala Val Met Asp Ala 
            260                 265                 270         


Leu Lys Ser Gly Gln Leu Asp Glu Gln Val Leu Asp Gln Asn Val Ala 
        275                 280                 285             


Ala Ile Leu Asn Ile Val Leu Lys Ser Pro Asn Tyr Ala Asn Tyr Lys 
    290                 295                 300                 


Tyr Ser Asp Asn Pro Pro Leu Lys Asp Asn Ala Gln Ile Ser Arg Gln 
305                 310                 315                 320 


Ala Ala Ala Glu Ser Met Val Leu Leu Lys Asn Glu Gly Lys Ala Leu 
                325                 330                 335     


Pro Ile Ala Pro Gly Met Ser Leu Ala Val Phe Gly Asn Ser Ala Val 
            340                 345                 350         


Glu Leu Val Ala Gly Gly Thr Gly Ser Gly Asp Val Asn Lys Met Tyr 
        355                 360                 365             


Ser Val Pro Leu Phe Asp Gly Leu Phe Lys Ala Gly Phe Ala Leu Asn 
    370                 375                 380                 


Thr Asp Leu Tyr Arg Ala Tyr Thr Gln Tyr Tyr Ala Ala Glu Gln Ala 
385                 390                 395                 400 


Lys Arg Pro Lys Arg Asn Ala Phe Glu Asp Met Phe Ala Pro Lys Ala 
                405                 410                 415     


Pro Ile Ala Glu Met Ser Ile Thr Pro Glu Glu Ile Arg Lys Ala Ala 
            420                 425                 430         


Thr His Ser Ser Ile Ala Ile Val Ala Ile Gly Arg Asn Ala Gly Glu 
        435                 440                 445             


Gly Lys Asp Arg Val Leu Lys Asp Asp Tyr Phe Leu Thr Glu Gln Glu 
    450                 455                 460                 


Leu Ala Leu Val Lys Asn Val Ser Ala Ala Phe His Ala Gln Asn Lys 
465                 470                 475                 480 


Lys Val Val Val Val Leu Asn Ile Gly Gly Val Ile Asp Val Ser Ala 
                485                 490                 495     


Trp Arg Asp Ala Val Asp Ala Ile Leu Leu Gly Trp Gln Pro Gly Leu 
            500                 505                 510         


Glu Gly Gly Asn Ala Phe Ala Asp Leu Ile Ser Gly Lys Val Asn Pro 
        515                 520                 525             


Ser Gly Lys Leu Ala Thr Thr Phe Pro Leu Arg Tyr Glu Asp Asp Ile 
    530                 535                 540                 


Thr Ser Lys Asn Phe Pro Gly Arg Glu Ile Pro Gly Thr Glu Lys Pro 
545                 550                 555                 560 


Gly Met Met Gly Met Lys Thr Val Asp Ala Glu Val Val Tyr Glu Glu 
                565                 570                 575     


Gly Val Tyr Val Gly Tyr Arg Tyr Tyr Ser Thr Phe Gly Val Gln Thr 
            580                 585                 590         


Ala Tyr Pro Phe Gly Tyr Gly Leu Ser Tyr Thr Gln Phe Ser Phe Gly 
        595                 600                 605             


Asn Leu Asn Ile Gly Val Pro Asp Ala Asp Gly Asn Val Gln Val Ser 
    610                 615                 620                 


Val Thr Val Thr Asn Thr Gly Thr Val Ala Gly Arg Glu Val Ala Gln 
625                 630                 635                 640 


Leu Tyr Val Ser Ala Pro Ala Gly Lys Ile Asp Lys Pro Val Leu Glu 
                645                 650                 655     


Leu Lys Ala Phe Asp Lys Thr Lys Leu Leu Gln Pro Gly Glu Ser Gln 
            660                 665                 670         


Glu Leu Arg Phe Thr Leu Ala Pro Ala Asp Leu Ala Ser Phe His Thr 
        675                 680                 685             


Ala Gln Ser Ala Trp Ile Ala Glu Ala Gly Ala Tyr Thr Val Lys Phe 
    690                 695                 700                 


Gly Asn Ala Gln Gln Ala Val Leu Ser Gly Ser Phe Lys Leu Pro Lys 
705                 710                 715                 720 


Gln Leu Ile Val Glu Lys Val Asn Lys Val Ile Val Pro Lys Val Ala 
                725                 730                 735     


Ile Asn Glu Leu Lys Pro Pro Ala Gly Lys Ser Lys Lys 
            740                 745                 


<210> 379
<211> 2367
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 379
atgaaaaaag catttatgat tttaggcgca gcacttgtca ccttgggtgc aagcgcacaa     60

cccaaactca ccaaggacaa cattgatgaa gtgctcaagg ctatgaccct cgaggagaag    120

gccgagatcg tagtaggagg aggctggggc agtatgattg ccgcttccgt tcccacaggt    180

tccgcatccc tcgtctctgg cgccgcaggt accaccaatc ccattgcccg tctcggcatc    240

cctgcgaccg tcctcgctga cggccctgcc ggactgcgca tcaatcctac ccgtcccgat    300

acagaccaga cattctactg caccggtttc cccgtaggta ctgtccttgc ttgttcctgg    360

aacaccgctc tcgtggagga actcaccacg gcaatgggca acgaagtgct cgaatatggc    420

gctgacgtgc tcctcgctcc tggtatgaac atccaccgca accctctctg cggccgtaac    480

ttcgagtatt tctcagagga tcctatcctc tccggcaaga tgggtgctgc atatatcaag    540

ggtattcaga gcaatggtgt cggtgtatct gcgaagcatt tcgcattcaa caaccaggaa    600

atcaaccgta gcgacaatat ggccaatgtc tctaagagag cagctcgcga gatttatctc    660

aagaactttg agattgctgt ccgcgaggcc cagccttgga ccattatgtc atcctacaat    720

caggtgaatg gcgactacac ccagcagagc cacgaccttc tcaccagcat cctccgcgag    780

gactggggtt atgaaggcat cgtgatgacc gactggggta ccaaggacgg cacagtcaag    840

gctgtttacg caggcaatga cctgatggag ccgggtgcag acatcgagaa gagccgcatc    900

atcgcagctg tcaaggacgg cagcctcgat gtcgcagacc tcgaccgcaa tgtccgcaga    960

atgctcgagt acatcgtcaa gactcctcgc ttcaagggtt acaaattctc caacaagcct   1020

gacctcaacg ctcacgccgc actcgtgcgc aagggcgcag ctgaaggtat ggtactcctc   1080

aagaacgaag acgctctccc tctggccaag gatatcaaga acgtggccct gttcggtctc   1140

aacgcctata agtccatcgc tggtggtaca ggctcaggca atgtcaacaa gccttacatc   1200

cgcaacgtcg atgagggtct ggcagccgca ggtctcaagg ttgatgagaa actcgccaag   1260

ttctacaagg actacagaac ttacaatgag agtcttaacg acatcaatgg aagcggtggc   1320

gattcattcg gcatcctcct cggagaggcc gttctcgcag aggtcggcat cgccaagagc   1380

gcagtcgagg cttctgagaa gcgcaacgat gcagctatcg tcgtcatcgg ccgcaatgct   1440

ggtgaaggtg atgaccgtag ggttcccaat gatttcgagc tcacctctga ggagcgtgaa   1500

cttctcgcca atgtcgagaa cgtttaccac aaggctggca agaaagtcgt cgtggtcctc   1560

aatatcggcg gtgtcatcga gactgcatcc tggaagagcc ttcccgatgc cattctcctc   1620

gcttggactc ccggacagga agtcggtaac agcgtggcag atgtgctcgt aggcaagtcc   1680

aatccttctg gaaagctcgc tatgacattc cctatgaggt atcttgatca tccttcatca   1740

ttcaatttcc cttacaacgg acagcccgca aagggcaatg caggtgctat tgatattgcc   1800

gcccttatgg gactcagcgt caagcctcag cccgtcaagg atattgacta tacagactac   1860

aatgaaggta tctgggtggg atatcgttac tttgacacag caggcaagga agtctcctat   1920

cctttcggtt atggcctttc atataccaca ttcgcttata gcgcacccgt ggtcaaggct   1980

accaaggacg gtggcatcac agccagcatc actgtcaaga ataccggttc tgtcgcaggt   2040

aaggaagctg ttcagctcta tgtctctgca cctgcaggcg gtctcgtgaa gcctgccaag   2100

gagctcaagg cattcgccaa gacacgcgaa ctccagcctg gtgagagcca aaccctcact   2160

atggaagtat gcgcctacac tctcgcctcc ttcaatgaag atgcttcaca gtgggagact   2220

cctgccggca catacaccgt gaagttcggt gcttccgtag cagacatccg cgcttctgta   2280

cccgttcagc tcaagaaagc tcagagctgg aaggtcaacg atgtcctcgc tcccgtcaag   2340

gaaatcgctg aaatctcagt gaaataa                                       2367

<210> 380
<211> 788
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (72)...(291)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (357)...(650)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (205)...(208)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (212)...(215)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (435)...(438)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (442)...(445)
<223> N-glycosylation site. Prosite id = PS00001

<400> 380
Met Lys Lys Ala Phe Met Ile Leu Gly Ala Ala Leu Val Thr Leu Gly 
1               5                   10                  15      


Ala Ser Ala Gln Pro Lys Leu Thr Lys Asp Asn Ile Asp Glu Val Leu 
            20                  25                  30          


Lys Ala Met Thr Leu Glu Glu Lys Ala Glu Ile Val Val Gly Gly Gly 
        35                  40                  45              


Trp Gly Ser Met Ile Ala Ala Ser Val Pro Thr Gly Ser Ala Ser Leu 
    50                  55                  60                  


Val Ser Gly Ala Ala Gly Thr Thr Asn Pro Ile Ala Arg Leu Gly Ile 
65                  70                  75                  80  


Pro Ala Thr Val Leu Ala Asp Gly Pro Ala Gly Leu Arg Ile Asn Pro 
                85                  90                  95      


Thr Arg Pro Asp Thr Asp Gln Thr Phe Tyr Cys Thr Gly Phe Pro Val 
            100                 105                 110         


Gly Thr Val Leu Ala Cys Ser Trp Asn Thr Ala Leu Val Glu Glu Leu 
        115                 120                 125             


Thr Thr Ala Met Gly Asn Glu Val Leu Glu Tyr Gly Ala Asp Val Leu 
    130                 135                 140                 


Leu Ala Pro Gly Met Asn Ile His Arg Asn Pro Leu Cys Gly Arg Asn 
145                 150                 155                 160 


Phe Glu Tyr Phe Ser Glu Asp Pro Ile Leu Ser Gly Lys Met Gly Ala 
                165                 170                 175     


Ala Tyr Ile Lys Gly Ile Gln Ser Asn Gly Val Gly Val Ser Ala Lys 
            180                 185                 190         


His Phe Ala Phe Asn Asn Gln Glu Ile Asn Arg Ser Asp Asn Met Ala 
        195                 200                 205             


Asn Val Ser Lys Arg Ala Ala Arg Glu Ile Tyr Leu Lys Asn Phe Glu 
    210                 215                 220                 


Ile Ala Val Arg Glu Ala Gln Pro Trp Thr Ile Met Ser Ser Tyr Asn 
225                 230                 235                 240 


Gln Val Asn Gly Asp Tyr Thr Gln Gln Ser His Asp Leu Leu Thr Ser 
                245                 250                 255     


Ile Leu Arg Glu Asp Trp Gly Tyr Glu Gly Ile Val Met Thr Asp Trp 
            260                 265                 270         


Gly Thr Lys Asp Gly Thr Val Lys Ala Val Tyr Ala Gly Asn Asp Leu 
        275                 280                 285             


Met Glu Pro Gly Ala Asp Ile Glu Lys Ser Arg Ile Ile Ala Ala Val 
    290                 295                 300                 


Lys Asp Gly Ser Leu Asp Val Ala Asp Leu Asp Arg Asn Val Arg Arg 
305                 310                 315                 320 


Met Leu Glu Tyr Ile Val Lys Thr Pro Arg Phe Lys Gly Tyr Lys Phe 
                325                 330                 335     


Ser Asn Lys Pro Asp Leu Asn Ala His Ala Ala Leu Val Arg Lys Gly 
            340                 345                 350         


Ala Ala Glu Gly Met Val Leu Leu Lys Asn Glu Asp Ala Leu Pro Leu 
        355                 360                 365             


Ala Lys Asp Ile Lys Asn Val Ala Leu Phe Gly Leu Asn Ala Tyr Lys 
    370                 375                 380                 


Ser Ile Ala Gly Gly Thr Gly Ser Gly Asn Val Asn Lys Pro Tyr Ile 
385                 390                 395                 400 


Arg Asn Val Asp Glu Gly Leu Ala Ala Ala Gly Leu Lys Val Asp Glu 
                405                 410                 415     


Lys Leu Ala Lys Phe Tyr Lys Asp Tyr Arg Thr Tyr Asn Glu Ser Leu 
            420                 425                 430         


Asn Asp Ile Asn Gly Ser Gly Gly Asp Ser Phe Gly Ile Leu Leu Gly 
        435                 440                 445             


Glu Ala Val Leu Ala Glu Val Gly Ile Ala Lys Ser Ala Val Glu Ala 
    450                 455                 460                 


Ser Glu Lys Arg Asn Asp Ala Ala Ile Val Val Ile Gly Arg Asn Ala 
465                 470                 475                 480 


Gly Glu Gly Asp Asp Arg Arg Val Pro Asn Asp Phe Glu Leu Thr Ser 
                485                 490                 495     


Glu Glu Arg Glu Leu Leu Ala Asn Val Glu Asn Val Tyr His Lys Ala 
            500                 505                 510         


Gly Lys Lys Val Val Val Val Leu Asn Ile Gly Gly Val Ile Glu Thr 
        515                 520                 525             


Ala Ser Trp Lys Ser Leu Pro Asp Ala Ile Leu Leu Ala Trp Thr Pro 
    530                 535                 540                 


Gly Gln Glu Val Gly Asn Ser Val Ala Asp Val Leu Val Gly Lys Ser 
545                 550                 555                 560 


Asn Pro Ser Gly Lys Leu Ala Met Thr Phe Pro Met Arg Tyr Leu Asp 
                565                 570                 575     


His Pro Ser Ser Phe Asn Phe Pro Tyr Asn Gly Gln Pro Ala Lys Gly 
            580                 585                 590         


Asn Ala Gly Ala Ile Asp Ile Ala Ala Leu Met Gly Leu Ser Val Lys 
        595                 600                 605             


Pro Gln Pro Val Lys Asp Ile Asp Tyr Thr Asp Tyr Asn Glu Gly Ile 
    610                 615                 620                 


Trp Val Gly Tyr Arg Tyr Phe Asp Thr Ala Gly Lys Glu Val Ser Tyr 
625                 630                 635                 640 


Pro Phe Gly Tyr Gly Leu Ser Tyr Thr Thr Phe Ala Tyr Ser Ala Pro 
                645                 650                 655     


Val Val Lys Ala Thr Lys Asp Gly Gly Ile Thr Ala Ser Ile Thr Val 
            660                 665                 670         


Lys Asn Thr Gly Ser Val Ala Gly Lys Glu Ala Val Gln Leu Tyr Val 
        675                 680                 685             


Ser Ala Pro Ala Gly Gly Leu Val Lys Pro Ala Lys Glu Leu Lys Ala 
    690                 695                 700                 


Phe Ala Lys Thr Arg Glu Leu Gln Pro Gly Glu Ser Gln Thr Leu Thr 
705                 710                 715                 720 


Met Glu Val Cys Ala Tyr Thr Leu Ala Ser Phe Asn Glu Asp Ala Ser 
                725                 730                 735     


Gln Trp Glu Thr Pro Ala Gly Thr Tyr Thr Val Lys Phe Gly Ala Ser 
            740                 745                 750         


Val Ala Asp Ile Arg Ala Ser Val Pro Val Gln Leu Lys Lys Ala Gln 
        755                 760                 765             


Ser Trp Lys Val Asn Asp Val Leu Ala Pro Val Lys Glu Ile Ala Glu 
    770                 775                 780                 


Ile Ser Val Lys 
785             


<210> 381
<211> 2172
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 381
atgcggcccg atcccaagat ggaccagttc atcagcgact tgatcgcgaa gatgacgctg     60

gacgagaaga tcggccagct caccctgttg acgaccgact gggagtcgac cgggccgacg    120

ctgcgtgccg gctataagga agatgtcgcc gcggggcgcg tgggcgcgat gttcaatgcg    180

cattcggtga agttcacgcg tgagcttcag cgcatggcgg tggaggagac gcggctcaag    240

attccgctgc tgttcggcta tgacgtgatc cacggccacc gcacgatgtt cccgatctcg    300

ctcggcgagg cggcttcatg ggaccttgcg gcgatcgaga aggccgcgcg catttcggcg    360

atcgaaggcg cggccgaggg gctgcactgg acctttgcgc cgatggtcga cattgcgcgc    420

gatccgcgct ggggccgcat gtccgaaggc gcgggcgagg acgtctatct cggcagccgc    480

atcgccgagg cacgcgtgcg cggctatcag ggcaagcgca tcggcgatac cgacagcctg    540

atcgccaccg tcaagcattt cgccgcctat ggcgcggcgc aggccggccg tgactatcac    600

accaccgaca tgtcggatcg cgagctgcga gacacccacc tgcccccgtt caaggctgcc    660

atcgacgctg gcgcggcgac ggtgatgacc tcgttcaacg agctcaacgg cattccggcg    720

agcggcaaca gctatctgct gaccgacatc ctgcgcaagg aatggggctt caagggcttt    780

gtcgtaaccg attacacgtc gatcaacgaa atggtcccgc acggctattc caaggacgag    840

gctcaggccg gcgaacaggc gatcaatgcc ggggtcgaca tggacatgca gggcgcggtg    900

ttcatgaacc acctcgccaa atcggttgcc gaagggcggg tgccgatggc gcggatcgac    960

gccgccgtgc gctcggtgct cgagatgaag taccggctcg gcctgttcgc cgacccctat   1020

cgcttcagcg acgccgcgcg cgaaaaggcg cgtgtcggca ccgccgagca caaggccgcc   1080

gcgcgcgacg tggcgcgcaa gtcgatggtg ctgctcaaga acgacggttc gcttccgctc   1140

gtcgcatcgg cgcgcaaaat cgcggtgatc ggcccgctcg ccgacagcaa gcccgacatg   1200

atcggcagtt gggcggcgca gggcgatcgc cagggatcgg tcaccgtgct cgaggggatc   1260

cgcgcgcgcg ccaagggcgc gacggtcagc tatgccaagg gcgccagcta ccagttcgag   1320

gatgccggca agaccgacgg ctttgccgag gcgctggccg ccgcgcgcga cgccgatgtc   1380

atcgttgcgg cgatgggcga gcattacgac cataccggcg aagcggctag ccgcacctcg   1440

ctcgacctgc cgggcaatca gcaggcgctg ctcgaagcgc tcaaggcgac gggcaagccg   1500

gtggtgctcg tgctgctttc cggccgcccc aattcaatcg gctgggccgc ggaaaacgtg   1560

aactccatcc tgcatgcctg gtatcccggc acgatgggcg gccatgccgt ggcggacgtg   1620

ctgttcggcg actacaaccc ctcgggcaag ctgccggtca ccttcccgcg caatgtcggc   1680

caggtgccga tctattacag catgaagaac accggccggc cctatacggc cgacaaacag   1740

gggcagaaat atctctcgcg ctacctcgat tcgcccaaca gcccgcttta tccgttcggc   1800

tatgggctga gctacacgag cttcggctac tcgccggtca ccctcgacaa gacgcgcatc   1860

cgccccggcg aaacgctgac cgccacggtt acggtcacca acaccggcaa gcgcgcaggt   1920

gaggaggtcg tccagctcta tgtccgcgac ctggtcggtt cggtgacgcg cccggtcaag   1980

gagctcaagg ggttccgcaa ggtgatgctc aagccgggtg aggcgcgccg gatctcgttc   2040

agcctgacgg accgcgacct tgcattccac cgtgccgaca tgagctatgg cgcggagccg   2100

ggcgagttcc ggctgtggat cgggccgtcc tcggccgaag gcagcgagac cggcttcacg   2160

ctgaccgaat ag                                                       2172

<210> 382
<211> 723
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (73)...(297)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (369)...(607)
<223> Glycosyl hydrolase family 3 C terminal domain

<400> 382
Met Arg Pro Asp Pro Lys Met Asp Gln Phe Ile Ser Asp Leu Ile Ala 
1               5                   10                  15      


Lys Met Thr Leu Asp Glu Lys Ile Gly Gln Leu Thr Leu Leu Thr Thr 
            20                  25                  30          


Asp Trp Glu Ser Thr Gly Pro Thr Leu Arg Ala Gly Tyr Lys Glu Asp 
        35                  40                  45              


Val Ala Ala Gly Arg Val Gly Ala Met Phe Asn Ala His Ser Val Lys 
    50                  55                  60                  


Phe Thr Arg Glu Leu Gln Arg Met Ala Val Glu Glu Thr Arg Leu Lys 
65                  70                  75                  80  


Ile Pro Leu Leu Phe Gly Tyr Asp Val Ile His Gly His Arg Thr Met 
                85                  90                  95      


Phe Pro Ile Ser Leu Gly Glu Ala Ala Ser Trp Asp Leu Ala Ala Ile 
            100                 105                 110         


Glu Lys Ala Ala Arg Ile Ser Ala Ile Glu Gly Ala Ala Glu Gly Leu 
        115                 120                 125             


His Trp Thr Phe Ala Pro Met Val Asp Ile Ala Arg Asp Pro Arg Trp 
    130                 135                 140                 


Gly Arg Met Ser Glu Gly Ala Gly Glu Asp Val Tyr Leu Gly Ser Arg 
145                 150                 155                 160 


Ile Ala Glu Ala Arg Val Arg Gly Tyr Gln Gly Lys Arg Ile Gly Asp 
                165                 170                 175     


Thr Asp Ser Leu Ile Ala Thr Val Lys His Phe Ala Ala Tyr Gly Ala 
            180                 185                 190         


Ala Gln Ala Gly Arg Asp Tyr His Thr Thr Asp Met Ser Asp Arg Glu 
        195                 200                 205             


Leu Arg Asp Thr His Leu Pro Pro Phe Lys Ala Ala Ile Asp Ala Gly 
    210                 215                 220                 


Ala Ala Thr Val Met Thr Ser Phe Asn Glu Leu Asn Gly Ile Pro Ala 
225                 230                 235                 240 


Ser Gly Asn Ser Tyr Leu Leu Thr Asp Ile Leu Arg Lys Glu Trp Gly 
                245                 250                 255     


Phe Lys Gly Phe Val Val Thr Asp Tyr Thr Ser Ile Asn Glu Met Val 
            260                 265                 270         


Pro His Gly Tyr Ser Lys Asp Glu Ala Gln Ala Gly Glu Gln Ala Ile 
        275                 280                 285             


Asn Ala Gly Val Asp Met Asp Met Gln Gly Ala Val Phe Met Asn His 
    290                 295                 300                 


Leu Ala Lys Ser Val Ala Glu Gly Arg Val Pro Met Ala Arg Ile Asp 
305                 310                 315                 320 


Ala Ala Val Arg Ser Val Leu Glu Met Lys Tyr Arg Leu Gly Leu Phe 
                325                 330                 335     


Ala Asp Pro Tyr Arg Phe Ser Asp Ala Ala Arg Glu Lys Ala Arg Val 
            340                 345                 350         


Gly Thr Ala Glu His Lys Ala Ala Ala Arg Asp Val Ala Arg Lys Ser 
        355                 360                 365             


Met Val Leu Leu Lys Asn Asp Gly Ser Leu Pro Leu Val Ala Ser Ala 
    370                 375                 380                 


Arg Lys Ile Ala Val Ile Gly Pro Leu Ala Asp Ser Lys Pro Asp Met 
385                 390                 395                 400 


Ile Gly Ser Trp Ala Ala Gln Gly Asp Arg Gln Gly Ser Val Thr Val 
                405                 410                 415     


Leu Glu Gly Ile Arg Ala Arg Ala Lys Gly Ala Thr Val Ser Tyr Ala 
            420                 425                 430         


Lys Gly Ala Ser Tyr Gln Phe Glu Asp Ala Gly Lys Thr Asp Gly Phe 
        435                 440                 445             


Ala Glu Ala Leu Ala Ala Ala Arg Asp Ala Asp Val Ile Val Ala Ala 
    450                 455                 460                 


Met Gly Glu His Tyr Asp His Thr Gly Glu Ala Ala Ser Arg Thr Ser 
465                 470                 475                 480 


Leu Asp Leu Pro Gly Asn Gln Gln Ala Leu Leu Glu Ala Leu Lys Ala 
                485                 490                 495     


Thr Gly Lys Pro Val Val Leu Val Leu Leu Ser Gly Arg Pro Asn Ser 
            500                 505                 510         


Ile Gly Trp Ala Ala Glu Asn Val Asn Ser Ile Leu His Ala Trp Tyr 
        515                 520                 525             


Pro Gly Thr Met Gly Gly His Ala Val Ala Asp Val Leu Phe Gly Asp 
    530                 535                 540                 


Tyr Asn Pro Ser Gly Lys Leu Pro Val Thr Phe Pro Arg Asn Val Gly 
545                 550                 555                 560 


Gln Val Pro Ile Tyr Tyr Ser Met Lys Asn Thr Gly Arg Pro Tyr Thr 
                565                 570                 575     


Ala Asp Lys Gln Gly Gln Lys Tyr Leu Ser Arg Tyr Leu Asp Ser Pro 
            580                 585                 590         


Asn Ser Pro Leu Tyr Pro Phe Gly Tyr Gly Leu Ser Tyr Thr Ser Phe 
        595                 600                 605             


Gly Tyr Ser Pro Val Thr Leu Asp Lys Thr Arg Ile Arg Pro Gly Glu 
    610                 615                 620                 


Thr Leu Thr Ala Thr Val Thr Val Thr Asn Thr Gly Lys Arg Ala Gly 
625                 630                 635                 640 


Glu Glu Val Val Gln Leu Tyr Val Arg Asp Leu Val Gly Ser Val Thr 
                645                 650                 655     


Arg Pro Val Lys Glu Leu Lys Gly Phe Arg Lys Val Met Leu Lys Pro 
            660                 665                 670         


Gly Glu Ala Arg Arg Ile Ser Phe Ser Leu Thr Asp Arg Asp Leu Ala 
        675                 680                 685             


Phe His Arg Ala Asp Met Ser Tyr Gly Ala Glu Pro Gly Glu Phe Arg 
    690                 695                 700                 


Leu Trp Ile Gly Pro Ser Ser Ala Glu Gly Ser Glu Thr Gly Phe Thr 
705                 710                 715                 720 


Leu Thr Glu 
            


<210> 383
<211> 1404
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 383
atgaatcatt ctctttcatt tccgccatcc tttgtatggg gcgcagcaac cgcaagctac     60

caactggaag gatcaaccca gggcgtggac ggctgcgccg agtccgtctg ggatatgcac    120

tgccgcagat ccggcgcgat caaggacggc tcgaacggat tcgtcgcctg cgatcactac    180

caccgctatc gcgaggatgt ggcgctcatg aacgagcttg gcttgaatgc ctatcgattc    240

tcaatcatgt ggccccgcgt catgcccgaa ggcaccggcg cggtgaacga gaagggcatg    300

gacttctaca atcggttggt tgatgaactg ctcgccgccg gcatcactcc ttgggttact    360

ttgttccact gggactttcc cctagccttg ttccaacgcg gtggctggct gaatgcggat    420

tccccgcaat ggtttgagga ttacacccgg gaagtggtta aacgcttgtc ggatcgcgtg    480

catcactggc tgacgctcaa cgaaccggcc tgcttcattg agtttggcca ccgtaccggc    540

atgcatgcac ccggcttgca actggcggac aaggaagcct gccgggtctg gcaccatgcc    600

atgctggccc acggtcgcgc cgttcgcgct atccgcgagg aatccgtgca tccatcaccc    660

caggttggct acgccccggt cttccgtacc accatcccgg acactgaaga tcctgccgac    720

atcgaagcgg cccgggcctc gatgtttgcc catcaggccg gcaacctgtt cgatacgcgg    780

tggaacctcg ccccctgctt tcggggcgcg tatccggaga tcatgatgca gtattggggc    840

gatgccgcgc cgcgcatcca ggacggcgac atggagttga tccgtcagga actcgatttt    900

ctcggcttga atatttacca gtccgagcgc attcgagccg gtgcggatgg cgcacccgag    960

gtggtgccat accctgcgga ttatccgcgc aaccagctcg gttggcccat cacgccggag   1020

gccctgcgct gggcgaccct ctttctcttt gaggagtacg ggaaacccct gatcatcaca   1080

gaaaacggaa tcaccctcga cgacaagccc aatgcagacg gcgaggtgaa tgatgtccag   1140

cggatcgctt ttttgaacga ctatcttagc ggtctccagc gcagcgtgga cgacggcatc   1200

cctgtactgg gctatttcca ctggtcgctg tgcgacaact ttgagtgggc agaaggctat   1260

gtccctcgct tcggcctgat ccatgtggac tatgccagtc aacgcagaac catcaaggcc   1320

tcaggacggt tttaccgcga catcattcgg ggccagacag ccacgccctg catcgcccaa   1380

tccagtcagc cggaaacaac ctaa                                          1404

<210> 384
<211> 467
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(454)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (11)...(25)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<400> 384
Met Asn His Ser Leu Ser Phe Pro Pro Ser Phe Val Trp Gly Ala Ala 
1               5                   10                  15      


Thr Ala Ser Tyr Gln Leu Glu Gly Ser Thr Gln Gly Val Asp Gly Cys 
            20                  25                  30          


Ala Glu Ser Val Trp Asp Met His Cys Arg Arg Ser Gly Ala Ile Lys 
        35                  40                  45              


Asp Gly Ser Asn Gly Phe Val Ala Cys Asp His Tyr His Arg Tyr Arg 
    50                  55                  60                  


Glu Asp Val Ala Leu Met Asn Glu Leu Gly Leu Asn Ala Tyr Arg Phe 
65                  70                  75                  80  


Ser Ile Met Trp Pro Arg Val Met Pro Glu Gly Thr Gly Ala Val Asn 
                85                  90                  95      


Glu Lys Gly Met Asp Phe Tyr Asn Arg Leu Val Asp Glu Leu Leu Ala 
            100                 105                 110         


Ala Gly Ile Thr Pro Trp Val Thr Leu Phe His Trp Asp Phe Pro Leu 
        115                 120                 125             


Ala Leu Phe Gln Arg Gly Gly Trp Leu Asn Ala Asp Ser Pro Gln Trp 
    130                 135                 140                 


Phe Glu Asp Tyr Thr Arg Glu Val Val Lys Arg Leu Ser Asp Arg Val 
145                 150                 155                 160 


His His Trp Leu Thr Leu Asn Glu Pro Ala Cys Phe Ile Glu Phe Gly 
                165                 170                 175     


His Arg Thr Gly Met His Ala Pro Gly Leu Gln Leu Ala Asp Lys Glu 
            180                 185                 190         


Ala Cys Arg Val Trp His His Ala Met Leu Ala His Gly Arg Ala Val 
        195                 200                 205             


Arg Ala Ile Arg Glu Glu Ser Val His Pro Ser Pro Gln Val Gly Tyr 
    210                 215                 220                 


Ala Pro Val Phe Arg Thr Thr Ile Pro Asp Thr Glu Asp Pro Ala Asp 
225                 230                 235                 240 


Ile Glu Ala Ala Arg Ala Ser Met Phe Ala His Gln Ala Gly Asn Leu 
                245                 250                 255     


Phe Asp Thr Arg Trp Asn Leu Ala Pro Cys Phe Arg Gly Ala Tyr Pro 
            260                 265                 270         


Glu Ile Met Met Gln Tyr Trp Gly Asp Ala Ala Pro Arg Ile Gln Asp 
        275                 280                 285             


Gly Asp Met Glu Leu Ile Arg Gln Glu Leu Asp Phe Leu Gly Leu Asn 
    290                 295                 300                 


Ile Tyr Gln Ser Glu Arg Ile Arg Ala Gly Ala Asp Gly Ala Pro Glu 
305                 310                 315                 320 


Val Val Pro Tyr Pro Ala Asp Tyr Pro Arg Asn Gln Leu Gly Trp Pro 
                325                 330                 335     


Ile Thr Pro Glu Ala Leu Arg Trp Ala Thr Leu Phe Leu Phe Glu Glu 
            340                 345                 350         


Tyr Gly Lys Pro Leu Ile Ile Thr Glu Asn Gly Ile Thr Leu Asp Asp 
        355                 360                 365             


Lys Pro Asn Ala Asp Gly Glu Val Asn Asp Val Gln Arg Ile Ala Phe 
    370                 375                 380                 


Leu Asn Asp Tyr Leu Ser Gly Leu Gln Arg Ser Val Asp Asp Gly Ile 
385                 390                 395                 400 


Pro Val Leu Gly Tyr Phe His Trp Ser Leu Cys Asp Asn Phe Glu Trp 
                405                 410                 415     


Ala Glu Gly Tyr Val Pro Arg Phe Gly Leu Ile His Val Asp Tyr Ala 
            420                 425                 430         


Ser Gln Arg Arg Thr Ile Lys Ala Ser Gly Arg Phe Tyr Arg Asp Ile 
        435                 440                 445             


Ile Arg Gly Gln Thr Ala Thr Pro Cys Ile Ala Gln Ser Ser Gln Pro 
    450                 455                 460                 


Glu Thr Thr 
465         


<210> 385
<211> 2457
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 385
atgtcagaga caattgaccg gttgctggcg gagatgacgc tggaggaaaa gatcaagctg     60

ctgggcggtt ggcgcgcggt accaggtacg ccgcgggacg cagacgtata tggcgtaccc    120

cgtctaggga tagccccact cagattggcg gacggtccga ccggggtcca ttggtggacc    180

aaggcatcga cctgttaccc ggccctgatt aatttggcgg cttcttttga tgaaaagatt    240

gcctacacat tcggacgcgc actgggaacc gattgccgtg ctttcggggt acatgtgatc    300

ctcgctcccg gtgtaaacct gtaccgctct ccattatgtg gacggaactt cgaatatctc    360

ggagaggacc cggaattggc cggaggactg gccgcagcct acatccgtgg tttgcaatca    420

aaaggagtgg ctgctaccgt caaacacctg gcagccaata atcaggaata cgatcgtcac    480

aatatcagct cggatattga cgagcgcacc ttgcgcgaag tctacctacg cccctttgaa    540

agggctgtcc atgaagggca aacggcagca gtcatgacgg catataatcc ggtcaatgga    600

cagcatgcct cggaaaatgc ctggctgctg gatggcgtgt tgcgccacga ttggggcttc    660

aacgggtgga ttatgtcgga ttggacatcc gtctattcca cagtgcagac tttgaatagc    720

gggttggatc tggaaatgcc gtttgccttg cacctgacgg aagaaaaaat caaagctgct    780

ttggcgaccg gggtcacgac tgtcacccgc atcgatcgca tggtactgca ccgattggca    840

ctgatggagc gcttcgggtg gttggatccc gggcatgcac agcaggataa gtcattgccc    900

gaccgtaacc ccgaaaccga agccgtggct ttggaggtgg cacgccgcgg cattgtactg    960

cttaagaatc gaaaccatgt cttgccctgt ccgcctgcca gcctgcgccg catcgtcgtg   1020

ctggggcatc acgcagccca acccattctg tccgggggcg gttcagcctt ctgcccgccg   1080

cacgaatcgg taaccctgct ggaggcactg cgccagacat acgcgcagga tgtgcaggtg   1140

gattatttcg agatggtaga tccgtggtgc gaacgggctg ctttggaagc gagtagcttt   1200

tatacggctg atggcgaacc gggcttgcat gcctgctact ttgccaataa tcgcctggaa   1260

ggctcccctg ccctcgagcg ggtggatacc aaactgcaat tccgttggat ttctgaaaaa   1320

ccggatccaa tgctggaaga tgattatttt tcggcacgct ggagcggctt tttcgatatt   1380

gatgctgccg gtatctacga ctggtatctg aaaagcgaag acgggacatt tcaggtatgg   1440

gtggatgatc atcccctgac ggatcccttt acaggcacgc ggcgtgttcc tcttgatctg   1500

tctgcgggac aacatcggat aactgtcacc tattgccagt tgcgcggcgg atgggccacc   1560

tgccaatgtg gctttgaacg cgcagctaat gcgcttcgtg catatgagga cggtctggct   1620

gcagcagccg aagcagatct tgtggtggtt acaaccggtt ttgtggcacc cacggaacgg   1680

gaatccagtg accgcacatt tgaattggat cagcgcctga atcaaatggt cctcgatgta   1740

gccagacaga atgcgcgcac ggttgttgtt ttgtatgcag gtggcgccgt ggatgtggca   1800

ccgtggatcg atcaggttgc ggctttgttg catatctggt atcccggtca aaacggtacg   1860

ctggcagcag ccgagataat tgccggatac actaacccat cgggcaaatt acctttcaca   1920

tgggaacatc aactctctga ccgcggttcc tttagcgcat atcatgatga cgatggggat   1980

ggccgcgtgg cttacacaga tggtgttttt accggatatc gatggttcga ccatcaccgg   2040

ataaaagcac gctatccatt tggattcggc ttgtcataca ctacatttgc ctatgaaaat   2100

cttgcctgta ctcctgctac gttgcaggag aacgaaacgg tccaagtaac gttcgacgtg   2160

atcaacaccg gcagcgtggc cggacgacat gctgcgctag tctttgtggc tccgcctgcg   2220

ggtacggttc cgagacccgt aaaggaatac aagcaatcgt gcgtggttaa tctagcacct   2280

ggtgaacggc agtccgtcac tgtcaccttg ccatggcgcg ccttccagtt ttggcatcca   2340

gatacacgat gctggagcat caccccgggt gcacatgcaa ttcaagtccg accggacgcc   2400

gatagcacgg ggctgcaagc tgaagtcaca ctggtggatg gacacgccaa acgatga      2457

<210> 386
<211> 818
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (36)...(247)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (318)...(695)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> DOMAIN
<222> (405)...(539)
<223> PA14 domain

<220> 
<221> SITE
<222> (15)...(27)
<223> Lipocalin signature. Prosite id = PS00213

<220> 
<221> SITE
<222> (163)...(166)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (627)...(630)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (721)...(724)
<223> N-glycosylation site. Prosite id = PS00001

<400> 386
Met Ser Glu Thr Ile Asp Arg Leu Leu Ala Glu Met Thr Leu Glu Glu 
1               5                   10                  15      


Lys Ile Lys Leu Leu Gly Gly Trp Arg Ala Val Pro Gly Thr Pro Arg 
            20                  25                  30          


Asp Ala Asp Val Tyr Gly Val Pro Arg Leu Gly Ile Ala Pro Leu Arg 
        35                  40                  45              


Leu Ala Asp Gly Pro Thr Gly Val His Trp Trp Thr Lys Ala Ser Thr 
    50                  55                  60                  


Cys Tyr Pro Ala Leu Ile Asn Leu Ala Ala Ser Phe Asp Glu Lys Ile 
65                  70                  75                  80  


Ala Tyr Thr Phe Gly Arg Ala Leu Gly Thr Asp Cys Arg Ala Phe Gly 
                85                  90                  95      


Val His Val Ile Leu Ala Pro Gly Val Asn Leu Tyr Arg Ser Pro Leu 
            100                 105                 110         


Cys Gly Arg Asn Phe Glu Tyr Leu Gly Glu Asp Pro Glu Leu Ala Gly 
        115                 120                 125             


Gly Leu Ala Ala Ala Tyr Ile Arg Gly Leu Gln Ser Lys Gly Val Ala 
    130                 135                 140                 


Ala Thr Val Lys His Leu Ala Ala Asn Asn Gln Glu Tyr Asp Arg His 
145                 150                 155                 160 


Asn Ile Ser Ser Asp Ile Asp Glu Arg Thr Leu Arg Glu Val Tyr Leu 
                165                 170                 175     


Arg Pro Phe Glu Arg Ala Val His Glu Gly Gln Thr Ala Ala Val Met 
            180                 185                 190         


Thr Ala Tyr Asn Pro Val Asn Gly Gln His Ala Ser Glu Asn Ala Trp 
        195                 200                 205             


Leu Leu Asp Gly Val Leu Arg His Asp Trp Gly Phe Asn Gly Trp Ile 
    210                 215                 220                 


Met Ser Asp Trp Thr Ser Val Tyr Ser Thr Val Gln Thr Leu Asn Ser 
225                 230                 235                 240 


Gly Leu Asp Leu Glu Met Pro Phe Ala Leu His Leu Thr Glu Glu Lys 
                245                 250                 255     


Ile Lys Ala Ala Leu Ala Thr Gly Val Thr Thr Val Thr Arg Ile Asp 
            260                 265                 270         


Arg Met Val Leu His Arg Leu Ala Leu Met Glu Arg Phe Gly Trp Leu 
        275                 280                 285             


Asp Pro Gly His Ala Gln Gln Asp Lys Ser Leu Pro Asp Arg Asn Pro 
    290                 295                 300                 


Glu Thr Glu Ala Val Ala Leu Glu Val Ala Arg Arg Gly Ile Val Leu 
305                 310                 315                 320 


Leu Lys Asn Arg Asn His Val Leu Pro Cys Pro Pro Ala Ser Leu Arg 
                325                 330                 335     


Arg Ile Val Val Leu Gly His His Ala Ala Gln Pro Ile Leu Ser Gly 
            340                 345                 350         


Gly Gly Ser Ala Phe Cys Pro Pro His Glu Ser Val Thr Leu Leu Glu 
        355                 360                 365             


Ala Leu Arg Gln Thr Tyr Ala Gln Asp Val Gln Val Asp Tyr Phe Glu 
    370                 375                 380                 


Met Val Asp Pro Trp Cys Glu Arg Ala Ala Leu Glu Ala Ser Ser Phe 
385                 390                 395                 400 


Tyr Thr Ala Asp Gly Glu Pro Gly Leu His Ala Cys Tyr Phe Ala Asn 
                405                 410                 415     


Asn Arg Leu Glu Gly Ser Pro Ala Leu Glu Arg Val Asp Thr Lys Leu 
            420                 425                 430         


Gln Phe Arg Trp Ile Ser Glu Lys Pro Asp Pro Met Leu Glu Asp Asp 
        435                 440                 445             


Tyr Phe Ser Ala Arg Trp Ser Gly Phe Phe Asp Ile Asp Ala Ala Gly 
    450                 455                 460                 


Ile Tyr Asp Trp Tyr Leu Lys Ser Glu Asp Gly Thr Phe Gln Val Trp 
465                 470                 475                 480 


Val Asp Asp His Pro Leu Thr Asp Pro Phe Thr Gly Thr Arg Arg Val 
                485                 490                 495     


Pro Leu Asp Leu Ser Ala Gly Gln His Arg Ile Thr Val Thr Tyr Cys 
            500                 505                 510         


Gln Leu Arg Gly Gly Trp Ala Thr Cys Gln Cys Gly Phe Glu Arg Ala 
        515                 520                 525             


Ala Asn Ala Leu Arg Ala Tyr Glu Asp Gly Leu Ala Ala Ala Ala Glu 
    530                 535                 540                 


Ala Asp Leu Val Val Val Thr Thr Gly Phe Val Ala Pro Thr Glu Arg 
545                 550                 555                 560 


Glu Ser Ser Asp Arg Thr Phe Glu Leu Asp Gln Arg Leu Asn Gln Met 
                565                 570                 575     


Val Leu Asp Val Ala Arg Gln Asn Ala Arg Thr Val Val Val Leu Tyr 
            580                 585                 590         


Ala Gly Gly Ala Val Asp Val Ala Pro Trp Ile Asp Gln Val Ala Ala 
        595                 600                 605             


Leu Leu His Ile Trp Tyr Pro Gly Gln Asn Gly Thr Leu Ala Ala Ala 
    610                 615                 620                 


Glu Ile Ile Ala Gly Tyr Thr Asn Pro Ser Gly Lys Leu Pro Phe Thr 
625                 630                 635                 640 


Trp Glu His Gln Leu Ser Asp Arg Gly Ser Phe Ser Ala Tyr His Asp 
                645                 650                 655     


Asp Asp Gly Asp Gly Arg Val Ala Tyr Thr Asp Gly Val Phe Thr Gly 
            660                 665                 670         


Tyr Arg Trp Phe Asp His His Arg Ile Lys Ala Arg Tyr Pro Phe Gly 
        675                 680                 685             


Phe Gly Leu Ser Tyr Thr Thr Phe Ala Tyr Glu Asn Leu Ala Cys Thr 
    690                 695                 700                 


Pro Ala Thr Leu Gln Glu Asn Glu Thr Val Gln Val Thr Phe Asp Val 
705                 710                 715                 720 


Ile Asn Thr Gly Ser Val Ala Gly Arg His Ala Ala Leu Val Phe Val 
                725                 730                 735     


Ala Pro Pro Ala Gly Thr Val Pro Arg Pro Val Lys Glu Tyr Lys Gln 
            740                 745                 750         


Ser Cys Val Val Asn Leu Ala Pro Gly Glu Arg Gln Ser Val Thr Val 
        755                 760                 765             


Thr Leu Pro Trp Arg Ala Phe Gln Phe Trp His Pro Asp Thr Arg Cys 
    770                 775                 780                 


Trp Ser Ile Thr Pro Gly Ala His Ala Ile Gln Val Arg Pro Asp Ala 
785                 790                 795                 800 


Asp Ser Thr Gly Leu Gln Ala Glu Val Thr Leu Val Asp Gly His Ala 
                805                 810                 815     


Lys Arg 
        


<210> 387
<211> 1362
<212> DNA
<213> Bacteria

<400> 387
atgcttcagt ttccgaaaga ttttatttgg ggagctgcaa cttcatcgta tcaaattgaa     60

ggaacagcga ctggagaaga taaaatttac tcgatctggg atcacttttc ccgcattcct    120

ggcaaaatag cgaatgggga taatggcgat attgcaattg atcattacaa tcgttatgtt    180

gaagacatcg cattaatgaa agcgcttcat ttgaaagcgt atcgattttc gactagttgg    240

gcgagacttt attgtgaaac gccagggaag tttaacgaaa aaggtttaga tttttataag    300

cgtcttgtaa atgagctgct agagaacggt atcgagccaa tgttgaccat ttatcattgg    360

gatatgccac aagctcttca agagaaaggt ggctgggaaa atcgtgatat cgttcaatac    420

ttccaagaat acgctgcttt cctttacgag aatcttgggg atgtcgtgaa aaaatggatt    480

acgcataatg agccgtgggt tgtcacctat ttaggatatg ggaatggcga acatgcccca    540

gggattcaaa actttacatc atttttaaaa gcagcacatc atgttcttct ctcacacggg    600

gaagcggtaa aagcgtttcg agcaatcggt ccgaaagatg gggaaattgg tattacgttg    660

aatttgacac ctggatatgc ggttgatccg aaagatgaaa aagcagttga tgccgctcga    720

aaatgggacg gctttatgaa tcgttggttt ttagatcctg tatttaaggg acaatatcca    780

gcagatatgt tagaagtgta taaagattat ttaccagacg tttacaaaga gggagattta    840

caaacgattc agcaaccgat cgactttttc ggatttaact attattcaac agcaacatta    900

aaagattgga aaaaaggtga ccgtgaaccg atcgtatttg aacatgtgag cacaggaaga    960

cctgtgacgg atatgaattg ggaagtgaat ccaaacggtt tgtttgattt aatggtgcga   1020

ttgaaaaaag attatggcga tattccatta tacattaccg aaaacggtgc tgcatacaaa   1080

gatcgcgtca acgaacaagg tgaagtagaa gatgatgagc gagttgctta tatacgagag   1140

catttaatcg cttgccaccg cgcgattgaa caaggcgtca atttaaaagg atattatgta   1200

tggtcgctgt tcgataattt tgagtgggca tttggatatg ataagcgctt tgggattgta   1260

tacgtggatt atgaaacgct agagcgcatc ccgaaaaaga gtgcattatg gtataaggaa   1320

acgattataa acaacggatt gaaagcagaa aaagataaat aa                      1362

<210> 388
<211> 453
<212> PRT
<213> Bacteria

<220> 
<221> DOMAIN
<222> (1)...(447)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (8)...(22)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (186)...(189)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (355)...(363)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 388
Met Leu Gln Phe Pro Lys Asp Phe Ile Trp Gly Ala Ala Thr Ser Ser 
1               5                   10                  15      


Tyr Gln Ile Glu Gly Thr Ala Thr Gly Glu Asp Lys Ile Tyr Ser Ile 
            20                  25                  30          


Trp Asp His Phe Ser Arg Ile Pro Gly Lys Ile Ala Asn Gly Asp Asn 
        35                  40                  45              


Gly Asp Ile Ala Ile Asp His Tyr Asn Arg Tyr Val Glu Asp Ile Ala 
    50                  55                  60                  


Leu Met Lys Ala Leu His Leu Lys Ala Tyr Arg Phe Ser Thr Ser Trp 
65                  70                  75                  80  


Ala Arg Leu Tyr Cys Glu Thr Pro Gly Lys Phe Asn Glu Lys Gly Leu 
                85                  90                  95      


Asp Phe Tyr Lys Arg Leu Val Asn Glu Leu Leu Glu Asn Gly Ile Glu 
            100                 105                 110         


Pro Met Leu Thr Ile Tyr His Trp Asp Met Pro Gln Ala Leu Gln Glu 
        115                 120                 125             


Lys Gly Gly Trp Glu Asn Arg Asp Ile Val Gln Tyr Phe Gln Glu Tyr 
    130                 135                 140                 


Ala Ala Phe Leu Tyr Glu Asn Leu Gly Asp Val Val Lys Lys Trp Ile 
145                 150                 155                 160 


Thr His Asn Glu Pro Trp Val Val Thr Tyr Leu Gly Tyr Gly Asn Gly 
                165                 170                 175     


Glu His Ala Pro Gly Ile Gln Asn Phe Thr Ser Phe Leu Lys Ala Ala 
            180                 185                 190         


His His Val Leu Leu Ser His Gly Glu Ala Val Lys Ala Phe Arg Ala 
        195                 200                 205             


Ile Gly Pro Lys Asp Gly Glu Ile Gly Ile Thr Leu Asn Leu Thr Pro 
    210                 215                 220                 


Gly Tyr Ala Val Asp Pro Lys Asp Glu Lys Ala Val Asp Ala Ala Arg 
225                 230                 235                 240 


Lys Trp Asp Gly Phe Met Asn Arg Trp Phe Leu Asp Pro Val Phe Lys 
                245                 250                 255     


Gly Gln Tyr Pro Ala Asp Met Leu Glu Val Tyr Lys Asp Tyr Leu Pro 
            260                 265                 270         


Asp Val Tyr Lys Glu Gly Asp Leu Gln Thr Ile Gln Gln Pro Ile Asp 
        275                 280                 285             


Phe Phe Gly Phe Asn Tyr Tyr Ser Thr Ala Thr Leu Lys Asp Trp Lys 
    290                 295                 300                 


Lys Gly Asp Arg Glu Pro Ile Val Phe Glu His Val Ser Thr Gly Arg 
305                 310                 315                 320 


Pro Val Thr Asp Met Asn Trp Glu Val Asn Pro Asn Gly Leu Phe Asp 
                325                 330                 335     


Leu Met Val Arg Leu Lys Lys Asp Tyr Gly Asp Ile Pro Leu Tyr Ile 
            340                 345                 350         


Thr Glu Asn Gly Ala Ala Tyr Lys Asp Arg Val Asn Glu Gln Gly Glu 
        355                 360                 365             


Val Glu Asp Asp Glu Arg Val Ala Tyr Ile Arg Glu His Leu Ile Ala 
    370                 375                 380                 


Cys His Arg Ala Ile Glu Gln Gly Val Asn Leu Lys Gly Tyr Tyr Val 
385                 390                 395                 400 


Trp Ser Leu Phe Asp Asn Phe Glu Trp Ala Phe Gly Tyr Asp Lys Arg 
                405                 410                 415     


Phe Gly Ile Val Tyr Val Asp Tyr Glu Thr Leu Glu Arg Ile Pro Lys 
            420                 425                 430         


Lys Ser Ala Leu Trp Tyr Lys Glu Thr Ile Ile Asn Asn Gly Leu Lys 
        435                 440                 445             


Ala Glu Lys Asp Lys 
    450             


<210> 389
<211> 2103
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 389
atgatacatg ccagctcttc gttcacatcc gggggtgtgg aaaggcttgg catacctgag     60

ctcattatgt ctgatggtcc tcatggtgtg cggcatgaac acgggcgtgg ctggtttgca    120

ctcgaagatg ctgacgacat ggcgacttat ttacccgtag gtatcaacct tgcttcaaca    180

tggaacaggg aactgggcta tgaatacggt gttgtgcttg gcaatgaagc caaagaaagg    240

ggcaagcatg ttctccttgg tcctggtgtt aatatcattc gcagtcccct gaatggccgt    300

aattttgaat acttaagtga ggatcccttc ctgaccgctg aaatggctgt gggatatacc    360

cttggagtgc aggatcaggg agtggcggta tgtgtaaaac attatgctgc caataaccag    420

gaaaccgacc gtcataatat tgatgtaatt gttagcaagc gcgctctgca cgaaatttac    480

ttccctgctt ttaaagctac ggtgcaggaa gccggtgcat ggagcatcat gggggcttac    540

aacaagatca atgggcaata caccacccac cacgaatatc ttaacaatga aattctgaaa    600

ggaaagtggg gttttgatgg ggttgtgata agtgactggg catcagtaaa ggatacccgc    660

gaagcacttt tatatggtac cgaccttgaa atgggcaccg agctgttggg agatttcacc    720

aatcctgatt atgatgattt ctatctggca cgccacgcaa agaaaatgat tgaaagtggc    780

gaaattgatg aaagttatgt tgacgaaaaa gtgcgccgga tccttaaact gatgtaccgt    840

accactgccc ttggtaaaca tggacctggt aagcgaaatg tacccgaaca ccagcaactg    900

gcactgaagg tggcccagga aggaattgtt ttgctgaaaa atgatggcct gttgccactc    960

aataaaaagg agatcactac aatagctgtc atcgggcaca atgccacaag gctttttgcc   1020

gaacggggag gcagctcaca ggtaagagcc ctgtatgaga tcactccgct ggaaggaata   1080

atgaatctcg tgggtaatga tgtggaagtg atctacgccg aaggatatga gccctactat   1140

gatgaaaccc ttttcagggg tgcatcggga gatgctgctt cccagtcaag atccaacaga   1200

agaactgttg aactggccag aactgccaac aaggaacttg ccagggaagc acttgaagtt   1260

gcacaaaaag ccgacattgt gatctttgtg ggaggatgga tacatggcca cgaaggaatg   1320

ccctggggtg aaggaaccta cgatgccgaa gcccgcgata aactcaatat caaacttccc   1380

tttggccagg aacaacttat ccgcgagata aaccgggcaa acaaacaaac ggttgtggta   1440

ctcatgggtg gcagcaacgt ggagatggat aactggctgc cagaaacgcc agcatacctt   1500

catgcctggt accccggtat ggaagggggc actgccattg cgcaggtact tttcggcgaa   1560

gtgaatcctt caggaaaact gccaatgaca tttgccaatt cacacaagga ttacccttct   1620

cattccatcg gcgaatttcc aggatacaaa aaagtacatt acaccgagga tatttttgta   1680

ggataccgcc acttcgatgc aaagggtaaa gatgtggtgt tcccctttgg gtttggactt   1740

tcctatacaa ccttcgattt ctcagggctg aagctgacca ggcaaggaaa taaagtgatc   1800

gtggaatgta aggttaccaa tacaggaaac cgggtcggtg ccgaggtggt gcaggtttat   1860

gtgcatcaga aagagtcttc ggtggaacgg cccatccgtg aactgaaagg gtttgaaaag   1920

gtaatgctca atccgggtga aacaacaaca gtgaagatcg gactcgatgc atcatctttt   1980

agtttcttcc atccggaacg tcttgaatgg actttggaac caggcatgtt tgaaattgca   2040

gtgggctcat catccagaga tctgccactg aaaggttcaa tagatattgg tatgcttgaa   2100

tag                                                                 2103

<210> 390
<211> 700
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (10)...(232)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (309)...(562)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (201)...(218)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (339)...(342)
<223> N-glycosylation site. Prosite id = PS00001

<400> 390
Met Ile His Ala Ser Ser Ser Phe Thr Ser Gly Gly Val Glu Arg Leu 
1               5                   10                  15      


Gly Ile Pro Glu Leu Ile Met Ser Asp Gly Pro His Gly Val Arg His 
            20                  25                  30          


Glu His Gly Arg Gly Trp Phe Ala Leu Glu Asp Ala Asp Asp Met Ala 
        35                  40                  45              


Thr Tyr Leu Pro Val Gly Ile Asn Leu Ala Ser Thr Trp Asn Arg Glu 
    50                  55                  60                  


Leu Gly Tyr Glu Tyr Gly Val Val Leu Gly Asn Glu Ala Lys Glu Arg 
65                  70                  75                  80  


Gly Lys His Val Leu Leu Gly Pro Gly Val Asn Ile Ile Arg Ser Pro 
                85                  90                  95      


Leu Asn Gly Arg Asn Phe Glu Tyr Leu Ser Glu Asp Pro Phe Leu Thr 
            100                 105                 110         


Ala Glu Met Ala Val Gly Tyr Thr Leu Gly Val Gln Asp Gln Gly Val 
        115                 120                 125             


Ala Val Cys Val Lys His Tyr Ala Ala Asn Asn Gln Glu Thr Asp Arg 
    130                 135                 140                 


His Asn Ile Asp Val Ile Val Ser Lys Arg Ala Leu His Glu Ile Tyr 
145                 150                 155                 160 


Phe Pro Ala Phe Lys Ala Thr Val Gln Glu Ala Gly Ala Trp Ser Ile 
                165                 170                 175     


Met Gly Ala Tyr Asn Lys Ile Asn Gly Gln Tyr Thr Thr His His Glu 
            180                 185                 190         


Tyr Leu Asn Asn Glu Ile Leu Lys Gly Lys Trp Gly Phe Asp Gly Val 
        195                 200                 205             


Val Ile Ser Asp Trp Ala Ser Val Lys Asp Thr Arg Glu Ala Leu Leu 
    210                 215                 220                 


Tyr Gly Thr Asp Leu Glu Met Gly Thr Glu Leu Leu Gly Asp Phe Thr 
225                 230                 235                 240 


Asn Pro Asp Tyr Asp Asp Phe Tyr Leu Ala Arg His Ala Lys Lys Met 
                245                 250                 255     


Ile Glu Ser Gly Glu Ile Asp Glu Ser Tyr Val Asp Glu Lys Val Arg 
            260                 265                 270         


Arg Ile Leu Lys Leu Met Tyr Arg Thr Thr Ala Leu Gly Lys His Gly 
        275                 280                 285             


Pro Gly Lys Arg Asn Val Pro Glu His Gln Gln Leu Ala Leu Lys Val 
    290                 295                 300                 


Ala Gln Glu Gly Ile Val Leu Leu Lys Asn Asp Gly Leu Leu Pro Leu 
305                 310                 315                 320 


Asn Lys Lys Glu Ile Thr Thr Ile Ala Val Ile Gly His Asn Ala Thr 
                325                 330                 335     


Arg Leu Phe Ala Glu Arg Gly Gly Ser Ser Gln Val Arg Ala Leu Tyr 
            340                 345                 350         


Glu Ile Thr Pro Leu Glu Gly Ile Met Asn Leu Val Gly Asn Asp Val 
        355                 360                 365             


Glu Val Ile Tyr Ala Glu Gly Tyr Glu Pro Tyr Tyr Asp Glu Thr Leu 
    370                 375                 380                 


Phe Arg Gly Ala Ser Gly Asp Ala Ala Ser Gln Ser Arg Ser Asn Arg 
385                 390                 395                 400 


Arg Thr Val Glu Leu Ala Arg Thr Ala Asn Lys Glu Leu Ala Arg Glu 
                405                 410                 415     


Ala Leu Glu Val Ala Gln Lys Ala Asp Ile Val Ile Phe Val Gly Gly 
            420                 425                 430         


Trp Ile His Gly His Glu Gly Met Pro Trp Gly Glu Gly Thr Tyr Asp 
        435                 440                 445             


Ala Glu Ala Arg Asp Lys Leu Asn Ile Lys Leu Pro Phe Gly Gln Glu 
    450                 455                 460                 


Gln Leu Ile Arg Glu Ile Asn Arg Ala Asn Lys Gln Thr Val Val Val 
465                 470                 475                 480 


Leu Met Gly Gly Ser Asn Val Glu Met Asp Asn Trp Leu Pro Glu Thr 
                485                 490                 495     


Pro Ala Tyr Leu His Ala Trp Tyr Pro Gly Met Glu Gly Gly Thr Ala 
            500                 505                 510         


Ile Ala Gln Val Leu Phe Gly Glu Val Asn Pro Ser Gly Lys Leu Pro 
        515                 520                 525             


Met Thr Phe Ala Asn Ser His Lys Asp Tyr Pro Ser His Ser Ile Gly 
    530                 535                 540                 


Glu Phe Pro Gly Tyr Lys Lys Val His Tyr Thr Glu Asp Ile Phe Val 
545                 550                 555                 560 


Gly Tyr Arg His Phe Asp Ala Lys Gly Lys Asp Val Val Phe Pro Phe 
                565                 570                 575     


Gly Phe Gly Leu Ser Tyr Thr Thr Phe Asp Phe Ser Gly Leu Lys Leu 
            580                 585                 590         


Thr Arg Gln Gly Asn Lys Val Ile Val Glu Cys Lys Val Thr Asn Thr 
        595                 600                 605             


Gly Asn Arg Val Gly Ala Glu Val Val Gln Val Tyr Val His Gln Lys 
    610                 615                 620                 


Glu Ser Ser Val Glu Arg Pro Ile Arg Glu Leu Lys Gly Phe Glu Lys 
625                 630                 635                 640 


Val Met Leu Asn Pro Gly Glu Thr Thr Thr Val Lys Ile Gly Leu Asp 
                645                 650                 655     


Ala Ser Ser Phe Ser Phe Phe His Pro Glu Arg Leu Glu Trp Thr Leu 
            660                 665                 670         


Glu Pro Gly Met Phe Glu Ile Ala Val Gly Ser Ser Ser Arg Asp Leu 
        675                 680                 685             


Pro Leu Lys Gly Ser Ile Asp Ile Gly Met Leu Glu 
    690                 695                 700 


<210> 391
<211> 1341
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 391
atgaacgtga aaaagttccc tgaaggattc ctctggggtg ttgcaacagc ttcctaccag     60

atcgagggtt ctcccctcgc agacggagct ggtatgtcta tctggcacac cttctcccat    120

actcctggaa atgtaaagaa cggtgacacg ggagatgtgg cctgcgacca ctacaacaga    180

tggaaagagg acattgaaat catagagaaa ctcggagtaa aggcttacag attttcaatc    240

agctggccaa gaatacttcc ggaaggaaca ggaagggtga atcagaaagg actggatttt    300

tacaacagga tcatagacac cctgctggaa aaaggtatca caccctttgt gaccatctat    360

cactgggatc ttcccttcgc tcttcagctg aaaggaggat gggcgaacag agaaatagcg    420

gattggttcg cagaatactc aagggttctc tttgaaaatt tcggtgatcg tgtgaagaac    480

tggatcacct tgaacgaacc gtgggttgtt gccatagtgg ggcatctgta cggagtccac    540

gctcctggaa tgagagatat ttacgtggct ttccgagctg ttcacaatct cttgagggca    600

cacgccagag cggtgaaagt gttcagggaa accgtgaaag atggaaagat cggaatagtt    660

ttcaacaatg gatatttcga acctgcgagt gaaaaagaag aagacatcag agcggtgaga    720

ttcatgcatc agttcaacaa ctatcctctc tttctcaatc cgatctacag aggagattac    780

ccggagctcg ttctggaatt tgccagagag tatctaccgg agaattacaa agatgacatg    840

tccgagatac aggaaaagat cgactttgtt ggattgaact attactccgg tcatttggtg    900

aagttcgatc cagatgcacc agctaaggtc tctttcgttg aaagggatct tccaaaaaca    960

gccatgggat gggagatcgt tccagaagga atctactgga tcctgaagaa ggtgaaagaa   1020

gaatacaacc caccagaggt ttacatcaca gagaatgggg ctgcttttga cgacgtagtt   1080

agtgaagatg gaagagttca cgatcaaaac agaatcgatt atttgaaggc ccacattggt   1140

caggcatgga aggccataca ggagggagtg ccgcttaaag gttacttcgt ctggtcgctc   1200

ctcgacaatt tcgaatgggc agagggatat tccaagagat ttggtattgt gtatgtagac   1260

tacagcactc aaaaacgcat cgtaaaagac agtgggtact ggtactcgaa tgtggttaaa   1320

aacaacggtc tggaagactg a                                             1341

<210> 392
<211> 446
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (2)...(444)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (352)...(360)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 392
Met Asn Val Lys Lys Phe Pro Glu Gly Phe Leu Trp Gly Val Ala Thr 
1               5                   10                  15      


Ala Ser Tyr Gln Ile Glu Gly Ser Pro Leu Ala Asp Gly Ala Gly Met 
            20                  25                  30          


Ser Ile Trp His Thr Phe Ser His Thr Pro Gly Asn Val Lys Asn Gly 
        35                  40                  45              


Asp Thr Gly Asp Val Ala Cys Asp His Tyr Asn Arg Trp Lys Glu Asp 
    50                  55                  60                  


Ile Glu Ile Ile Glu Lys Leu Gly Val Lys Ala Tyr Arg Phe Ser Ile 
65                  70                  75                  80  


Ser Trp Pro Arg Ile Leu Pro Glu Gly Thr Gly Arg Val Asn Gln Lys 
                85                  90                  95      


Gly Leu Asp Phe Tyr Asn Arg Ile Ile Asp Thr Leu Leu Glu Lys Gly 
            100                 105                 110         


Ile Thr Pro Phe Val Thr Ile Tyr His Trp Asp Leu Pro Phe Ala Leu 
        115                 120                 125             


Gln Leu Lys Gly Gly Trp Ala Asn Arg Glu Ile Ala Asp Trp Phe Ala 
    130                 135                 140                 


Glu Tyr Ser Arg Val Leu Phe Glu Asn Phe Gly Asp Arg Val Lys Asn 
145                 150                 155                 160 


Trp Ile Thr Leu Asn Glu Pro Trp Val Val Ala Ile Val Gly His Leu 
                165                 170                 175     


Tyr Gly Val His Ala Pro Gly Met Arg Asp Ile Tyr Val Ala Phe Arg 
            180                 185                 190         


Ala Val His Asn Leu Leu Arg Ala His Ala Arg Ala Val Lys Val Phe 
        195                 200                 205             


Arg Glu Thr Val Lys Asp Gly Lys Ile Gly Ile Val Phe Asn Asn Gly 
    210                 215                 220                 


Tyr Phe Glu Pro Ala Ser Glu Lys Glu Glu Asp Ile Arg Ala Val Arg 
225                 230                 235                 240 


Phe Met His Gln Phe Asn Asn Tyr Pro Leu Phe Leu Asn Pro Ile Tyr 
                245                 250                 255     


Arg Gly Asp Tyr Pro Glu Leu Val Leu Glu Phe Ala Arg Glu Tyr Leu 
            260                 265                 270         


Pro Glu Asn Tyr Lys Asp Asp Met Ser Glu Ile Gln Glu Lys Ile Asp 
        275                 280                 285             


Phe Val Gly Leu Asn Tyr Tyr Ser Gly His Leu Val Lys Phe Asp Pro 
    290                 295                 300                 


Asp Ala Pro Ala Lys Val Ser Phe Val Glu Arg Asp Leu Pro Lys Thr 
305                 310                 315                 320 


Ala Met Gly Trp Glu Ile Val Pro Glu Gly Ile Tyr Trp Ile Leu Lys 
                325                 330                 335     


Lys Val Lys Glu Glu Tyr Asn Pro Pro Glu Val Tyr Ile Thr Glu Asn 
            340                 345                 350         


Gly Ala Ala Phe Asp Asp Val Val Ser Glu Asp Gly Arg Val His Asp 
        355                 360                 365             


Gln Asn Arg Ile Asp Tyr Leu Lys Ala His Ile Gly Gln Ala Trp Lys 
    370                 375                 380                 


Ala Ile Gln Glu Gly Val Pro Leu Lys Gly Tyr Phe Val Trp Ser Leu 
385                 390                 395                 400 


Leu Asp Asn Phe Glu Trp Ala Glu Gly Tyr Ser Lys Arg Phe Gly Ile 
                405                 410                 415     


Val Tyr Val Asp Tyr Ser Thr Gln Lys Arg Ile Val Lys Asp Ser Gly 
            420                 425                 430         


Tyr Trp Tyr Ser Asn Val Val Lys Asn Asn Gly Leu Glu Asp 
        435                 440                 445     


<210> 393
<211> 2229
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 393
gtggaaagaa tcccatttgt tgaagattta ctggcccaaa tgaccctgga agagaaagtg     60

gggcaaatgg cacagataac ccttgatgtg attaccattg gggaagataa cactgtatct    120

gacgaaccca ttgcactgga catggacctg gtaagatggg cagtggcaga atacggtgtg    180

ggttcggtgc tcaacaccgc caataacagg gcgcgtaccc gcgaaaaatg gcacaccatc    240

gtcagccagc tgcaggatgt tgccatcaat gaaacccgcc tgggtattcc tgttttgtat    300

ggtattgatg ccattcatgg aacaacttac actgccggtg caaccttttt tccacagcaa    360

atcgggcagg ccgctacttt taatccttca ttggtgcgca aagcagctga aattactgcc    420

tacgaaaccc gtgcctctgc catcccctgg actttctctc cggtacttga tatggggcgc    480

gatcctcgtt tcccaaggat gtgggaaacc tttggggaag atgtttatct tgctaccgtg    540

cttggcgtag aaatgatcaa aggatatgaa ggggagaata atgacattag tgatcctttc    600

agagtagcat cctgtgcgaa gcattacctg ggttacagtg tcccggtttc aggaaaagac    660

cggaccccgg ctctgatacc ggaaattgaa cttcgcgaaa gacacctgcc cccgtttcag    720

gcagctgtgg aagccggtac ccacaccatt atggtaaact ccggccttat caacggagtg    780

cccgttcatg cctcttatga actgcttacg gaaatgctga agaaagaact tgggtttaca    840

ggattgttgt taaccgactg gaccgatatt gagaatctgc atacccgcga ccgtattgcc    900

gccacatcaa aagaagcagt taagctggcc ataaatgcag gaattgatat gtccatgatt    960

ccctacgatc tcgacttttg tgattacctg attgaactgg tgaatgaagg agaagttccc   1020

atgtcgcgta tcgacgatgc cgtgagacgg attcttaata caaaatacaa gcttggcctt   1080

tttgaaactc ctgttactta tcacagtgat tacccgctgt ttggaagcga cgaatttatt   1140

gaagttgcgt accagacagc acaggaatcc ataacactgc ttaaaaatga aaacaatgtt   1200

ttgcctttgc gtaaaaacgc tcgggtgctg gtaaccggac ccaattccaa ctctatgcga   1260

tcactcaatg gtggctggag ctattcatgg cagggagaga aagtggatga gtttgctgaa   1320

gaatattcta ccattctaga tgccatccgg gaaaaagttg gtgaaaataa cgctgtattt   1380

cgcgaagggg ttcgatatga taatgaaagc cagtactggg tagatgaagc ttttgatata   1440

cagggagctg taagagctgc agcgcaggta gattacatca tcattgcatt aggagaaaac   1500

tcttatgctg agaaaccagg cgatttacat gatctttctt tgtcacaaaa tcagattgag   1560

cttgccaaag cccttgccaa aaccggtaag cccatgatac tggttttgaa ccagggtcgc   1620

ccgcgtatca tcagggaaat cgaacccctg atgagtggaa taatcaatgc ttatctgccg   1680

ggaaactacg gcggacctgc tgttgcagat gtgatttttg gagattataa ccccaatggg   1740

aaactccctt ttacctatcc gctttatgtc aattctctgg ttacctatga tcataaaccc   1800

tcagaagatc aggcgagaat ggagggggtg tatgattatg aatcggattt tgccattcaa   1860

tatgaatttg gttttggttt gagttatacc acctttgaat attccaacct gaccattagc   1920

aatgaaaggt tgtctatgaa cgggaagctg gaagttaccg ttgatgtaaa aaataccggt   1980

gaactgccag gtcaggaagt ggtgcagtta tacacttcgg cacattatac ttcagttaca   2040

cctgatgtga aaaggctcag aggtttcagt aagatccacc tcgaacccgg acaaaagaaa   2100

tcggtatcct ttaccctttc tccatctgat atttctttta tcaatgcgca aaacagaagg   2160

gtaaatgaac ccggtgccta tgatgtgctt atcgaaagtc tttcctcaac cttcaggctt   2220

gtagaatag                                                           2229

<210> 394
<211> 742
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (88)...(320)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (391)...(631)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (65)...(68)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (91)...(94)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (276)...(293)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (475)...(478)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (645)...(648)
<223> N-glycosylation site. Prosite id = PS00001

<400> 394
Met Glu Arg Ile Pro Phe Val Glu Asp Leu Leu Ala Gln Met Thr Leu 
1               5                   10                  15      


Glu Glu Lys Val Gly Gln Met Ala Gln Ile Thr Leu Asp Val Ile Thr 
            20                  25                  30          


Ile Gly Glu Asp Asn Thr Val Ser Asp Glu Pro Ile Ala Leu Asp Met 
        35                  40                  45              


Asp Leu Val Arg Trp Ala Val Ala Glu Tyr Gly Val Gly Ser Val Leu 
    50                  55                  60                  


Asn Thr Ala Asn Asn Arg Ala Arg Thr Arg Glu Lys Trp His Thr Ile 
65                  70                  75                  80  


Val Ser Gln Leu Gln Asp Val Ala Ile Asn Glu Thr Arg Leu Gly Ile 
                85                  90                  95      


Pro Val Leu Tyr Gly Ile Asp Ala Ile His Gly Thr Thr Tyr Thr Ala 
            100                 105                 110         


Gly Ala Thr Phe Phe Pro Gln Gln Ile Gly Gln Ala Ala Thr Phe Asn 
        115                 120                 125             


Pro Ser Leu Val Arg Lys Ala Ala Glu Ile Thr Ala Tyr Glu Thr Arg 
    130                 135                 140                 


Ala Ser Ala Ile Pro Trp Thr Phe Ser Pro Val Leu Asp Met Gly Arg 
145                 150                 155                 160 


Asp Pro Arg Phe Pro Arg Met Trp Glu Thr Phe Gly Glu Asp Val Tyr 
                165                 170                 175     


Leu Ala Thr Val Leu Gly Val Glu Met Ile Lys Gly Tyr Glu Gly Glu 
            180                 185                 190         


Asn Asn Asp Ile Ser Asp Pro Phe Arg Val Ala Ser Cys Ala Lys His 
        195                 200                 205             


Tyr Leu Gly Tyr Ser Val Pro Val Ser Gly Lys Asp Arg Thr Pro Ala 
    210                 215                 220                 


Leu Ile Pro Glu Ile Glu Leu Arg Glu Arg His Leu Pro Pro Phe Gln 
225                 230                 235                 240 


Ala Ala Val Glu Ala Gly Thr His Thr Ile Met Val Asn Ser Gly Leu 
                245                 250                 255     


Ile Asn Gly Val Pro Val His Ala Ser Tyr Glu Leu Leu Thr Glu Met 
            260                 265                 270         


Leu Lys Lys Glu Leu Gly Phe Thr Gly Leu Leu Leu Thr Asp Trp Thr 
        275                 280                 285             


Asp Ile Glu Asn Leu His Thr Arg Asp Arg Ile Ala Ala Thr Ser Lys 
    290                 295                 300                 


Glu Ala Val Lys Leu Ala Ile Asn Ala Gly Ile Asp Met Ser Met Ile 
305                 310                 315                 320 


Pro Tyr Asp Leu Asp Phe Cys Asp Tyr Leu Ile Glu Leu Val Asn Glu 
                325                 330                 335     


Gly Glu Val Pro Met Ser Arg Ile Asp Asp Ala Val Arg Arg Ile Leu 
            340                 345                 350         


Asn Thr Lys Tyr Lys Leu Gly Leu Phe Glu Thr Pro Val Thr Tyr His 
        355                 360                 365             


Ser Asp Tyr Pro Leu Phe Gly Ser Asp Glu Phe Ile Glu Val Ala Tyr 
    370                 375                 380                 


Gln Thr Ala Gln Glu Ser Ile Thr Leu Leu Lys Asn Glu Asn Asn Val 
385                 390                 395                 400 


Leu Pro Leu Arg Lys Asn Ala Arg Val Leu Val Thr Gly Pro Asn Ser 
                405                 410                 415     


Asn Ser Met Arg Ser Leu Asn Gly Gly Trp Ser Tyr Ser Trp Gln Gly 
            420                 425                 430         


Glu Lys Val Asp Glu Phe Ala Glu Glu Tyr Ser Thr Ile Leu Asp Ala 
        435                 440                 445             


Ile Arg Glu Lys Val Gly Glu Asn Asn Ala Val Phe Arg Glu Gly Val 
    450                 455                 460                 


Arg Tyr Asp Asn Glu Ser Gln Tyr Trp Val Asp Glu Ala Phe Asp Ile 
465                 470                 475                 480 


Gln Gly Ala Val Arg Ala Ala Ala Gln Val Asp Tyr Ile Ile Ile Ala 
                485                 490                 495     


Leu Gly Glu Asn Ser Tyr Ala Glu Lys Pro Gly Asp Leu His Asp Leu 
            500                 505                 510         


Ser Leu Ser Gln Asn Gln Ile Glu Leu Ala Lys Ala Leu Ala Lys Thr 
        515                 520                 525             


Gly Lys Pro Met Ile Leu Val Leu Asn Gln Gly Arg Pro Arg Ile Ile 
    530                 535                 540                 


Arg Glu Ile Glu Pro Leu Met Ser Gly Ile Ile Asn Ala Tyr Leu Pro 
545                 550                 555                 560 


Gly Asn Tyr Gly Gly Pro Ala Val Ala Asp Val Ile Phe Gly Asp Tyr 
                565                 570                 575     


Asn Pro Asn Gly Lys Leu Pro Phe Thr Tyr Pro Leu Tyr Val Asn Ser 
            580                 585                 590         


Leu Val Thr Tyr Asp His Lys Pro Ser Glu Asp Gln Ala Arg Met Glu 
        595                 600                 605             


Gly Val Tyr Asp Tyr Glu Ser Asp Phe Ala Ile Gln Tyr Glu Phe Gly 
    610                 615                 620                 


Phe Gly Leu Ser Tyr Thr Thr Phe Glu Tyr Ser Asn Leu Thr Ile Ser 
625                 630                 635                 640 


Asn Glu Arg Leu Ser Met Asn Gly Lys Leu Glu Val Thr Val Asp Val 
                645                 650                 655     


Lys Asn Thr Gly Glu Leu Pro Gly Gln Glu Val Val Gln Leu Tyr Thr 
            660                 665                 670         


Ser Ala His Tyr Thr Ser Val Thr Pro Asp Val Lys Arg Leu Arg Gly 
        675                 680                 685             


Phe Ser Lys Ile His Leu Glu Pro Gly Gln Lys Lys Ser Val Ser Phe 
    690                 695                 700                 


Thr Leu Ser Pro Ser Asp Ile Ser Phe Ile Asn Ala Gln Asn Arg Arg 
705                 710                 715                 720 


Val Asn Glu Pro Gly Ala Tyr Asp Val Leu Ile Glu Ser Leu Ser Ser 
                725                 730                 735     


Thr Phe Arg Leu Val Glu 
            740         


<210> 395
<211> 2232
<212> DNA
<213> Bacteria

<400> 395
atgtctcatt ccaagaagct tattttaacc ggtagtcttt cagcggttgc gctttgcgcg     60

atgatgttga cgcccgccac cgccggaaaa gccaggtcac tgcgaacatc tgaacaggcc    120

agcgaaatgg cggcgaagac gctgtcgcaa atgacggccg aagaaaaaac gatccttacc    180

cacgggatca tgcctcttcc gcttggaccc gaggcaccca aaattcccga cgatgcaatt    240

ccgggagccg gctacattcc tggaattccc cgccttaatg tcccagcgct gaaagagacg    300

gacgcgagcc tcggtgtggc ctatgccttc ggtatccggc aggacggagc tacggccctt    360

ccttcgggtc tcgctatcgc ctcgacttgg aacgatgagt tggccgaggt tgcaggtcgg    420

atgattggcc aggaggcacg cgcgaaaggc ttcaatgtca tgttggcggg aggcgtcaat    480

cttgctcgcg accccagaaa cggtcgaaat ttcgagtatt ttggcgagga ccccttgctg    540

agcggtgtca tcgcaggtcg gtcgatccga ggtattcaat ctaacaacat catttctacg    600

attaagcact ttgcgttgaa cggccaggaa accggccgaa aagtcgtcga ctctcggatt    660

tcagaaggcg cggccatgga gagcgacctg ctagccttca agatcgggat tgaactggga    720

aaccccggtt cggtgatgtg cgcctacaat cttgtgaatg gacatccatc ctgttcaaat    780

gattggcttt tgaacaaagt tctgaaacag gattgggggt ataaaggctt tgtcatgtcc    840

gactggggcg ccgtacccgg cctggaggct gcaataaacg ggctggatca acaatctggc    900

gcgcagttgg acccagcagt attcttcgat cagcccttgg ctgaagccgg aaggaccgat    960

gccaattatc gcaagcggat cgacgacatg aatcgaagga tcctctgggc aatctattcc   1020

aacgggctgg acgttcaccc cgtgaccccc ggcggggata tcgatttcac cgctaacgcc   1080

accattgcgc aaacagtcgc cgaacaaggc atcgttctgc ttcgcaacga gcggaatatt   1140

ttgcctttgg ctaggtctgc aaagcgaatt atcgtaatcg gcggctatgc cgatgcaggc   1200

gtactgtccg gcgccggttc aagccaggta cagggcgaag gtggaccatc agtagtggtg   1260

ccattagggg gcgccacgcc gtggtcaggc ttcgcgaatc aagcttatca ccgttccgtt   1320

ccggtcgacg cgattaaggc gatggcgcca tctgccgagg ttcgtttccg cgacggtcgc   1380

tacctctccg aggccgtcac gcaggcaaag aatgctgatg tcgttattgt gtttgcaaca   1440

cagtggtccg gcgagggctt cgaccagccg gatctcgcgc tgcctaatgg acaagatgcg   1500

ctgatcgaag ctgtggcaaa agcaaatccg aataccatcg tcgttctcga gacaggcggg   1560

cctatagtca tgccttggct cgatcacact gcagcagtgg tccaggcttg gtatccaggc   1620

gcgcgcggcg gggaggcaat cgcatctgta ctgttcggca cagtcagtcc gtccggccgt   1680

ctgccgatta ctttcccggc cagcgaagat cagcttccgc gaccgaagct ggacggattt   1740

gatacgatcg agccaaattt ctcgggcctg tctaccggtt attccgggga tctcgtggtg   1800

gactacgaca tcgaagggtc ggacgtcggg tatcgatggt tcgcacgcaa agagcataag   1860

ccgcttttcc cgttcggctt cggcctttcc tacacgcaat ttgccaactc ggggttgcaa   1920

accgacgggc gcgtcgcaag gttctcagta agaaatgtcg gtgacagaag cggtgccgtc   1980

gtcacccagc tctatctggt gagccgcgcc ggcgagacga agcggcgttt gctcggttat   2040

cagcgcttgg atctcaaggc gggcgaaacg cgaaatgcgg tcttgcaaat cgatcaacgc   2100

ctcctggcgg actggaagga cggccagtgg acaatcgtgg ctggtgaata tgaattcgcg   2160

ctgggggata atgccgagca gctcgagcga tcagtcaagg tacgcctgcc tgcccgaact   2220

tggcgggact ga                                                       2232

<210> 396
<211> 743
<212> PRT
<213> Bacteria

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (86)...(300)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (371)...(633)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (364)...(367)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (595)...(598)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (596)...(621)
<223> Sugar transport proteins signature 2. Prosite id = PS00217

<400> 396
Met Ser His Ser Lys Lys Leu Ile Leu Thr Gly Ser Leu Ser Ala Val 
1               5                   10                  15      


Ala Leu Cys Ala Met Met Leu Thr Pro Ala Thr Ala Gly Lys Ala Arg 
            20                  25                  30          


Ser Leu Arg Thr Ser Glu Gln Ala Ser Glu Met Ala Ala Lys Thr Leu 
        35                  40                  45              


Ser Gln Met Thr Ala Glu Glu Lys Thr Ile Leu Thr His Gly Ile Met 
    50                  55                  60                  


Pro Leu Pro Leu Gly Pro Glu Ala Pro Lys Ile Pro Asp Asp Ala Ile 
65                  70                  75                  80  


Pro Gly Ala Gly Tyr Ile Pro Gly Ile Pro Arg Leu Asn Val Pro Ala 
                85                  90                  95      


Leu Lys Glu Thr Asp Ala Ser Leu Gly Val Ala Tyr Ala Phe Gly Ile 
            100                 105                 110         


Arg Gln Asp Gly Ala Thr Ala Leu Pro Ser Gly Leu Ala Ile Ala Ser 
        115                 120                 125             


Thr Trp Asn Asp Glu Leu Ala Glu Val Ala Gly Arg Met Ile Gly Gln 
    130                 135                 140                 


Glu Ala Arg Ala Lys Gly Phe Asn Val Met Leu Ala Gly Gly Val Asn 
145                 150                 155                 160 


Leu Ala Arg Asp Pro Arg Asn Gly Arg Asn Phe Glu Tyr Phe Gly Glu 
                165                 170                 175     


Asp Pro Leu Leu Ser Gly Val Ile Ala Gly Arg Ser Ile Arg Gly Ile 
            180                 185                 190         


Gln Ser Asn Asn Ile Ile Ser Thr Ile Lys His Phe Ala Leu Asn Gly 
        195                 200                 205             


Gln Glu Thr Gly Arg Lys Val Val Asp Ser Arg Ile Ser Glu Gly Ala 
    210                 215                 220                 


Ala Met Glu Ser Asp Leu Leu Ala Phe Lys Ile Gly Ile Glu Leu Gly 
225                 230                 235                 240 


Asn Pro Gly Ser Val Met Cys Ala Tyr Asn Leu Val Asn Gly His Pro 
                245                 250                 255     


Ser Cys Ser Asn Asp Trp Leu Leu Asn Lys Val Leu Lys Gln Asp Trp 
            260                 265                 270         


Gly Tyr Lys Gly Phe Val Met Ser Asp Trp Gly Ala Val Pro Gly Leu 
        275                 280                 285             


Glu Ala Ala Ile Asn Gly Leu Asp Gln Gln Ser Gly Ala Gln Leu Asp 
    290                 295                 300                 


Pro Ala Val Phe Phe Asp Gln Pro Leu Ala Glu Ala Gly Arg Thr Asp 
305                 310                 315                 320 


Ala Asn Tyr Arg Lys Arg Ile Asp Asp Met Asn Arg Arg Ile Leu Trp 
                325                 330                 335     


Ala Ile Tyr Ser Asn Gly Leu Asp Val His Pro Val Thr Pro Gly Gly 
            340                 345                 350         


Asp Ile Asp Phe Thr Ala Asn Ala Thr Ile Ala Gln Thr Val Ala Glu 
        355                 360                 365             


Gln Gly Ile Val Leu Leu Arg Asn Glu Arg Asn Ile Leu Pro Leu Ala 
    370                 375                 380                 


Arg Ser Ala Lys Arg Ile Ile Val Ile Gly Gly Tyr Ala Asp Ala Gly 
385                 390                 395                 400 


Val Leu Ser Gly Ala Gly Ser Ser Gln Val Gln Gly Glu Gly Gly Pro 
                405                 410                 415     


Ser Val Val Val Pro Leu Gly Gly Ala Thr Pro Trp Ser Gly Phe Ala 
            420                 425                 430         


Asn Gln Ala Tyr His Arg Ser Val Pro Val Asp Ala Ile Lys Ala Met 
        435                 440                 445             


Ala Pro Ser Ala Glu Val Arg Phe Arg Asp Gly Arg Tyr Leu Ser Glu 
    450                 455                 460                 


Ala Val Thr Gln Ala Lys Asn Ala Asp Val Val Ile Val Phe Ala Thr 
465                 470                 475                 480 


Gln Trp Ser Gly Glu Gly Phe Asp Gln Pro Asp Leu Ala Leu Pro Asn 
                485                 490                 495     


Gly Gln Asp Ala Leu Ile Glu Ala Val Ala Lys Ala Asn Pro Asn Thr 
            500                 505                 510         


Ile Val Val Leu Glu Thr Gly Gly Pro Ile Val Met Pro Trp Leu Asp 
        515                 520                 525             


His Thr Ala Ala Val Val Gln Ala Trp Tyr Pro Gly Ala Arg Gly Gly 
    530                 535                 540                 


Glu Ala Ile Ala Ser Val Leu Phe Gly Thr Val Ser Pro Ser Gly Arg 
545                 550                 555                 560 


Leu Pro Ile Thr Phe Pro Ala Ser Glu Asp Gln Leu Pro Arg Pro Lys 
                565                 570                 575     


Leu Asp Gly Phe Asp Thr Ile Glu Pro Asn Phe Ser Gly Leu Ser Thr 
            580                 585                 590         


Gly Tyr Ser Gly Asp Leu Val Val Asp Tyr Asp Ile Glu Gly Ser Asp 
        595                 600                 605             


Val Gly Tyr Arg Trp Phe Ala Arg Lys Glu His Lys Pro Leu Phe Pro 
    610                 615                 620                 


Phe Gly Phe Gly Leu Ser Tyr Thr Gln Phe Ala Asn Ser Gly Leu Gln 
625                 630                 635                 640 


Thr Asp Gly Arg Val Ala Arg Phe Ser Val Arg Asn Val Gly Asp Arg 
                645                 650                 655     


Ser Gly Ala Val Val Thr Gln Leu Tyr Leu Val Ser Arg Ala Gly Glu 
            660                 665                 670         


Thr Lys Arg Arg Leu Leu Gly Tyr Gln Arg Leu Asp Leu Lys Ala Gly 
        675                 680                 685             


Glu Thr Arg Asn Ala Val Leu Gln Ile Asp Gln Arg Leu Leu Ala Asp 
    690                 695                 700                 


Trp Lys Asp Gly Gln Trp Thr Ile Val Ala Gly Glu Tyr Glu Phe Ala 
705                 710                 715                 720 


Leu Gly Asp Asn Ala Glu Gln Leu Glu Arg Ser Val Lys Val Arg Leu 
                725                 730                 735     


Pro Ala Arg Thr Trp Arg Asp 
            740             


<210> 397
<211> 1434
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 397
gtgagcaacc ctgccacccc gcccgcagtc ggcgtcctcg acgagcgtgc gccgctgacc     60

ttcccgccgg gcttcctctg gggagcggcc actgccgcgt accagatcga aggggcagcg    120

gccgagggtg ggcgcacccc gtcgatctgg gacaccttca gccacacgga gggcaagacg    180

gtctccgggc acaccggtga cgtcgcctgc gaccactacc accggctctc cgacgacgtg    240

cggctgatgg ccgagctggg gttgaagtcg taccgcttct ccgtctcctg gccgcgggtg    300

cagccgggcg ggtcgggacc ggtgaacgcc gaagggctgg acttctaccg gcggctggtc    360

gacgagttgc tgaccaacgg catcgagccc tggatcaccc tctaccactg ggacctgccc    420

caggagttgg aggacgccgg cggttggccg gcccgggaca ccgccgcccg gttcgccgac    480

tacgcccagc tgatggcgga cgcgctgggt gaccgggtga agtactggac caccctcaac    540

gagccctggt gctcggcctt cctcggctac ggctccggcg tacacgcgcc gggccgctcg    600

gacggcgccg ccgccgtcca ggccgggcac cacctgatgc tcggccacgg gctcgcggtg    660

caggcgctgc gcgcggctcg gccggaggcg cagctcggcg tgaccgtcaa cctgtacccg    720

gtcacgccgg ccagcgacac gcccggcgac gtggacgccg cccggcgcat cgacgggctg    780

gccaaccggt tcttcctcga cccgctgctg cgcggggagt accccgcgga cctggtcgcc    840

gacctggcca aggtgaccga cttcgggcac gtgcgggacg gggacctggc cgtgatcgcc    900

acgccgctgg acctggtcgg ggtgaactac tacagccggc acgtggtggc cgcgccggca    960

gccggcgagg agccggagaa gtactggcgg gcgccgtcct gctggccggg cagcgaggag   1020

gtccggttcg tcacccgggg cgtgccggtg accgacatgg gctgggagat cgacgcaccc   1080

ggcctggtgg agacgctgcg ccgggtccac gaggagtaca ccgacctgcc gctctacgtg   1140

accgagaacg ggtccgcctt cgtcgacgcg gtggtcgacg gccgggtgga cgacaccgac   1200

cggctggcgt acttcgacgc gcacctgcgg gcctcgcacg aagcgatcag cgccggagtg   1260

cccctgcagg ggtactttgc ctggtcgctg ttggataatt tcgaatgggc ctggggttac   1320

accaagcggt tcggcatggt ctacgtcgac tacgacagcc agaagcgcat tcccaagtcc   1380

agtgccaggt ggtacgcgga ggtgattcga cgcaacggtc tggccgcaca ataa         1434

<210> 398
<211> 477
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (17)...(474)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (25)...(39)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (388)...(391)
<223> N-glycosylation site. Prosite id = PS00001

<400> 398
Met Ser Asn Pro Ala Thr Pro Pro Ala Val Gly Val Leu Asp Glu Arg 
1               5                   10                  15      


Ala Pro Leu Thr Phe Pro Pro Gly Phe Leu Trp Gly Ala Ala Thr Ala 
            20                  25                  30          


Ala Tyr Gln Ile Glu Gly Ala Ala Ala Glu Gly Gly Arg Thr Pro Ser 
        35                  40                  45              


Ile Trp Asp Thr Phe Ser His Thr Glu Gly Lys Thr Val Ser Gly His 
    50                  55                  60                  


Thr Gly Asp Val Ala Cys Asp His Tyr His Arg Leu Ser Asp Asp Val 
65                  70                  75                  80  


Arg Leu Met Ala Glu Leu Gly Leu Lys Ser Tyr Arg Phe Ser Val Ser 
                85                  90                  95      


Trp Pro Arg Val Gln Pro Gly Gly Ser Gly Pro Val Asn Ala Glu Gly 
            100                 105                 110         


Leu Asp Phe Tyr Arg Arg Leu Val Asp Glu Leu Leu Thr Asn Gly Ile 
        115                 120                 125             


Glu Pro Trp Ile Thr Leu Tyr His Trp Asp Leu Pro Gln Glu Leu Glu 
    130                 135                 140                 


Asp Ala Gly Gly Trp Pro Ala Arg Asp Thr Ala Ala Arg Phe Ala Asp 
145                 150                 155                 160 


Tyr Ala Gln Leu Met Ala Asp Ala Leu Gly Asp Arg Val Lys Tyr Trp 
                165                 170                 175     


Thr Thr Leu Asn Glu Pro Trp Cys Ser Ala Phe Leu Gly Tyr Gly Ser 
            180                 185                 190         


Gly Val His Ala Pro Gly Arg Ser Asp Gly Ala Ala Ala Val Gln Ala 
        195                 200                 205             


Gly His His Leu Met Leu Gly His Gly Leu Ala Val Gln Ala Leu Arg 
    210                 215                 220                 


Ala Ala Arg Pro Glu Ala Gln Leu Gly Val Thr Val Asn Leu Tyr Pro 
225                 230                 235                 240 


Val Thr Pro Ala Ser Asp Thr Pro Gly Asp Val Asp Ala Ala Arg Arg 
                245                 250                 255     


Ile Asp Gly Leu Ala Asn Arg Phe Phe Leu Asp Pro Leu Leu Arg Gly 
            260                 265                 270         


Glu Tyr Pro Ala Asp Leu Val Ala Asp Leu Ala Lys Val Thr Asp Phe 
        275                 280                 285             


Gly His Val Arg Asp Gly Asp Leu Ala Val Ile Ala Thr Pro Leu Asp 
    290                 295                 300                 


Leu Val Gly Val Asn Tyr Tyr Ser Arg His Val Val Ala Ala Pro Ala 
305                 310                 315                 320 


Ala Gly Glu Glu Pro Glu Lys Tyr Trp Arg Ala Pro Ser Cys Trp Pro 
                325                 330                 335     


Gly Ser Glu Glu Val Arg Phe Val Thr Arg Gly Val Pro Val Thr Asp 
            340                 345                 350         


Met Gly Trp Glu Ile Asp Ala Pro Gly Leu Val Glu Thr Leu Arg Arg 
        355                 360                 365             


Val His Glu Glu Tyr Thr Asp Leu Pro Leu Tyr Val Thr Glu Asn Gly 
    370                 375                 380                 


Ser Ala Phe Val Asp Ala Val Val Asp Gly Arg Val Asp Asp Thr Asp 
385                 390                 395                 400 


Arg Leu Ala Tyr Phe Asp Ala His Leu Arg Ala Ser His Glu Ala Ile 
                405                 410                 415     


Ser Ala Gly Val Pro Leu Gln Gly Tyr Phe Ala Trp Ser Leu Leu Asp 
            420                 425                 430         


Asn Phe Glu Trp Ala Trp Gly Tyr Thr Lys Arg Phe Gly Met Val Tyr 
        435                 440                 445             


Val Asp Tyr Asp Ser Gln Lys Arg Ile Pro Lys Ser Ser Ala Arg Trp 
    450                 455                 460                 


Tyr Ala Glu Val Ile Arg Arg Asn Gly Leu Ala Ala Gln 
465                 470                 475         


<210> 399
<211> 1434
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 399
gtgagcaacc cagcaagccc gcccgccgtc ggcgtcctcg acgccggtcc ggccctcacc     60

ttcccgcccg gcttcctctg gggcgcggcc accgccgcgt accagatcga gggcgcggcg    120

acggagggcg gccgtgcccc gtcgatctgg gacaccttca gccacaccga gggccggacg    180

gtggccgggc acaccggcga cgtggcgtgc gaccactacc accggatgcc ggacgacgtg    240

cggttgatgg ccgacctggg cctgcagtcg taccgcttct cggtgagctg gccgcgggtg    300

cagccgggcg gcaccggcgg ggtcaaccag gagggcatgg acttctaccg ccgcctggtc    360

gacgaactgc tcggccacgg catcgagccg tggctgacgc tctaccactg ggacctgccg    420

cagccgctgg aggacgcggg cggctggccg gcccgggaca ccgccgcccg cttcgccgag    480

tactcccacc tggtcgccga ggcgctcggt gaccgggtga agtacttcac cacgctgaac    540

gagccgtggt gctcggcgtt cctcggttac ggctccggcg tgcacgctcc gggccgtaac    600

gacggcgcgg acgcggtccg ggccgggcac cacctgatgc tgggccacgg cctggccgtg    660

caggcggtgc gggcggcccg cccggaggcg cagctcggca tcaccgtcaa cctgtacccg    720

gtcaccccgg cgagcgaatc ggcggcggac gccgacgcgg cccgccggat cgacgcgctg    780

gccaaccggt tcttcctgga cccggtgctg cgcggggcgt acccggcgga cctcgtcgcg    840

gacctgcgtc aggtcaccga catggggcac gtgcgcgacg gtgacctggc gaccatctcc    900

accccgctgg acatggtcgg gatcaactac tacagccggc acgtggtggc cgcgcccgtc    960

gagggcgcgg agccggagcc ctactggcgg gcgccgtcct gctggccggg cagcgaggac   1020

gtgcggttcg tcacccgggg cgtcccggtg acggacatgg actgggagat cgacgccccg   1080

ggcctggtgg agacgctgga gcgggtgcac cgggagtaca ccgacctgcc gctctacgtc   1140

accgagaacg gctcggcgtt cgtcgacgag gtcgtcgacg gccgggtgga cgacccggac   1200

cggctggcct acttctccgc gcacctgcgc gcggcgcacg ccgcgatcga ggccggcgtg   1260

ccgctcaagg gctacttcgc ctggtcgctc ctggacaact tcgagtgggc ctggggctac   1320

acgaagcggt tcggcatggt ctatgtcgac tacgacagcc aggcccggat cgcgaagtcc   1380

agcgccaggt ggtacgccga cgtgatccga cgcaacggac tgcccgcaca ataa         1434

<210> 400
<211> 477
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (17)...(474)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (25)...(39)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (388)...(391)
<223> N-glycosylation site. Prosite id = PS00001

<400> 400
Met Ser Asn Pro Ala Ser Pro Pro Ala Val Gly Val Leu Asp Ala Gly 
1               5                   10                  15      


Pro Ala Leu Thr Phe Pro Pro Gly Phe Leu Trp Gly Ala Ala Thr Ala 
            20                  25                  30          


Ala Tyr Gln Ile Glu Gly Ala Ala Thr Glu Gly Gly Arg Ala Pro Ser 
        35                  40                  45              


Ile Trp Asp Thr Phe Ser His Thr Glu Gly Arg Thr Val Ala Gly His 
    50                  55                  60                  


Thr Gly Asp Val Ala Cys Asp His Tyr His Arg Met Pro Asp Asp Val 
65                  70                  75                  80  


Arg Leu Met Ala Asp Leu Gly Leu Gln Ser Tyr Arg Phe Ser Val Ser 
                85                  90                  95      


Trp Pro Arg Val Gln Pro Gly Gly Thr Gly Gly Val Asn Gln Glu Gly 
            100                 105                 110         


Met Asp Phe Tyr Arg Arg Leu Val Asp Glu Leu Leu Gly His Gly Ile 
        115                 120                 125             


Glu Pro Trp Leu Thr Leu Tyr His Trp Asp Leu Pro Gln Pro Leu Glu 
    130                 135                 140                 


Asp Ala Gly Gly Trp Pro Ala Arg Asp Thr Ala Ala Arg Phe Ala Glu 
145                 150                 155                 160 


Tyr Ser His Leu Val Ala Glu Ala Leu Gly Asp Arg Val Lys Tyr Phe 
                165                 170                 175     


Thr Thr Leu Asn Glu Pro Trp Cys Ser Ala Phe Leu Gly Tyr Gly Ser 
            180                 185                 190         


Gly Val His Ala Pro Gly Arg Asn Asp Gly Ala Asp Ala Val Arg Ala 
        195                 200                 205             


Gly His His Leu Met Leu Gly His Gly Leu Ala Val Gln Ala Val Arg 
    210                 215                 220                 


Ala Ala Arg Pro Glu Ala Gln Leu Gly Ile Thr Val Asn Leu Tyr Pro 
225                 230                 235                 240 


Val Thr Pro Ala Ser Glu Ser Ala Ala Asp Ala Asp Ala Ala Arg Arg 
                245                 250                 255     


Ile Asp Ala Leu Ala Asn Arg Phe Phe Leu Asp Pro Val Leu Arg Gly 
            260                 265                 270         


Ala Tyr Pro Ala Asp Leu Val Ala Asp Leu Arg Gln Val Thr Asp Met 
        275                 280                 285             


Gly His Val Arg Asp Gly Asp Leu Ala Thr Ile Ser Thr Pro Leu Asp 
    290                 295                 300                 


Met Val Gly Ile Asn Tyr Tyr Ser Arg His Val Val Ala Ala Pro Val 
305                 310                 315                 320 


Glu Gly Ala Glu Pro Glu Pro Tyr Trp Arg Ala Pro Ser Cys Trp Pro 
                325                 330                 335     


Gly Ser Glu Asp Val Arg Phe Val Thr Arg Gly Val Pro Val Thr Asp 
            340                 345                 350         


Met Asp Trp Glu Ile Asp Ala Pro Gly Leu Val Glu Thr Leu Glu Arg 
        355                 360                 365             


Val His Arg Glu Tyr Thr Asp Leu Pro Leu Tyr Val Thr Glu Asn Gly 
    370                 375                 380                 


Ser Ala Phe Val Asp Glu Val Val Asp Gly Arg Val Asp Asp Pro Asp 
385                 390                 395                 400 


Arg Leu Ala Tyr Phe Ser Ala His Leu Arg Ala Ala His Ala Ala Ile 
                405                 410                 415     


Glu Ala Gly Val Pro Leu Lys Gly Tyr Phe Ala Trp Ser Leu Leu Asp 
            420                 425                 430         


Asn Phe Glu Trp Ala Trp Gly Tyr Thr Lys Arg Phe Gly Met Val Tyr 
        435                 440                 445             


Val Asp Tyr Asp Ser Gln Ala Arg Ile Ala Lys Ser Ser Ala Arg Trp 
    450                 455                 460                 


Tyr Ala Asp Val Ile Arg Arg Asn Gly Leu Pro Ala Gln 
465                 470                 475         


<210> 401
<211> 2253
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 401
atgaatgcca cactaagaat aagtttaata ttattgataa tggtttcagg ctatgcttgt     60

aataacagtt ctgagaaaag aaaaaccgat cctcgcattg aggcaatcct gaaacaaatg    120

accctggaag aaaagatcgg ccagatgagc caggtgcaca gtccgggtgg cgatctttcc    180

gatggtttca gggcagccat ttccgcaggc gaaatcggtt cggtgctgaa tgaggcaagg    240

cctgaaatgg tcagggagat gcagcgtgtt gccgttgaag aaagcaggct gggcattcca    300

ctgcttttcg gccgcgacgt gatccatggg tttaaaacaa tatttccgat caatatcggg    360

cttgcggcaa ctttcaatcc tgagcttgtt cagcaaggcg ctgaaatagc agcacgggaa    420

gccaccagtg ttggcatcaa ctggaacttt gcccccatga ttgacctttc gcgcgatccg    480

cgctggggcc gcatggccga aagttatggt gaagacccgc tgatgaacac aataatgggg    540

cttgccatgc tgcgcggttt ccagggcgat gatctgagcc tgccacatac aatggcggcc    600

tgtgccaaac attttgcagg ttatggcgct gccgaaggcg gcagggatta caacaccgcc    660

agcatccccg aagtggaact atggaacacc catttccctc cgttcaaagc cctggcagat    720

gccggcgtgg ccacctttat gaccggtttc aatgagctga acggcattcc tgccacaggc    780

aatgtttttc ttttcagaga tgttctgaaa gaacgctggc agtttgaagg atttgtggtc    840

tcggactggg cttccattat cgaaatgact gagcacggat tcactgccgg cgataaagaa    900

gctgcggaaa aagccgtcct ggccggcgta gatatggaga tggccagcac atcctataag    960

gatcatctaa aggaattggt tacatccgga agagttccgg aaaaactgct tgatgatgcg   1020

gttcgaagaa tccttaaagt gaagtttgac ctgggtttgt ttgataatcc ctacgtggat   1080

ttggacaagg tgatcacaac acctcccgct gtacatttgg aagcagccag gaaaacagcg   1140

gttcagagct ttgttttgtt gaaaaatgaa aaccggacgc tgccactgaa ccacaacatt   1200

ggcagggtag ccgtaattgg ccccatggct catgaccgtt acgagcaact gggaacctgg   1260

gtttttgacg gcgataccaa cctgagcatc acacccctga tggcttttga agaatttctg   1320

ggtaaagaaa gagtacgttt tgcaaaagga gtccaaacaa cgcggagttt aggaaaagca   1380

ggttttactg aagccatagc tgcagccagg cagtcggaag ccgtggtgat ctttgcagga   1440

gaagaagcaa tacttaccgg cgaagcccac agccgggctt atcttgattt gcctggcgcc   1500

caaaacgatc tgatcagaga aattgccaaa accggcaaac cggtggttct ggtggtgatg   1560

actccccgcc cgttaacaat tggtgaaatt tcggaacacg ttgatgctgt gctctatgcc   1620

tggcatccgg gaacaatggc agggcctgct ttggtggatg taattttcgg catggaatca   1680

ccctccggga aactgcctgt aaccttccca aaagctgccg gacagattcc ggtttattat   1740

gctcacaaaa acaccggcag gccttttaat cccgatgatt ttatccccat ggaagatatt   1800

ccggtcagaa cctttcagac ttcgctgggc aacaccagtc attatttgga catcgggttc   1860

gacccattat atccctttgg gtttggtttg tcctacaccg aattcgaata tgatgagttc   1920

aggctttcag ccgaaagcat aggattgcag gaagcattga gtgtttcggt caggctgaca   1980

aatacaggcg aatttgaagc ggaagaagta gttcagttgt acatacgcga tctggttgct   2040

tctattacac ggccggtaaa agaattaaag gatttcacca gggtcaggct caagccaggt   2100

gaaacaaaaa cagtcagttt cgctttgcat ccgaaccaac tgggctttta cgattctcat   2160

ggaaactata ttgttgagcc tggtgaattc caggtttgga ctggaggtag ttcagaagct   2220

gagctttacg attttttcac tttgactgac taa                                2253

<210> 402
<211> 750
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (91)...(315)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (384)...(634)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (21)...(24)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (397)...(400)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (433)...(436)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (620)...(623)
<223> N-glycosylation site. Prosite id = PS00001

<400> 402
Met Asn Ala Thr Leu Arg Ile Ser Leu Ile Leu Leu Ile Met Val Ser 
1               5                   10                  15      


Gly Tyr Ala Cys Asn Asn Ser Ser Glu Lys Arg Lys Thr Asp Pro Arg 
            20                  25                  30          


Ile Glu Ala Ile Leu Lys Gln Met Thr Leu Glu Glu Lys Ile Gly Gln 
        35                  40                  45              


Met Ser Gln Val His Ser Pro Gly Gly Asp Leu Ser Asp Gly Phe Arg 
    50                  55                  60                  


Ala Ala Ile Ser Ala Gly Glu Ile Gly Ser Val Leu Asn Glu Ala Arg 
65                  70                  75                  80  


Pro Glu Met Val Arg Glu Met Gln Arg Val Ala Val Glu Glu Ser Arg 
                85                  90                  95      


Leu Gly Ile Pro Leu Leu Phe Gly Arg Asp Val Ile His Gly Phe Lys 
            100                 105                 110         


Thr Ile Phe Pro Ile Asn Ile Gly Leu Ala Ala Thr Phe Asn Pro Glu 
        115                 120                 125             


Leu Val Gln Gln Gly Ala Glu Ile Ala Ala Arg Glu Ala Thr Ser Val 
    130                 135                 140                 


Gly Ile Asn Trp Asn Phe Ala Pro Met Ile Asp Leu Ser Arg Asp Pro 
145                 150                 155                 160 


Arg Trp Gly Arg Met Ala Glu Ser Tyr Gly Glu Asp Pro Leu Met Asn 
                165                 170                 175     


Thr Ile Met Gly Leu Ala Met Leu Arg Gly Phe Gln Gly Asp Asp Leu 
            180                 185                 190         


Ser Leu Pro His Thr Met Ala Ala Cys Ala Lys His Phe Ala Gly Tyr 
        195                 200                 205             


Gly Ala Ala Glu Gly Gly Arg Asp Tyr Asn Thr Ala Ser Ile Pro Glu 
    210                 215                 220                 


Val Glu Leu Trp Asn Thr His Phe Pro Pro Phe Lys Ala Leu Ala Asp 
225                 230                 235                 240 


Ala Gly Val Ala Thr Phe Met Thr Gly Phe Asn Glu Leu Asn Gly Ile 
                245                 250                 255     


Pro Ala Thr Gly Asn Val Phe Leu Phe Arg Asp Val Leu Lys Glu Arg 
            260                 265                 270         


Trp Gln Phe Glu Gly Phe Val Val Ser Asp Trp Ala Ser Ile Ile Glu 
        275                 280                 285             


Met Thr Glu His Gly Phe Thr Ala Gly Asp Lys Glu Ala Ala Glu Lys 
    290                 295                 300                 


Ala Val Leu Ala Gly Val Asp Met Glu Met Ala Ser Thr Ser Tyr Lys 
305                 310                 315                 320 


Asp His Leu Lys Glu Leu Val Thr Ser Gly Arg Val Pro Glu Lys Leu 
                325                 330                 335     


Leu Asp Asp Ala Val Arg Arg Ile Leu Lys Val Lys Phe Asp Leu Gly 
            340                 345                 350         


Leu Phe Asp Asn Pro Tyr Val Asp Leu Asp Lys Val Ile Thr Thr Pro 
        355                 360                 365             


Pro Ala Val His Leu Glu Ala Ala Arg Lys Thr Ala Val Gln Ser Phe 
    370                 375                 380                 


Val Leu Leu Lys Asn Glu Asn Arg Thr Leu Pro Leu Asn His Asn Ile 
385                 390                 395                 400 


Gly Arg Val Ala Val Ile Gly Pro Met Ala His Asp Arg Tyr Glu Gln 
                405                 410                 415     


Leu Gly Thr Trp Val Phe Asp Gly Asp Thr Asn Leu Ser Ile Thr Pro 
            420                 425                 430         


Leu Met Ala Phe Glu Glu Phe Leu Gly Lys Glu Arg Val Arg Phe Ala 
        435                 440                 445             


Lys Gly Val Gln Thr Thr Arg Ser Leu Gly Lys Ala Gly Phe Thr Glu 
    450                 455                 460                 


Ala Ile Ala Ala Ala Arg Gln Ser Glu Ala Val Val Ile Phe Ala Gly 
465                 470                 475                 480 


Glu Glu Ala Ile Leu Thr Gly Glu Ala His Ser Arg Ala Tyr Leu Asp 
                485                 490                 495     


Leu Pro Gly Ala Gln Asn Asp Leu Ile Arg Glu Ile Ala Lys Thr Gly 
            500                 505                 510         


Lys Pro Val Val Leu Val Val Met Thr Pro Arg Pro Leu Thr Ile Gly 
        515                 520                 525             


Glu Ile Ser Glu His Val Asp Ala Val Leu Tyr Ala Trp His Pro Gly 
    530                 535                 540                 


Thr Met Ala Gly Pro Ala Leu Val Asp Val Ile Phe Gly Met Glu Ser 
545                 550                 555                 560 


Pro Ser Gly Lys Leu Pro Val Thr Phe Pro Lys Ala Ala Gly Gln Ile 
                565                 570                 575     


Pro Val Tyr Tyr Ala His Lys Asn Thr Gly Arg Pro Phe Asn Pro Asp 
            580                 585                 590         


Asp Phe Ile Pro Met Glu Asp Ile Pro Val Arg Thr Phe Gln Thr Ser 
        595                 600                 605             


Leu Gly Asn Thr Ser His Tyr Leu Asp Ile Gly Phe Asp Pro Leu Tyr 
    610                 615                 620                 


Pro Phe Gly Phe Gly Leu Ser Tyr Thr Glu Phe Glu Tyr Asp Glu Phe 
625                 630                 635                 640 


Arg Leu Ser Ala Glu Ser Ile Gly Leu Gln Glu Ala Leu Ser Val Ser 
                645                 650                 655     


Val Arg Leu Thr Asn Thr Gly Glu Phe Glu Ala Glu Glu Val Val Gln 
            660                 665                 670         


Leu Tyr Ile Arg Asp Leu Val Ala Ser Ile Thr Arg Pro Val Lys Glu 
        675                 680                 685             


Leu Lys Asp Phe Thr Arg Val Arg Leu Lys Pro Gly Glu Thr Lys Thr 
    690                 695                 700                 


Val Ser Phe Ala Leu His Pro Asn Gln Leu Gly Phe Tyr Asp Ser His 
705                 710                 715                 720 


Gly Asn Tyr Ile Val Glu Pro Gly Glu Phe Gln Val Trp Thr Gly Gly 
                725                 730                 735     


Ser Ser Glu Ala Glu Leu Tyr Asp Phe Phe Thr Leu Thr Asp 
            740                 745                 750 


<210> 403
<211> 2268
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 403
atgacagacg aaaaattgca ggcactcctg aaagacatga gtctgaaaga aaagatcttc     60

cagctggtgc agatccccgg cagattcttt ctggaaaaca cggaagacac cggtactgcg    120

gtggaggaga atctcaccgc ggaagagctg cacctgacgg gctgcacgct cagtgtttac    180

ggcgcggagg agctgcgaaa aattcagcag gattgcgtga agaaccatcc gcatcatata    240

ccgatccttt ttatggcgga cattattcac ggataccgca cgatttatcc tatgccgctt    300

gcccagggcg ctatgtttga tgattcgctg acagaggaac tctgtgccat ggcggcggag    360

gaaatgcggc tcggaggtgt acacgttacc tttgcgccga tgcttgatct tgtgcgcgat    420

gcgcgctggg ggaggatcat ggaatctacc ggagaggacc ccatgctgaa ctgccgcctg    480

ggaaaggcga ccgtgcgggg cttccgggca ggagagaata aaatccccgg ggaaaaagga    540

gtagcttcct gtattaagca ttttgcggca tacggcgctg cggaggcagg cagggattac    600

accaacaccg aggtttcgga acatactctg agagaatatt atctcaagag ctataaggca    660

gcaatcgatg ccggcgcaga tatggtgatg acgtccttca acacgctcaa cggcatgccc    720

gccacgggga atcgctggct gatgaagcag gtgctgagag acgagtgggg atttgacggc    780

gtgctgatct ccgactgggg cgccatcggg gaaatggtac agcacggcgt atgcagggat    840

ctgcgtgaag cggcaaagct ggccattgaa aatggcgttg atgttgatat gtgttcgctt    900

gcttacgcac ggtatctgga ggagcttgtc gtttccggtg aagtggacga agcgctgata    960

gataaatcat gtctgcgggt gctgcggctg aaaaacaagc tgggcttgtt tgaagatccc   1020

ttccgcggag cggatgcagt tgcagagaag gccaatgtac tcagtgcgga gcaccgtgca   1080

cttgcccgaa aggctgtgac ggaaagtctt gtcctgctga agaatgacgg tgacgaagaa   1140

aagctcctgc ccctgaaaag aggggaaagg ctcgcctttg ttggcccgta tgcacagagc   1200

agggaactgc acagcatgtg ggcgattgca ggcagagagg atgactgcgt atctgtgcgc   1260

atggcggcgg aggagatcgc cgcagagcag ggattttgtc cggaattcgc ttccggtgcg   1320

gtgatgaccc gtcgggaaga cctcgcaacc gatagccggg agattgccgt tgaacaggtg   1380

cgccacggtg cgtatcagcc agctcttgcc gccaatgcgg agtcagaaaa actgcttgcc   1440

gaagcggttg atgtggcgaa gcacgccgac aaggtaatcg tatgtatcgg agaacaccgc   1500

cgtatgagcg gggaaggtgc aagccgcgcc gatatcagca ttcccgcacc acagctggaa   1560

ctgctggaaa aagtgtatgc cgcgaatcag aatatcgtaa ccgtggtatt cggcgggagg   1620

ccgcttgacc tgagaaaggt ctgtatgtgc agtaaagcgg tcctgtttgc ctggatgccc   1680

ggtacggaag gcggacacgg cattatggat gtactgacag gaaagtgttg cccctccggt   1740

gcactgagtg tgacgatgcc gtactgtgtg ggacaggcgc cgatctgcta taaccactac   1800

agcacaggcc gtgcaaaacc gtatgacacg gactatccga tctttatctc cgcttatatc   1860

gatgtaccga cgggtccgct gttccccttc gggtacggcc tcagctatac ggagtttgcg   1920

gtctcccctg tacagcttac gaaggctgcc gccgaaagct ccggcggcgt gctgacggaa   1980

gcgcaggtaa acgtaacaaa ctgcggtgaa cgggagggca cagcaatcgt gcagctttat   2040

atccgtgatg aagtgtcttc actgatccgc ccggtaaggg aactgaaagg gtatcagcgg   2100

attccgcttg catcggggga aatgaagcag gtcacgtttg aaattacgga agaaatgctg   2160

gcatttgtaa atgcagataa tgtatttgcg gcagagccgg gtaactttac ggtatatatc   2220

gggttggatt ccgatacaga caatgcagca acgtttaccc tgatataa                2268

<210> 404
<211> 755
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (76)...(298)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (370)...(638)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (44)...(47)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (674)...(677)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (746)...(749)
<223> N-glycosylation site. Prosite id = PS00001

<400> 404
Met Thr Asp Glu Lys Leu Gln Ala Leu Leu Lys Asp Met Ser Leu Lys 
1               5                   10                  15      


Glu Lys Ile Phe Gln Leu Val Gln Ile Pro Gly Arg Phe Phe Leu Glu 
            20                  25                  30          


Asn Thr Glu Asp Thr Gly Thr Ala Val Glu Glu Asn Leu Thr Ala Glu 
        35                  40                  45              


Glu Leu His Leu Thr Gly Cys Thr Leu Ser Val Tyr Gly Ala Glu Glu 
    50                  55                  60                  


Leu Arg Lys Ile Gln Gln Asp Cys Val Lys Asn His Pro His His Ile 
65                  70                  75                  80  


Pro Ile Leu Phe Met Ala Asp Ile Ile His Gly Tyr Arg Thr Ile Tyr 
                85                  90                  95      


Pro Met Pro Leu Ala Gln Gly Ala Met Phe Asp Asp Ser Leu Thr Glu 
            100                 105                 110         


Glu Leu Cys Ala Met Ala Ala Glu Glu Met Arg Leu Gly Gly Val His 
        115                 120                 125             


Val Thr Phe Ala Pro Met Leu Asp Leu Val Arg Asp Ala Arg Trp Gly 
    130                 135                 140                 


Arg Ile Met Glu Ser Thr Gly Glu Asp Pro Met Leu Asn Cys Arg Leu 
145                 150                 155                 160 


Gly Lys Ala Thr Val Arg Gly Phe Arg Ala Gly Glu Asn Lys Ile Pro 
                165                 170                 175     


Gly Glu Lys Gly Val Ala Ser Cys Ile Lys His Phe Ala Ala Tyr Gly 
            180                 185                 190         


Ala Ala Glu Ala Gly Arg Asp Tyr Thr Asn Thr Glu Val Ser Glu His 
        195                 200                 205             


Thr Leu Arg Glu Tyr Tyr Leu Lys Ser Tyr Lys Ala Ala Ile Asp Ala 
    210                 215                 220                 


Gly Ala Asp Met Val Met Thr Ser Phe Asn Thr Leu Asn Gly Met Pro 
225                 230                 235                 240 


Ala Thr Gly Asn Arg Trp Leu Met Lys Gln Val Leu Arg Asp Glu Trp 
                245                 250                 255     


Gly Phe Asp Gly Val Leu Ile Ser Asp Trp Gly Ala Ile Gly Glu Met 
            260                 265                 270         


Val Gln His Gly Val Cys Arg Asp Leu Arg Glu Ala Ala Lys Leu Ala 
        275                 280                 285             


Ile Glu Asn Gly Val Asp Val Asp Met Cys Ser Leu Ala Tyr Ala Arg 
    290                 295                 300                 


Tyr Leu Glu Glu Leu Val Val Ser Gly Glu Val Asp Glu Ala Leu Ile 
305                 310                 315                 320 


Asp Lys Ser Cys Leu Arg Val Leu Arg Leu Lys Asn Lys Leu Gly Leu 
                325                 330                 335     


Phe Glu Asp Pro Phe Arg Gly Ala Asp Ala Val Ala Glu Lys Ala Asn 
            340                 345                 350         


Val Leu Ser Ala Glu His Arg Ala Leu Ala Arg Lys Ala Val Thr Glu 
        355                 360                 365             


Ser Leu Val Leu Leu Lys Asn Asp Gly Asp Glu Glu Lys Leu Leu Pro 
    370                 375                 380                 


Leu Lys Arg Gly Glu Arg Leu Ala Phe Val Gly Pro Tyr Ala Gln Ser 
385                 390                 395                 400 


Arg Glu Leu His Ser Met Trp Ala Ile Ala Gly Arg Glu Asp Asp Cys 
                405                 410                 415     


Val Ser Val Arg Met Ala Ala Glu Glu Ile Ala Ala Glu Gln Gly Phe 
            420                 425                 430         


Cys Pro Glu Phe Ala Ser Gly Ala Val Met Thr Arg Arg Glu Asp Leu 
        435                 440                 445             


Ala Thr Asp Ser Arg Glu Ile Ala Val Glu Gln Val Arg His Gly Ala 
    450                 455                 460                 


Tyr Gln Pro Ala Leu Ala Ala Asn Ala Glu Ser Glu Lys Leu Leu Ala 
465                 470                 475                 480 


Glu Ala Val Asp Val Ala Lys His Ala Asp Lys Val Ile Val Cys Ile 
                485                 490                 495     


Gly Glu His Arg Arg Met Ser Gly Glu Gly Ala Ser Arg Ala Asp Ile 
            500                 505                 510         


Ser Ile Pro Ala Pro Gln Leu Glu Leu Leu Glu Lys Val Tyr Ala Ala 
        515                 520                 525             


Asn Gln Asn Ile Val Thr Val Val Phe Gly Gly Arg Pro Leu Asp Leu 
    530                 535                 540                 


Arg Lys Val Cys Met Cys Ser Lys Ala Val Leu Phe Ala Trp Met Pro 
545                 550                 555                 560 


Gly Thr Glu Gly Gly His Gly Ile Met Asp Val Leu Thr Gly Lys Cys 
                565                 570                 575     


Cys Pro Ser Gly Ala Leu Ser Val Thr Met Pro Tyr Cys Val Gly Gln 
            580                 585                 590         


Ala Pro Ile Cys Tyr Asn His Tyr Ser Thr Gly Arg Ala Lys Pro Tyr 
        595                 600                 605             


Asp Thr Asp Tyr Pro Ile Phe Ile Ser Ala Tyr Ile Asp Val Pro Thr 
    610                 615                 620                 


Gly Pro Leu Phe Pro Phe Gly Tyr Gly Leu Ser Tyr Thr Glu Phe Ala 
625                 630                 635                 640 


Val Ser Pro Val Gln Leu Thr Lys Ala Ala Ala Glu Ser Ser Gly Gly 
                645                 650                 655     


Val Leu Thr Glu Ala Gln Val Asn Val Thr Asn Cys Gly Glu Arg Glu 
            660                 665                 670         


Gly Thr Ala Ile Val Gln Leu Tyr Ile Arg Asp Glu Val Ser Ser Leu 
        675                 680                 685             


Ile Arg Pro Val Arg Glu Leu Lys Gly Tyr Gln Arg Ile Pro Leu Ala 
    690                 695                 700                 


Ser Gly Glu Met Lys Gln Val Thr Phe Glu Ile Thr Glu Glu Met Leu 
705                 710                 715                 720 


Ala Phe Val Asn Ala Asp Asn Val Phe Ala Ala Glu Pro Gly Asn Phe 
                725                 730                 735     


Thr Val Tyr Ile Gly Leu Asp Ser Asp Thr Asp Asn Ala Ala Thr Phe 
            740                 745                 750         


Thr Leu Ile 
        755 


<210> 405
<211> 2127
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 405
atgagcgagg aaacgggcaa gacgaaccgg acccctcggt acacggggaa ggcaagctcc     60

ctcccctcct ggaaatccat cctgcgggcg gaaggagcgg aaagggagga actcatccgg    120

gagctggtcc aaggcatgac cctgcggcaa aaggtcaacc agatggccgg tcgcatgacc    180

ctgttggaga tggtcatctc catcttccgc tacaactaca agcccttccg agcgggagag    240

gaccgtgggc tggggatccc tcccctgcgc ttcaccgacg gcccgcgagg agtagcgctg    300

gggaattcca cctgctttcc ggtatccatg gcccgggcgg cgacctggga cgtcgagttg    360

gaggaacgga tagccgaggc catgggcgtg gaagcccgtt cacagggggc cgacttcttc    420

ggcggggtgt gcataaacct cctccgccat ccgggatggg gaagggcgca ggagaccttc    480

ggggaggacc cttacctcct gggagagatg ggggcggcca tggtcagggg tgcccagaaa    540

cacctcatgg cctgcatcaa gcacttcgcc tgcaacagca tcgaggaatc ccgcttctac    600

gtggacgtgc gcgtcgacga acgcaccctg cgggaggtgt acctccccca tttccgacgc    660

tgcgtggagg aaggggcggc cgccgtgatg agcgcctaca atcgggtcaa cggcgaatac    720

tgcgcccaca acgctcacct gctgcgggac atcctcaaag acgagtgggg tttccgggga    780

ctggtgatgt ccgatttcgt cttcgggacg cgggacacgg tcaaggcggc ctggggcggg    840

ctggacgtcg aaatgcccca agcctggttt ttcgggaaga aactgatccg tgcggtgcgc    900

cggggagtgg tccccgagtc cttgatcgac gaggcggtca cccgcattct acgtcagaag    960

gccaggttcg ccgccctgga ccagggggat tacggacctc atcgggtggc ctgttccgaa   1020

cacgccgagc tggccctgga ggcggcccgg aagagtatcg tcctgctgaa aaacgcggac   1080

ggcatcctac ccttggatcc tggaaagata aggaaattag cggtcatcgg gaaactggcc   1140

gacctgccca acatcgggga tcgaggctcc agccgcgtgc gtccccccta cgtggtgacc   1200

atcctgcagg gactgcgcaa ccgcgccggc tcttccctgg agattatcta ccgggacggg   1260

tcggaccttg aagaggcccg cgaggcggcg cgcaccgccg acgcctgcgt ggcggtggtg   1320

ggcttaacct cccgggacga gggggaggcc atacccggac cggttaagct cggcggagac   1380

cgcgaggacc tctccctgcg cccccgggac atcgcccttg tggaagaagt tgccacagtg   1440

aacccgcgct gcatcgtggt cctggaggga ggctcggccg tcctcaccgc gggatggcgg   1500

gacaaggcgg cggctgtcct catggcctgg tacccgggca tggagggggg caacgcagtg   1560

gcggacatca ttttcggcct ggcgaacccc agcggtaaac tgccggtgac cttcccggaa   1620

agcaacgacc agctaccctt cttcgacaag aaggcgggca gcatcgagta cggttactac   1680

cacggctatc gtcttttcga caaggaaggg atgcgccccg cttttccctt cggcttcgga   1740

ctcagttaca cctcttaccg gtatcggaac ctgtgcctga gcgcggagga gatgaccccg   1800

gagggcgcca tcctggtgga ggcggagatc gtcaacgccg gcagcatggc cggcgacgag   1860

gtggtccagc tttatgtggg ctatccctcc tcgaccgtgg accgtccggt gaaggaatta   1920

aaaggattcg cccgcgtgca cctggaaccc ggagagagca agcgggtctc cttccctctg   1980

cgggctgcag acctggccta ctacgacgtg gaaagggggg cctgggtggt ggaggaaacg   2040

gaatacgagg tgttggtggg ttcctcctcc gaccccgggg acctgcacct gaggggctct   2100

ttccgcgtga caggctccac ccgatga                                       2127

<210> 406
<211> 708
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (78)...(286)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (353)...(585)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (103)...(106)
<223> N-glycosylation site. Prosite id = PS00001

<400> 406
Met Ser Glu Glu Thr Gly Lys Thr Asn Arg Thr Pro Arg Tyr Thr Gly 
1               5                   10                  15      


Lys Ala Ser Ser Leu Pro Ser Trp Lys Ser Ile Leu Arg Ala Glu Gly 
            20                  25                  30          


Ala Glu Arg Glu Glu Leu Ile Arg Glu Leu Val Gln Gly Met Thr Leu 
        35                  40                  45              


Arg Gln Lys Val Asn Gln Met Ala Gly Arg Met Thr Leu Leu Glu Met 
    50                  55                  60                  


Val Ile Ser Ile Phe Arg Tyr Asn Tyr Lys Pro Phe Arg Ala Gly Glu 
65                  70                  75                  80  


Asp Arg Gly Leu Gly Ile Pro Pro Leu Arg Phe Thr Asp Gly Pro Arg 
                85                  90                  95      


Gly Val Ala Leu Gly Asn Ser Thr Cys Phe Pro Val Ser Met Ala Arg 
            100                 105                 110         


Ala Ala Thr Trp Asp Val Glu Leu Glu Glu Arg Ile Ala Glu Ala Met 
        115                 120                 125             


Gly Val Glu Ala Arg Ser Gln Gly Ala Asp Phe Phe Gly Gly Val Cys 
    130                 135                 140                 


Ile Asn Leu Leu Arg His Pro Gly Trp Gly Arg Ala Gln Glu Thr Phe 
145                 150                 155                 160 


Gly Glu Asp Pro Tyr Leu Leu Gly Glu Met Gly Ala Ala Met Val Arg 
                165                 170                 175     


Gly Ala Gln Lys His Leu Met Ala Cys Ile Lys His Phe Ala Cys Asn 
            180                 185                 190         


Ser Ile Glu Glu Ser Arg Phe Tyr Val Asp Val Arg Val Asp Glu Arg 
        195                 200                 205             


Thr Leu Arg Glu Val Tyr Leu Pro His Phe Arg Arg Cys Val Glu Glu 
    210                 215                 220                 


Gly Ala Ala Ala Val Met Ser Ala Tyr Asn Arg Val Asn Gly Glu Tyr 
225                 230                 235                 240 


Cys Ala His Asn Ala His Leu Leu Arg Asp Ile Leu Lys Asp Glu Trp 
                245                 250                 255     


Gly Phe Arg Gly Leu Val Met Ser Asp Phe Val Phe Gly Thr Arg Asp 
            260                 265                 270         


Thr Val Lys Ala Ala Trp Gly Gly Leu Asp Val Glu Met Pro Gln Ala 
        275                 280                 285             


Trp Phe Phe Gly Lys Lys Leu Ile Arg Ala Val Arg Arg Gly Val Val 
    290                 295                 300                 


Pro Glu Ser Leu Ile Asp Glu Ala Val Thr Arg Ile Leu Arg Gln Lys 
305                 310                 315                 320 


Ala Arg Phe Ala Ala Leu Asp Gln Gly Asp Tyr Gly Pro His Arg Val 
                325                 330                 335     


Ala Cys Ser Glu His Ala Glu Leu Ala Leu Glu Ala Ala Arg Lys Ser 
            340                 345                 350         


Ile Val Leu Leu Lys Asn Ala Asp Gly Ile Leu Pro Leu Asp Pro Gly 
        355                 360                 365             


Lys Ile Arg Lys Leu Ala Val Ile Gly Lys Leu Ala Asp Leu Pro Asn 
    370                 375                 380                 


Ile Gly Asp Arg Gly Ser Ser Arg Val Arg Pro Pro Tyr Val Val Thr 
385                 390                 395                 400 


Ile Leu Gln Gly Leu Arg Asn Arg Ala Gly Ser Ser Leu Glu Ile Ile 
                405                 410                 415     


Tyr Arg Asp Gly Ser Asp Leu Glu Glu Ala Arg Glu Ala Ala Arg Thr 
            420                 425                 430         


Ala Asp Ala Cys Val Ala Val Val Gly Leu Thr Ser Arg Asp Glu Gly 
        435                 440                 445             


Glu Ala Ile Pro Gly Pro Val Lys Leu Gly Gly Asp Arg Glu Asp Leu 
    450                 455                 460                 


Ser Leu Arg Pro Arg Asp Ile Ala Leu Val Glu Glu Val Ala Thr Val 
465                 470                 475                 480 


Asn Pro Arg Cys Ile Val Val Leu Glu Gly Gly Ser Ala Val Leu Thr 
                485                 490                 495     


Ala Gly Trp Arg Asp Lys Ala Ala Ala Val Leu Met Ala Trp Tyr Pro 
            500                 505                 510         


Gly Met Glu Gly Gly Asn Ala Val Ala Asp Ile Ile Phe Gly Leu Ala 
        515                 520                 525             


Asn Pro Ser Gly Lys Leu Pro Val Thr Phe Pro Glu Ser Asn Asp Gln 
    530                 535                 540                 


Leu Pro Phe Phe Asp Lys Lys Ala Gly Ser Ile Glu Tyr Gly Tyr Tyr 
545                 550                 555                 560 


His Gly Tyr Arg Leu Phe Asp Lys Glu Gly Met Arg Pro Ala Phe Pro 
                565                 570                 575     


Phe Gly Phe Gly Leu Ser Tyr Thr Ser Tyr Arg Tyr Arg Asn Leu Cys 
            580                 585                 590         


Leu Ser Ala Glu Glu Met Thr Pro Glu Gly Ala Ile Leu Val Glu Ala 
        595                 600                 605             


Glu Ile Val Asn Ala Gly Ser Met Ala Gly Asp Glu Val Val Gln Leu 
    610                 615                 620                 


Tyr Val Gly Tyr Pro Ser Ser Thr Val Asp Arg Pro Val Lys Glu Leu 
625                 630                 635                 640 


Lys Gly Phe Ala Arg Val His Leu Glu Pro Gly Glu Ser Lys Arg Val 
                645                 650                 655     


Ser Phe Pro Leu Arg Ala Ala Asp Leu Ala Tyr Tyr Asp Val Glu Arg 
            660                 665                 670         


Gly Ala Trp Val Val Glu Glu Thr Glu Tyr Glu Val Leu Val Gly Ser 
        675                 680                 685             


Ser Ser Asp Pro Gly Asp Leu His Leu Arg Gly Ser Phe Arg Val Thr 
    690                 695                 700                 


Gly Ser Thr Arg 
705             


<210> 407
<211> 1308
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 407
atggcatttc cgaaagattt tctatggggc gtcgcagcct ccagctatca gattgaaggc     60

ggcgcgctga cggaagggcg gggcgagtgc atctggtaca acttttcgca cacgcccggt    120

aaaacgcagg atggcgacac cggcgacgtg gcctgtgacc atctgcacct gtataaggaa    180

gatgtgaagc tgatggccga cctgggggtg caggcctacc gattttcggt ggcgtggccg    240

cgcgtgatcc ccgaagggac gggccggatc aacgaacaag gactggactt ttatgacagg    300

ctggtggacg aactgctcaa gtacggcatc cagccctggc tgacgctcaa tcactgggac    360

tatccgcagg cactccagaa taagggcggt tgggcgaacc cggacagcgt ggaatggttc    420

acggaataca ccaacgtgat gacccgcgcg ctgggggatc gtgttcgcgg ctggattacg    480

cataatgaac cctggtgcgt ctcgatcctt tcgaatttgc tgggcatgca tgcccccggt    540

ctgaatgacg cgccaacggc ttaccgcgtg gcgcaccacc tgaacctggc gcacggctcg    600

gcgatggccg tgatccgcca gaactgtcct ggcgtcccgg cggggattac actgaacctg    660

tcacctgccg tgccttcgac cgatagtgag gaagatcaac aggcagcaca gttctttgac    720

gcgaccttca accgctggtt cctcgatccg gtactgaaag gtagctatcc ggcggacggg    780

atcgcgatgc tgagcgccgc gctggaaggc atcgatctgg acgcggttca ggcggccaac    840

gtaccaatgg atttcctggg catcaatttc tacaaccgca atctcatctc ggcgagcggc    900

agtcccaagt tccccgacaa tgcggaattc actgaaatgg gctgggaaat ttatccgcaa    960

gccctgaccg acctgctggt gcgggtgtcc cgcgattatg cgccgcccgc gatttacatc   1020

acggaaaatg gcgcggcgtt tgccgatcct gacccggtgg acggcattgt cgaagaccca   1080

cgccgcgtcg aatacctgaa agcccatttc caggccgccg agaacgcgat tatccagggc   1140

gtgcccctca aaggctattt cgtgtggagc ctgatggaca actttgagtg gtcgttcggc   1200

tacagcaaac gcttcggcat catccatgtc gattatgcga cgcagaagcg gacgcccaag   1260

cggagcgctc ggttttatca agaaatgatc gcgcggcagc ctgtgtag                1308

<210> 408
<211> 435
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(435)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (7)...(21)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (34)...(37)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (343)...(351)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 408
Met Ala Phe Pro Lys Asp Phe Leu Trp Gly Val Ala Ala Ser Ser Tyr 
1               5                   10                  15      


Gln Ile Glu Gly Gly Ala Leu Thr Glu Gly Arg Gly Glu Cys Ile Trp 
            20                  25                  30          


Tyr Asn Phe Ser His Thr Pro Gly Lys Thr Gln Asp Gly Asp Thr Gly 
        35                  40                  45              


Asp Val Ala Cys Asp His Leu His Leu Tyr Lys Glu Asp Val Lys Leu 
    50                  55                  60                  


Met Ala Asp Leu Gly Val Gln Ala Tyr Arg Phe Ser Val Ala Trp Pro 
65                  70                  75                  80  


Arg Val Ile Pro Glu Gly Thr Gly Arg Ile Asn Glu Gln Gly Leu Asp 
                85                  90                  95      


Phe Tyr Asp Arg Leu Val Asp Glu Leu Leu Lys Tyr Gly Ile Gln Pro 
            100                 105                 110         


Trp Leu Thr Leu Asn His Trp Asp Tyr Pro Gln Ala Leu Gln Asn Lys 
        115                 120                 125             


Gly Gly Trp Ala Asn Pro Asp Ser Val Glu Trp Phe Thr Glu Tyr Thr 
    130                 135                 140                 


Asn Val Met Thr Arg Ala Leu Gly Asp Arg Val Arg Gly Trp Ile Thr 
145                 150                 155                 160 


His Asn Glu Pro Trp Cys Val Ser Ile Leu Ser Asn Leu Leu Gly Met 
                165                 170                 175     


His Ala Pro Gly Leu Asn Asp Ala Pro Thr Ala Tyr Arg Val Ala His 
            180                 185                 190         


His Leu Asn Leu Ala His Gly Ser Ala Met Ala Val Ile Arg Gln Asn 
        195                 200                 205             


Cys Pro Gly Val Pro Ala Gly Ile Thr Leu Asn Leu Ser Pro Ala Val 
    210                 215                 220                 


Pro Ser Thr Asp Ser Glu Glu Asp Gln Gln Ala Ala Gln Phe Phe Asp 
225                 230                 235                 240 


Ala Thr Phe Asn Arg Trp Phe Leu Asp Pro Val Leu Lys Gly Ser Tyr 
                245                 250                 255     


Pro Ala Asp Gly Ile Ala Met Leu Ser Ala Ala Leu Glu Gly Ile Asp 
            260                 265                 270         


Leu Asp Ala Val Gln Ala Ala Asn Val Pro Met Asp Phe Leu Gly Ile 
        275                 280                 285             


Asn Phe Tyr Asn Arg Asn Leu Ile Ser Ala Ser Gly Ser Pro Lys Phe 
    290                 295                 300                 


Pro Asp Asn Ala Glu Phe Thr Glu Met Gly Trp Glu Ile Tyr Pro Gln 
305                 310                 315                 320 


Ala Leu Thr Asp Leu Leu Val Arg Val Ser Arg Asp Tyr Ala Pro Pro 
                325                 330                 335     


Ala Ile Tyr Ile Thr Glu Asn Gly Ala Ala Phe Ala Asp Pro Asp Pro 
            340                 345                 350         


Val Asp Gly Ile Val Glu Asp Pro Arg Arg Val Glu Tyr Leu Lys Ala 
        355                 360                 365             


His Phe Gln Ala Ala Glu Asn Ala Ile Ile Gln Gly Val Pro Leu Lys 
    370                 375                 380                 


Gly Tyr Phe Val Trp Ser Leu Met Asp Asn Phe Glu Trp Ser Phe Gly 
385                 390                 395                 400 


Tyr Ser Lys Arg Phe Gly Ile Ile His Val Asp Tyr Ala Thr Gln Lys 
                405                 410                 415     


Arg Thr Pro Lys Arg Ser Ala Arg Phe Tyr Gln Glu Met Ile Ala Arg 
            420                 425                 430         


Gln Pro Val 
        435 


<210> 409
<211> 1404
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 409
atgagcatgc aaagatttcc cgaaaacttc acttggggcg ccgccactgc ctcgtaccag     60

attgaaggag gcgcccaagc cgacggacga gccccttccg tttgggacat gatgtgccgc    120

tgggagggca aagttaactc cggccattcc ggcgacgtcg cctgtgacca ctatcaccga    180

ttccgagaag acgtccgact gatgggcgag cttgggcttc ggtcctatcg tttctcattt    240

gcctggcccc gtgtgatccc caacgggact ggcacggtca acgaagccgg actaggcttc    300

tacgaccaac tgatcgacgc tcttctcgaa cagggcatcg aaccgtttcc cactctgttc    360

cattgggact acccgcttgc cctcttcaat cgtggcggat ggctcaatcc cgactcgccg    420

aagtggtttg ccgactacac gtcggttctc gtcgatcgat tttcggaccg catttccaag    480

tggttcaccc taaatgagcc gtcctgcttc cttggcctgg gacacgtcac cggcacccac    540

gccccgggcc tcaagctcga ctatcccgag tttttcctcg gcgtcaagca tgcgatgatg    600

tctcacggaa cggcgggcca ggtgatccgg agccaaagca aaactccaaa ccctcacgtc    660

agcatcgcgc cggtcagctc gcttggcgtc ccggtcgatg attccgcaga gaacatcgtc    720

gccgcgcggg agtacacctt tggcgagccg cactcggacc gaggtttctg gcacccggcg    780

atctacctcg accccgtctg caaaggcgtc tggccggagt cgatcgagcg attcctcagc    840

tcgaggccaa ttcccgtctc ggcggacgac ctgaagacga tgcatcaggt gccggattcg    900

atcgggctta actactacag cgcggtccgc gtccagtcga tgcccgacgg ttcgattcgg    960

tcgctcctgc acgtccccgg ccacccgcga accggcttcg actggcccgt cgttcccgaa   1020

ggcatgtatt ggtcgatccg gttccaccac gagcggtacg gacttccgtg ctacatcacc   1080

gaaaacggcc tttcgggaat cgattgggtc gccgaggacg gcggcgtcca cgatccccag   1140

cggatcgatt acaccgcccg ccacctgaaa gagcttcttc gcgcccacca tgacggccac   1200

cccgtcctgg gctacttcca ttggtcgctg ctggacaact tcgaatgggc ggaagggtat   1260

cgccaccggt tcggcctcat ccacgtcgac tacgaaaccc aaaagcggac gatcaaggat   1320

tcaggccact ggtatcgaag agtcgccgat tcgaacggct cgatcctaac cgacccccaa   1380

ggcgacccgg tctcgacccg ctaa                                          1404

<210> 410
<211> 467
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(454)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (9)...(12)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (10)...(24)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (89)...(92)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (236)...(248)
<223> EF-hand calcium-binding domain. Prosite id = PS00018

<220> 
<221> SITE
<222> (362)...(370)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<220> 
<221> SITE
<222> (458)...(461)
<223> N-glycosylation site. Prosite id = PS00001

<400> 410
Met Ser Met Gln Arg Phe Pro Glu Asn Phe Thr Trp Gly Ala Ala Thr 
1               5                   10                  15      


Ala Ser Tyr Gln Ile Glu Gly Gly Ala Gln Ala Asp Gly Arg Ala Pro 
            20                  25                  30          


Ser Val Trp Asp Met Met Cys Arg Trp Glu Gly Lys Val Asn Ser Gly 
        35                  40                  45              


His Ser Gly Asp Val Ala Cys Asp His Tyr His Arg Phe Arg Glu Asp 
    50                  55                  60                  


Val Arg Leu Met Gly Glu Leu Gly Leu Arg Ser Tyr Arg Phe Ser Phe 
65                  70                  75                  80  


Ala Trp Pro Arg Val Ile Pro Asn Gly Thr Gly Thr Val Asn Glu Ala 
                85                  90                  95      


Gly Leu Gly Phe Tyr Asp Gln Leu Ile Asp Ala Leu Leu Glu Gln Gly 
            100                 105                 110         


Ile Glu Pro Phe Pro Thr Leu Phe His Trp Asp Tyr Pro Leu Ala Leu 
        115                 120                 125             


Phe Asn Arg Gly Gly Trp Leu Asn Pro Asp Ser Pro Lys Trp Phe Ala 
    130                 135                 140                 


Asp Tyr Thr Ser Val Leu Val Asp Arg Phe Ser Asp Arg Ile Ser Lys 
145                 150                 155                 160 


Trp Phe Thr Leu Asn Glu Pro Ser Cys Phe Leu Gly Leu Gly His Val 
                165                 170                 175     


Thr Gly Thr His Ala Pro Gly Leu Lys Leu Asp Tyr Pro Glu Phe Phe 
            180                 185                 190         


Leu Gly Val Lys His Ala Met Met Ser His Gly Thr Ala Gly Gln Val 
        195                 200                 205             


Ile Arg Ser Gln Ser Lys Thr Pro Asn Pro His Val Ser Ile Ala Pro 
    210                 215                 220                 


Val Ser Ser Leu Gly Val Pro Val Asp Asp Ser Ala Glu Asn Ile Val 
225                 230                 235                 240 


Ala Ala Arg Glu Tyr Thr Phe Gly Glu Pro His Ser Asp Arg Gly Phe 
                245                 250                 255     


Trp His Pro Ala Ile Tyr Leu Asp Pro Val Cys Lys Gly Val Trp Pro 
            260                 265                 270         


Glu Ser Ile Glu Arg Phe Leu Ser Ser Arg Pro Ile Pro Val Ser Ala 
        275                 280                 285             


Asp Asp Leu Lys Thr Met His Gln Val Pro Asp Ser Ile Gly Leu Asn 
    290                 295                 300                 


Tyr Tyr Ser Ala Val Arg Val Gln Ser Met Pro Asp Gly Ser Ile Arg 
305                 310                 315                 320 


Ser Leu Leu His Val Pro Gly His Pro Arg Thr Gly Phe Asp Trp Pro 
                325                 330                 335     


Val Val Pro Glu Gly Met Tyr Trp Ser Ile Arg Phe His His Glu Arg 
            340                 345                 350         


Tyr Gly Leu Pro Cys Tyr Ile Thr Glu Asn Gly Leu Ser Gly Ile Asp 
        355                 360                 365             


Trp Val Ala Glu Asp Gly Gly Val His Asp Pro Gln Arg Ile Asp Tyr 
    370                 375                 380                 


Thr Ala Arg His Leu Lys Glu Leu Leu Arg Ala His His Asp Gly His 
385                 390                 395                 400 


Pro Val Leu Gly Tyr Phe His Trp Ser Leu Leu Asp Asn Phe Glu Trp 
                405                 410                 415     


Ala Glu Gly Tyr Arg His Arg Phe Gly Leu Ile His Val Asp Tyr Glu 
            420                 425                 430         


Thr Gln Lys Arg Thr Ile Lys Asp Ser Gly His Trp Tyr Arg Arg Val 
        435                 440                 445             


Ala Asp Ser Asn Gly Ser Ile Leu Thr Asp Pro Gln Gly Asp Pro Val 
    450                 455                 460                 


Ser Thr Arg 
465         


<210> 411
<211> 2553
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 411
atgtcatgtt ttgctaaacg ttttacccct aaactgctga ctgtgctgac tacctttatc     60

gccatggcgt gttttgccgg cgggcaggct cccggcaact ggccgcgtgt gcagagtgcg    120

ttgcaacccg atccccttat cgagcaacaa attgattctt tgttggctaa gatgaccatt    180

gagcaaaagg ttgctcagtt gatccaaccg gaaattggct atctgaccgt tgcgcaaatg    240

cgccaatacg gttttggctc ttatctcaat ggcggtaata ctgcaccctt tggcaacaag    300

cgggctgagg tagcgacctg gttgcagtta gctgatgcga tgtatctggc atcggtagac    360

agtagcctgg atggtagcag tattccaacc atctggggca ctgatgccat gcacggccac    420

agtaatgttt atggcgcaac tttatttccg cataacattg gcctgggtgc ggcccgggat    480

ccagatctaa tccgccagat cggcgaagcc acggcgaaag aagttgcagt aaccggcatt    540

gaatggacct ttgcaccgac ggtggctgtg gtgcgcgatg atcgctgggg gcgcacttat    600

gaaagctact cagaggatcc tgccattgtt gctgagtatg ccggccctat ggtcagtggt    660

ttgcaagggg aaattggcga tcatttttta cacgggcatt atcggatcgc taccgctaag    720

cattttatcg gtgatggcgg tacggaaaat ggtctggatc gcggagatac cttactggat    780

gaaaagcgcc tgcgcgaaat ccatgctgcg ggttactaca cggcgatcgc tgccggcgtg    840

cagtcggtga tggcctcgtt taacagttgg aatggtaagc gggtgcatgg tgatcattac    900

ctgttaaccg aggtcttgaa aaaccaaatg gggtttgacg gttttgtgat cagcgattgg    960

aatgcgcata agtttgttga tggttgcgat ctggagcagt gtgctgcggc ttttaatgct   1020

ggcgtagatg tgatgatggt accggaacat tttgaggctt tctatcacaa tacggtacag   1080

caagtgaaag acggactgat cccgatgacg cggctggatg atgcggtacg ccgcttttta   1140

cgcgccaaaa tccgctgggg gttgtttcag cgcggtaaac cttccagccg tcctgaatca   1200

ttacaaacgc agtggtttaa tgcacctgag catcgtgagt tagcacgacg ggcggtgcga   1260

caatcgctgg ttttgctgaa aaataaccgg cagttgctgc cgttaaaccc aaacagtcgg   1320

gtgctgattg ccggcgacgg cgccgataat attgctaaac aagccggtgg ctggagtgtt   1380

tcctggcaag gcaccgataa cagcaatgcc gatttcccga atgcgacctc catttatcag   1440

gggttacggc agcaaattct tgctgctggc ggtagcgttg aactgagtgt cgatggtcac   1500

tttacggaga aaccggatgt cgccatagtg gtgattggcg aagaacctta cgcggaatgg   1560

tatggtgata ttcagcggct ggaatatcag tacgacaata agcaggatct ggcgttacta   1620

aagcggttac aggctcaggc gatcccggta gttacggtgt tcttaagtgg ccgcccgctc   1680

tggataaata aggaactgaa cgcatctgac gcctttgttg cagcctggtt gccaggttcg   1740

gaagggcagg gtgttgccga tgtcttattg cgtgatagac agggtgagat ccagttcgac   1800

tttagcggaa aacttagctt ctcctggcca aaattcgacg atcagttttt actgaacgtg   1860

cacgataagg tctatgatcc actctttgct tacggttatg gcctgaccta cgctgatcag   1920

gtgcagcttg cccgagtaca tgaacaaacc agtccggcgt cgcaaacacc gactggcagc   1980

gggcaagctt tgtttgtgcg caatctggca gatggcctgc agtggcaact ggtcgacagc   2040

catatggata aactgacgac caccagttct gcagcagtga gtgctgatgg tcgtagtgta   2100

ttgatgcaat cggtaaatct ggcctatcag gaagatggtc gtaaatttgt ctggaatgcc   2160

gggcagcggc cggcctccgc ccgtttacag tacatcaagc cgcaagtaat gcccagacag   2220

cagagtgtgc aatggttgca aatgagtatc cggctggatc aagcgccaag cggcggggtg   2280

caattacagc tgctatgtca gcagcagaac tgtgtgcact cggcgtcctt attaccgctg   2340

ctcgctggtc tgaagcgggg acaatggtat aggatggcgt ggccgcttaa ttgcgcgggg   2400

cagcccgtta tggcggctag ccagccagca tcgggtctgg ttgaggatct aatacgttta   2460

aatgccagcg gcgagtttag tctggcgatc gccgaggtgg cgctggtgga acatactgcg   2520

gaagatgcac tgctacaggg gtgtcagcct tag                                2553

<210> 412
<211> 850
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(26)

<220> 
<221> DOMAIN
<222> (124)...(347)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (423)...(639)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (309)...(326)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (481)...(484)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (575)...(578)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (833)...(836)
<223> N-glycosylation site. Prosite id = PS00001

<400> 412
Met Ser Cys Phe Ala Lys Arg Phe Thr Pro Lys Leu Leu Thr Val Leu 
1               5                   10                  15      


Thr Thr Phe Ile Ala Met Ala Cys Phe Ala Gly Gly Gln Ala Pro Gly 
            20                  25                  30          


Asn Trp Pro Arg Val Gln Ser Ala Leu Gln Pro Asp Pro Leu Ile Glu 
        35                  40                  45              


Gln Gln Ile Asp Ser Leu Leu Ala Lys Met Thr Ile Glu Gln Lys Val 
    50                  55                  60                  


Ala Gln Leu Ile Gln Pro Glu Ile Gly Tyr Leu Thr Val Ala Gln Met 
65                  70                  75                  80  


Arg Gln Tyr Gly Phe Gly Ser Tyr Leu Asn Gly Gly Asn Thr Ala Pro 
                85                  90                  95      


Phe Gly Asn Lys Arg Ala Glu Val Ala Thr Trp Leu Gln Leu Ala Asp 
            100                 105                 110         


Ala Met Tyr Leu Ala Ser Val Asp Ser Ser Leu Asp Gly Ser Ser Ile 
        115                 120                 125             


Pro Thr Ile Trp Gly Thr Asp Ala Met His Gly His Ser Asn Val Tyr 
    130                 135                 140                 


Gly Ala Thr Leu Phe Pro His Asn Ile Gly Leu Gly Ala Ala Arg Asp 
145                 150                 155                 160 


Pro Asp Leu Ile Arg Gln Ile Gly Glu Ala Thr Ala Lys Glu Val Ala 
                165                 170                 175     


Val Thr Gly Ile Glu Trp Thr Phe Ala Pro Thr Val Ala Val Val Arg 
            180                 185                 190         


Asp Asp Arg Trp Gly Arg Thr Tyr Glu Ser Tyr Ser Glu Asp Pro Ala 
        195                 200                 205             


Ile Val Ala Glu Tyr Ala Gly Pro Met Val Ser Gly Leu Gln Gly Glu 
    210                 215                 220                 


Ile Gly Asp His Phe Leu His Gly His Tyr Arg Ile Ala Thr Ala Lys 
225                 230                 235                 240 


His Phe Ile Gly Asp Gly Gly Thr Glu Asn Gly Leu Asp Arg Gly Asp 
                245                 250                 255     


Thr Leu Leu Asp Glu Lys Arg Leu Arg Glu Ile His Ala Ala Gly Tyr 
            260                 265                 270         


Tyr Thr Ala Ile Ala Ala Gly Val Gln Ser Val Met Ala Ser Phe Asn 
        275                 280                 285             


Ser Trp Asn Gly Lys Arg Val His Gly Asp His Tyr Leu Leu Thr Glu 
    290                 295                 300                 


Val Leu Lys Asn Gln Met Gly Phe Asp Gly Phe Val Ile Ser Asp Trp 
305                 310                 315                 320 


Asn Ala His Lys Phe Val Asp Gly Cys Asp Leu Glu Gln Cys Ala Ala 
                325                 330                 335     


Ala Phe Asn Ala Gly Val Asp Val Met Met Val Pro Glu His Phe Glu 
            340                 345                 350         


Ala Phe Tyr His Asn Thr Val Gln Gln Val Lys Asp Gly Leu Ile Pro 
        355                 360                 365             


Met Thr Arg Leu Asp Asp Ala Val Arg Arg Phe Leu Arg Ala Lys Ile 
    370                 375                 380                 


Arg Trp Gly Leu Phe Gln Arg Gly Lys Pro Ser Ser Arg Pro Glu Ser 
385                 390                 395                 400 


Leu Gln Thr Gln Trp Phe Asn Ala Pro Glu His Arg Glu Leu Ala Arg 
                405                 410                 415     


Arg Ala Val Arg Gln Ser Leu Val Leu Leu Lys Asn Asn Arg Gln Leu 
            420                 425                 430         


Leu Pro Leu Asn Pro Asn Ser Arg Val Leu Ile Ala Gly Asp Gly Ala 
        435                 440                 445             


Asp Asn Ile Ala Lys Gln Ala Gly Gly Trp Ser Val Ser Trp Gln Gly 
    450                 455                 460                 


Thr Asp Asn Ser Asn Ala Asp Phe Pro Asn Ala Thr Ser Ile Tyr Gln 
465                 470                 475                 480 


Gly Leu Arg Gln Gln Ile Leu Ala Ala Gly Gly Ser Val Glu Leu Ser 
                485                 490                 495     


Val Asp Gly His Phe Thr Glu Lys Pro Asp Val Ala Ile Val Val Ile 
            500                 505                 510         


Gly Glu Glu Pro Tyr Ala Glu Trp Tyr Gly Asp Ile Gln Arg Leu Glu 
        515                 520                 525             


Tyr Gln Tyr Asp Asn Lys Gln Asp Leu Ala Leu Leu Lys Arg Leu Gln 
    530                 535                 540                 


Ala Gln Ala Ile Pro Val Val Thr Val Phe Leu Ser Gly Arg Pro Leu 
545                 550                 555                 560 


Trp Ile Asn Lys Glu Leu Asn Ala Ser Asp Ala Phe Val Ala Ala Trp 
                565                 570                 575     


Leu Pro Gly Ser Glu Gly Gln Gly Val Ala Asp Val Leu Leu Arg Asp 
            580                 585                 590         


Arg Gln Gly Glu Ile Gln Phe Asp Phe Ser Gly Lys Leu Ser Phe Ser 
        595                 600                 605             


Trp Pro Lys Phe Asp Asp Gln Phe Leu Leu Asn Val His Asp Lys Val 
    610                 615                 620                 


Tyr Asp Pro Leu Phe Ala Tyr Gly Tyr Gly Leu Thr Tyr Ala Asp Gln 
625                 630                 635                 640 


Val Gln Leu Ala Arg Val His Glu Gln Thr Ser Pro Ala Ser Gln Thr 
                645                 650                 655     


Pro Thr Gly Ser Gly Gln Ala Leu Phe Val Arg Asn Leu Ala Asp Gly 
            660                 665                 670         


Leu Gln Trp Gln Leu Val Asp Ser His Met Asp Lys Leu Thr Thr Thr 
        675                 680                 685             


Ser Ser Ala Ala Val Ser Ala Asp Gly Arg Ser Val Leu Met Gln Ser 
    690                 695                 700                 


Val Asn Leu Ala Tyr Gln Glu Asp Gly Arg Lys Phe Val Trp Asn Ala 
705                 710                 715                 720 


Gly Gln Arg Pro Ala Ser Ala Arg Leu Gln Tyr Ile Lys Pro Gln Val 
                725                 730                 735     


Met Pro Arg Gln Gln Ser Val Gln Trp Leu Gln Met Ser Ile Arg Leu 
            740                 745                 750         


Asp Gln Ala Pro Ser Gly Gly Val Gln Leu Gln Leu Leu Cys Gln Gln 
        755                 760                 765             


Gln Asn Cys Val His Ser Ala Ser Leu Leu Pro Leu Leu Ala Gly Leu 
    770                 775                 780                 


Lys Arg Gly Gln Trp Tyr Arg Met Ala Trp Pro Leu Asn Cys Ala Gly 
785                 790                 795                 800 


Gln Pro Val Met Ala Ala Ser Gln Pro Ala Ser Gly Leu Val Glu Asp 
                805                 810                 815     


Leu Ile Arg Leu Asn Ala Ser Gly Glu Phe Ser Leu Ala Ile Ala Glu 
            820                 825                 830         


Val Ala Leu Val Glu His Thr Ala Glu Asp Ala Leu Leu Gln Gly Cys 
        835                 840                 845             


Gln Pro 
    850 


<210> 413
<211> 2223
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 413
atgaaatatc tacgacccct atcagtattc ctgtgtctcg tcgtcgttct tgcgctgttg     60

ttatcgacgc cgcccagttc ggcgcaacgc agggatgaca tcgagggacg cgtaaacgcg    120

ctgctggccc agatgtcgct gagcgagaag ttgggccaac tccaacaact cgacggcgaa    180

ggaaatggaa acttccgccc cgaacacctc gagctcgctc gcaaaggtct gctcggttcg    240

acgctaaacg ttcgaggcgg cggacgtccg aatcaacttc agcgcgtcgc ggtggaacag    300

tcgcgtctga agattccgct gttgttcggt ttcgacacga ttcacggtta tcgcactatc    360

tttccgatac cactcgcgga agcggcgagt tgggagccgt cgctggcgga acgctccgct    420

agtattgcag cgaaagaggc gtacgccgcc ggcctgcgct ggacattcgc gccgatggtc    480

gacatcgcgc gcgatccgcg ttggggaaga atcaccgaag gcgccggtga agatccgttc    540

ctgggcgccg ccttcgcgcg agcgcgcgtg cgcggctttc aaggcgacga ctattcacaa    600

cccggcaaga tactcgcctg cgcgaaacat tgggtcgcct acggcgcggc tgaaggtgga    660

cgcgactaca acacgaccga gatgtctgaa cagacgctgc gctcgatcta tttcccgccg    720

ttcaaagcgg cggtggatgc gggcgtcgga acgttcatga gtgctttcaa cgcactaaac    780

ggcgtcccaa cttcggcgaa ccatttcacg ttgacgaaag tcctgcgcga tgagtggaag    840

tttagcggtt ttgtcgtcag cgactacacg tccgtgaagg agctcatcaa tcacggctac    900

gcggcaaacg ataaggaagc cgcgtggttt gcgttgaacg ccggcgtcga catggagatg    960

gtcagccgtc tgttcaatca acacggaacg gaactgctgc aagagcaaaa atggtcgccg   1020

gcaacgctcg acgaagcagt gcgcaggatt ctgcgaatca agtttcgcct cggccttttc   1080

gagcggcctt atgtcgacga atcgctggaa aagactgcgt acttaactgc cgaaagtcgc   1140

gcggtggctc gtgaagtcgc aagcaagtcg atggtgctgt tgaaaaacga gcgcgacacg   1200

ttgccactcg caaagacgat ccaatcgatt gctgtaattg gaccattggc tgacgacaaa   1260

cgctcgccgc tgggttggtg gtccggcgac ggccgcgatg aagacactgt cacgcctctt   1320

accgggatca agaacaaagt ctctgcacaa acgaaggtga catacgcaaa aggctgcgac   1380

gtgacgggcg attccgatgc agggtttgcg gaagcagtcg cagcggcgcg aaactctgac   1440

gtgacgatcg tcttcgtcgg tgaatcgaaa gacatggttg gcgaggccgc ttcgcgcgcg   1500

acgctcgatc ttccgggacg gcagatggac ctcgtgcggg aagtctcacg cgcaggcaaa   1560

cccacgattg tcgtgctggt gaacggccgg ccgccggcga tcggttggat tgtagataac   1620

gtcccggcga ttctcgaatc atggatgggt gggaccgaat ccggaaacgc gattgccgac   1680

gtgctcttcg gcgacgtgaa tccgggcggt aagctgccgg ttacgtttcc tcgtgttacc   1740

ggtcaggtgc cgattcatta caaccacctg aacacgggcc gtccacctga agcgaacaat   1800

cgttacacgt ccaagtattt cgacgcgccg tggacgccgc agtttccttt cggtttcggc   1860

ttgagcttca cgcagttcag gatctcgaac gtgggaatta gcgcgacgca gattggaccg   1920

gacggcacga ttcgcgtcac tgcggacgtt gagaatgtcg gtagacgcgc cggcgatgag   1980

gtggtccaac tctacgttcg cgacgttgcg gccagcatct cgcgccccgt gaaagaacta   2040

aaaggctttc aacgcgtaac tctgcaaccc gggcagaagc gaagcctgga gttcgttctc   2100

gggcccgaac atctcggctt ctacaatcgc gatatgaagt ttgtcgtcga acccggcgag   2160

tttcgcgtga tggtcggtgc gaactcgcag gacgtgatcg agaagacgtt tgcagtcaga   2220

taa                                                                 2223

<210> 414
<211> 740
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(28)

<220> 
<221> DOMAIN
<222> (97)...(321)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (391)...(625)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (227)...(230)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (278)...(295)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<400> 414
Met Lys Tyr Leu Arg Pro Leu Ser Val Phe Leu Cys Leu Val Val Val 
1               5                   10                  15      


Leu Ala Leu Leu Leu Ser Thr Pro Pro Ser Ser Ala Gln Arg Arg Asp 
            20                  25                  30          


Asp Ile Glu Gly Arg Val Asn Ala Leu Leu Ala Gln Met Ser Leu Ser 
        35                  40                  45              


Glu Lys Leu Gly Gln Leu Gln Gln Leu Asp Gly Glu Gly Asn Gly Asn 
    50                  55                  60                  


Phe Arg Pro Glu His Leu Glu Leu Ala Arg Lys Gly Leu Leu Gly Ser 
65                  70                  75                  80  


Thr Leu Asn Val Arg Gly Gly Gly Arg Pro Asn Gln Leu Gln Arg Val 
                85                  90                  95      


Ala Val Glu Gln Ser Arg Leu Lys Ile Pro Leu Leu Phe Gly Phe Asp 
            100                 105                 110         


Thr Ile His Gly Tyr Arg Thr Ile Phe Pro Ile Pro Leu Ala Glu Ala 
        115                 120                 125             


Ala Ser Trp Glu Pro Ser Leu Ala Glu Arg Ser Ala Ser Ile Ala Ala 
    130                 135                 140                 


Lys Glu Ala Tyr Ala Ala Gly Leu Arg Trp Thr Phe Ala Pro Met Val 
145                 150                 155                 160 


Asp Ile Ala Arg Asp Pro Arg Trp Gly Arg Ile Thr Glu Gly Ala Gly 
                165                 170                 175     


Glu Asp Pro Phe Leu Gly Ala Ala Phe Ala Arg Ala Arg Val Arg Gly 
            180                 185                 190         


Phe Gln Gly Asp Asp Tyr Ser Gln Pro Gly Lys Ile Leu Ala Cys Ala 
        195                 200                 205             


Lys His Trp Val Ala Tyr Gly Ala Ala Glu Gly Gly Arg Asp Tyr Asn 
    210                 215                 220                 


Thr Thr Glu Met Ser Glu Gln Thr Leu Arg Ser Ile Tyr Phe Pro Pro 
225                 230                 235                 240 


Phe Lys Ala Ala Val Asp Ala Gly Val Gly Thr Phe Met Ser Ala Phe 
                245                 250                 255     


Asn Ala Leu Asn Gly Val Pro Thr Ser Ala Asn His Phe Thr Leu Thr 
            260                 265                 270         


Lys Val Leu Arg Asp Glu Trp Lys Phe Ser Gly Phe Val Val Ser Asp 
        275                 280                 285             


Tyr Thr Ser Val Lys Glu Leu Ile Asn His Gly Tyr Ala Ala Asn Asp 
    290                 295                 300                 


Lys Glu Ala Ala Trp Phe Ala Leu Asn Ala Gly Val Asp Met Glu Met 
305                 310                 315                 320 


Val Ser Arg Leu Phe Asn Gln His Gly Thr Glu Leu Leu Gln Glu Gln 
                325                 330                 335     


Lys Trp Ser Pro Ala Thr Leu Asp Glu Ala Val Arg Arg Ile Leu Arg 
            340                 345                 350         


Ile Lys Phe Arg Leu Gly Leu Phe Glu Arg Pro Tyr Val Asp Glu Ser 
        355                 360                 365             


Leu Glu Lys Thr Ala Tyr Leu Thr Ala Glu Ser Arg Ala Val Ala Arg 
    370                 375                 380                 


Glu Val Ala Ser Lys Ser Met Val Leu Leu Lys Asn Glu Arg Asp Thr 
385                 390                 395                 400 


Leu Pro Leu Ala Lys Thr Ile Gln Ser Ile Ala Val Ile Gly Pro Leu 
                405                 410                 415     


Ala Asp Asp Lys Arg Ser Pro Leu Gly Trp Trp Ser Gly Asp Gly Arg 
            420                 425                 430         


Asp Glu Asp Thr Val Thr Pro Leu Thr Gly Ile Lys Asn Lys Val Ser 
        435                 440                 445             


Ala Gln Thr Lys Val Thr Tyr Ala Lys Gly Cys Asp Val Thr Gly Asp 
    450                 455                 460                 


Ser Asp Ala Gly Phe Ala Glu Ala Val Ala Ala Ala Arg Asn Ser Asp 
465                 470                 475                 480 


Val Thr Ile Val Phe Val Gly Glu Ser Lys Asp Met Val Gly Glu Ala 
                485                 490                 495     


Ala Ser Arg Ala Thr Leu Asp Leu Pro Gly Arg Gln Met Asp Leu Val 
            500                 505                 510         


Arg Glu Val Ser Arg Ala Gly Lys Pro Thr Ile Val Val Leu Val Asn 
        515                 520                 525             


Gly Arg Pro Pro Ala Ile Gly Trp Ile Val Asp Asn Val Pro Ala Ile 
    530                 535                 540                 


Leu Glu Ser Trp Met Gly Gly Thr Glu Ser Gly Asn Ala Ile Ala Asp 
545                 550                 555                 560 


Val Leu Phe Gly Asp Val Asn Pro Gly Gly Lys Leu Pro Val Thr Phe 
                565                 570                 575     


Pro Arg Val Thr Gly Gln Val Pro Ile His Tyr Asn His Leu Asn Thr 
            580                 585                 590         


Gly Arg Pro Pro Glu Ala Asn Asn Arg Tyr Thr Ser Lys Tyr Phe Asp 
        595                 600                 605             


Ala Pro Trp Thr Pro Gln Phe Pro Phe Gly Phe Gly Leu Ser Phe Thr 
    610                 615                 620                 


Gln Phe Arg Ile Ser Asn Val Gly Ile Ser Ala Thr Gln Ile Gly Pro 
625                 630                 635                 640 


Asp Gly Thr Ile Arg Val Thr Ala Asp Val Glu Asn Val Gly Arg Arg 
                645                 650                 655     


Ala Gly Asp Glu Val Val Gln Leu Tyr Val Arg Asp Val Ala Ala Ser 
            660                 665                 670         


Ile Ser Arg Pro Val Lys Glu Leu Lys Gly Phe Gln Arg Val Thr Leu 
        675                 680                 685             


Gln Pro Gly Gln Lys Arg Ser Leu Glu Phe Val Leu Gly Pro Glu His 
    690                 695                 700                 


Leu Gly Phe Tyr Asn Arg Asp Met Lys Phe Val Val Glu Pro Gly Glu 
705                 710                 715                 720 


Phe Arg Val Met Val Gly Ala Asn Ser Gln Asp Val Ile Glu Lys Thr 
                725                 730                 735     


Phe Ala Val Arg 
            740 


<210> 415
<211> 1101
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 415
atgctgataa ttggaggcct tcttgtttta ctgggatttt cttcttgcgg gcggcaggca     60

gaacctgctg ctgactcttt cagggggttt catgactttg acatcaggcg tggggtgaac    120

atcagccact ggttgtcgca gagtggaagg cgtggtgctg atcgggaggc gttctttacc    180

agggcggatg tggaggccat cgccggcttc ggttatgatc acattcgttt gcccattgat    240

gaggagcaga tgtgggatga gtcgggcaac aaggaaccac gtgcctttga attgctgcat    300

gaagccattg gctgggcttt ggacaatgag ctcagggtca ttgtcgacct gcacatcatc    360

aggtcgcact attttaatgc gcctgagaac ccgctttgga ccgatcgtgc tgaacagttg    420

aaatttgttg agatgtggcg acagttgtct gatgagctgc agggctatcc gctcgatagg    480

gtggcctatg aattgatgaa tgaggccgtg gctgatgatc cggacgattg gaaccggctt    540

gtggctgaga cgatggaggc gctacggatg ctggaaccgg agcgcaagat tgtcattggc    600

tccaaccgct ggcagtctgt gcatacattt cctgacctgg tgatcccgga taatgacccg    660

catatcatat tgagttttca cttctacgaa ccatttctgc tgacgcacca caaggcctcc    720

tggacacaca tccgtgatta caccggtccg gtgaactatc cgggtttgac tgtagacccg    780

acccacctgg aggggttgtc tgaagaactg gtgacccgga ttggccatca caatggggtg    840

tatacaaaag aaacgatgga ggagatgatc atgatcccac tgcaatatgc caaagaccgg    900

gggctccccc tttattgtgg agagtgggga tgtttcccga ccatgcccca ggagatgcgc    960

ctgcaatggt acgccgatgt gcgtgcgatc ctggaaaagc atgagattgc ctgggcaaac   1020

tgggattaca agggtggttt cggtgtggtt gaccgcaacg gcgaacccca ccatgattta   1080

ttggaagtgc tcttaaaata a                                             1101

<210> 416
<211> 366
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(20)

<220> 
<221> DOMAIN
<222> (42)...(349)
<223> Cellulase (glycosyl hydrolase family 5)

<220> 
<221> SITE
<222> (40)...(43)
<223> N-glycosylation site. Prosite id = PS00001

<400> 416
Met Leu Ile Ile Gly Gly Leu Leu Val Leu Leu Gly Phe Ser Ser Cys 
1               5                   10                  15      


Gly Arg Gln Ala Glu Pro Ala Ala Asp Ser Phe Arg Gly Phe His Asp 
            20                  25                  30          


Phe Asp Ile Arg Arg Gly Val Asn Ile Ser His Trp Leu Ser Gln Ser 
        35                  40                  45              


Gly Arg Arg Gly Ala Asp Arg Glu Ala Phe Phe Thr Arg Ala Asp Val 
    50                  55                  60                  


Glu Ala Ile Ala Gly Phe Gly Tyr Asp His Ile Arg Leu Pro Ile Asp 
65                  70                  75                  80  


Glu Glu Gln Met Trp Asp Glu Ser Gly Asn Lys Glu Pro Arg Ala Phe 
                85                  90                  95      


Glu Leu Leu His Glu Ala Ile Gly Trp Ala Leu Asp Asn Glu Leu Arg 
            100                 105                 110         


Val Ile Val Asp Leu His Ile Ile Arg Ser His Tyr Phe Asn Ala Pro 
        115                 120                 125             


Glu Asn Pro Leu Trp Thr Asp Arg Ala Glu Gln Leu Lys Phe Val Glu 
    130                 135                 140                 


Met Trp Arg Gln Leu Ser Asp Glu Leu Gln Gly Tyr Pro Leu Asp Arg 
145                 150                 155                 160 


Val Ala Tyr Glu Leu Met Asn Glu Ala Val Ala Asp Asp Pro Asp Asp 
                165                 170                 175     


Trp Asn Arg Leu Val Ala Glu Thr Met Glu Ala Leu Arg Met Leu Glu 
            180                 185                 190         


Pro Glu Arg Lys Ile Val Ile Gly Ser Asn Arg Trp Gln Ser Val His 
        195                 200                 205             


Thr Phe Pro Asp Leu Val Ile Pro Asp Asn Asp Pro His Ile Ile Leu 
    210                 215                 220                 


Ser Phe His Phe Tyr Glu Pro Phe Leu Leu Thr His His Lys Ala Ser 
225                 230                 235                 240 


Trp Thr His Ile Arg Asp Tyr Thr Gly Pro Val Asn Tyr Pro Gly Leu 
                245                 250                 255     


Thr Val Asp Pro Thr His Leu Glu Gly Leu Ser Glu Glu Leu Val Thr 
            260                 265                 270         


Arg Ile Gly His His Asn Gly Val Tyr Thr Lys Glu Thr Met Glu Glu 
        275                 280                 285             


Met Ile Met Ile Pro Leu Gln Tyr Ala Lys Asp Arg Gly Leu Pro Leu 
    290                 295                 300                 


Tyr Cys Gly Glu Trp Gly Cys Phe Pro Thr Met Pro Gln Glu Met Arg 
305                 310                 315                 320 


Leu Gln Trp Tyr Ala Asp Val Arg Ala Ile Leu Glu Lys His Glu Ile 
                325                 330                 335     


Ala Trp Ala Asn Trp Asp Tyr Lys Gly Gly Phe Gly Val Val Asp Arg 
            340                 345                 350         


Asn Gly Glu Pro His His Asp Leu Leu Glu Val Leu Leu Lys 
        355                 360                 365     


<210> 417
<211> 2184
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 417
atgaacaaga aacaaagcga attcatccaa ccagagatcg aagcgaagat cgatgctttg     60

ctggaaaaaa tgaccctgct cgaaaaagtg gggcagctca cgcagttggg accatccatg    120

gtcggcggtt tcgatatgga tgcctttttg gataaccccg agttgttcaa aagcgccaaa    180

cgcgactttc atgaggactg gatcatcaaa ggcgaagtcg gttcctatct cggcgtgcaa    240

ggtgcggaag agatcaaccg cctgcaaaag attgccgtcg aaggatcgcg gctgggcatt    300

cccctcctct tcgggttgga tgtgatccac ggctaccgca ccatctttcc gatcccactg    360

gcggagacgt gcagttggga accggaactg gcacgcagaa cggcggaagt cgccgcacgc    420

gaagcctccg ctgccggtct gcattggacc ttcgcgccca tgatggatat cgcccgcgat    480

gcgcgctggg gacgcattgc cgaaggctcc ggcgaagatc cgttcctggg cagcctcttt    540

gctgcggcgc gtgtgcgtgg attccaggga gatgacctca gcgatccgca gcatgtggct    600

gcctgtgcca agcattacat tgcctatggt gcggcggtcg ctggtcgcga ttacaacacg    660

gtcgagatgg cggagcaaac cctgcatgag gtctatctgc cgccgttcac cgccgccgta    720

gaagagggtg tgctgacctt tatgagcgcc ttcaatgatc tgaacggtgt ccctacctct    780

gccaaccggt acaccctcac cgatattttg cgcggcaaac tgggcttcaa cggtcttgtc    840

gtcagcgatt cgggctccgt cggtgaactg gtcgcgcacg gatacgccgc cgaccgcaag    900

gatgccggga agaaggcgct gctcgccggt gtggatatgg acatggtcag cgagagttac    960

cgcttcgaca tccccgactt ggtggaagcg ggcatcgtgc cgctctccat tgtggacgag   1020

gcagtgcggc gaatcctgcg cgtcaaattc ctattgggct tgttcgagca cccataccgc   1080

tccaacgctg acgaggaatc cgcggcgcag ctgaccgccg aacatctcgc cgtggcacgc   1140

gagtcggcgc ggcgttcgat cgtcctgttg aagaatgagg gcggaatttt gcccatcaag   1200

gagaatcaga agatcgccct catcggtcca ttcgcggata atcaggcaga catgctcggc   1260

tcgtggtcgt ttaccggagc ggcgaaagat gtcgtcacga tcctcagcgg gatacaagcc   1320

gctgcgaaag cggaagtgct ttactcacag ggatgcgatg ctaagggaga acaacccgcc   1380

gatttcactc atgccgttga aaccgctcaa caggcggatg tcattgtcgc ggttgtcggt   1440

gaaccgatgg gcatgagcgg agaagccgcc agccgcatgc atctcggtct gaccggtcag   1500

caggaagcgc tgctggaagc gttgaaagcg acaggcaaac cgctggtcgt gcttctgagc   1560

aacgggcgtc cacttacggt cccgtgggtt gccctccatg ccgatgcgat cctggagacc   1620

tggcagcttg gcatacaagc cggtcacgcc gtggcagatg tgctcttcgg cgcctacaac   1680

ccgagcggaa aactcgccgc cactttcccc tattcggtgg ggcaatgccc gatctattac   1740

agccacccga gcacgggacg tcccgccacc gatttctatt tcacgtccaa gtacaatgat   1800

ggtccggtga agccgctgta cccgttcggc ttcgggttga gctacaccac gttcgaatat   1860

tccaatctga aggttagcgc ggatgctgaa aaagtgatgg tcagcgctgt ggtgaagaat   1920

tccggcagcc tggcagggga ggaagtggtt cagttatatg tccaggatgt ggttggcagc   1980

cgcgtgcgcc cggtgaagga attaaaaggc ttccagaaga tcatgctgcc agcgggggaa   2040

gattgcacag tgaccttcgg gttgaacgtc tctgacctgg gcttctacga tgtggacatg   2100

aaatatgtcg tcgagccggg acagttcaag gtttgggtgg gcacgaactc agcagaaggc   2160

ttggaaggcg aattcaggtt ataa                                          2184

<210> 418
<211> 727
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (92)...(316)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (387)...(617)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (273)...(290)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (699)...(702)
<223> N-glycosylation site. Prosite id = PS00001

<400> 418
Met Asn Lys Lys Gln Ser Glu Phe Ile Gln Pro Glu Ile Glu Ala Lys 
1               5                   10                  15      


Ile Asp Ala Leu Leu Glu Lys Met Thr Leu Leu Glu Lys Val Gly Gln 
            20                  25                  30          


Leu Thr Gln Leu Gly Pro Ser Met Val Gly Gly Phe Asp Met Asp Ala 
        35                  40                  45              


Phe Leu Asp Asn Pro Glu Leu Phe Lys Ser Ala Lys Arg Asp Phe His 
    50                  55                  60                  


Glu Asp Trp Ile Ile Lys Gly Glu Val Gly Ser Tyr Leu Gly Val Gln 
65                  70                  75                  80  


Gly Ala Glu Glu Ile Asn Arg Leu Gln Lys Ile Ala Val Glu Gly Ser 
                85                  90                  95      


Arg Leu Gly Ile Pro Leu Leu Phe Gly Leu Asp Val Ile His Gly Tyr 
            100                 105                 110         


Arg Thr Ile Phe Pro Ile Pro Leu Ala Glu Thr Cys Ser Trp Glu Pro 
        115                 120                 125             


Glu Leu Ala Arg Arg Thr Ala Glu Val Ala Ala Arg Glu Ala Ser Ala 
    130                 135                 140                 


Ala Gly Leu His Trp Thr Phe Ala Pro Met Met Asp Ile Ala Arg Asp 
145                 150                 155                 160 


Ala Arg Trp Gly Arg Ile Ala Glu Gly Ser Gly Glu Asp Pro Phe Leu 
                165                 170                 175     


Gly Ser Leu Phe Ala Ala Ala Arg Val Arg Gly Phe Gln Gly Asp Asp 
            180                 185                 190         


Leu Ser Asp Pro Gln His Val Ala Ala Cys Ala Lys His Tyr Ile Ala 
        195                 200                 205             


Tyr Gly Ala Ala Val Ala Gly Arg Asp Tyr Asn Thr Val Glu Met Ala 
    210                 215                 220                 


Glu Gln Thr Leu His Glu Val Tyr Leu Pro Pro Phe Thr Ala Ala Val 
225                 230                 235                 240 


Glu Glu Gly Val Leu Thr Phe Met Ser Ala Phe Asn Asp Leu Asn Gly 
                245                 250                 255     


Val Pro Thr Ser Ala Asn Arg Tyr Thr Leu Thr Asp Ile Leu Arg Gly 
            260                 265                 270         


Lys Leu Gly Phe Asn Gly Leu Val Val Ser Asp Ser Gly Ser Val Gly 
        275                 280                 285             


Glu Leu Val Ala His Gly Tyr Ala Ala Asp Arg Lys Asp Ala Gly Lys 
    290                 295                 300                 


Lys Ala Leu Leu Ala Gly Val Asp Met Asp Met Val Ser Glu Ser Tyr 
305                 310                 315                 320 


Arg Phe Asp Ile Pro Asp Leu Val Glu Ala Gly Ile Val Pro Leu Ser 
                325                 330                 335     


Ile Val Asp Glu Ala Val Arg Arg Ile Leu Arg Val Lys Phe Leu Leu 
            340                 345                 350         


Gly Leu Phe Glu His Pro Tyr Arg Ser Asn Ala Asp Glu Glu Ser Ala 
        355                 360                 365             


Ala Gln Leu Thr Ala Glu His Leu Ala Val Ala Arg Glu Ser Ala Arg 
    370                 375                 380                 


Arg Ser Ile Val Leu Leu Lys Asn Glu Gly Gly Ile Leu Pro Ile Lys 
385                 390                 395                 400 


Glu Asn Gln Lys Ile Ala Leu Ile Gly Pro Phe Ala Asp Asn Gln Ala 
                405                 410                 415     


Asp Met Leu Gly Ser Trp Ser Phe Thr Gly Ala Ala Lys Asp Val Val 
            420                 425                 430         


Thr Ile Leu Ser Gly Ile Gln Ala Ala Ala Lys Ala Glu Val Leu Tyr 
        435                 440                 445             


Ser Gln Gly Cys Asp Ala Lys Gly Glu Gln Pro Ala Asp Phe Thr His 
    450                 455                 460                 


Ala Val Glu Thr Ala Gln Gln Ala Asp Val Ile Val Ala Val Val Gly 
465                 470                 475                 480 


Glu Pro Met Gly Met Ser Gly Glu Ala Ala Ser Arg Met His Leu Gly 
                485                 490                 495     


Leu Thr Gly Gln Gln Glu Ala Leu Leu Glu Ala Leu Lys Ala Thr Gly 
            500                 505                 510         


Lys Pro Leu Val Val Leu Leu Ser Asn Gly Arg Pro Leu Thr Val Pro 
        515                 520                 525             


Trp Val Ala Leu His Ala Asp Ala Ile Leu Glu Thr Trp Gln Leu Gly 
    530                 535                 540                 


Ile Gln Ala Gly His Ala Val Ala Asp Val Leu Phe Gly Ala Tyr Asn 
545                 550                 555                 560 


Pro Ser Gly Lys Leu Ala Ala Thr Phe Pro Tyr Ser Val Gly Gln Cys 
                565                 570                 575     


Pro Ile Tyr Tyr Ser His Pro Ser Thr Gly Arg Pro Ala Thr Asp Phe 
            580                 585                 590         


Tyr Phe Thr Ser Lys Tyr Asn Asp Gly Pro Val Lys Pro Leu Tyr Pro 
        595                 600                 605             


Phe Gly Phe Gly Leu Ser Tyr Thr Thr Phe Glu Tyr Ser Asn Leu Lys 
    610                 615                 620                 


Val Ser Ala Asp Ala Glu Lys Val Met Val Ser Ala Val Val Lys Asn 
625                 630                 635                 640 


Ser Gly Ser Leu Ala Gly Glu Glu Val Val Gln Leu Tyr Val Gln Asp 
                645                 650                 655     


Val Val Gly Ser Arg Val Arg Pro Val Lys Glu Leu Lys Gly Phe Gln 
            660                 665                 670         


Lys Ile Met Leu Pro Ala Gly Glu Asp Cys Thr Val Thr Phe Gly Leu 
        675                 680                 685             


Asn Val Ser Asp Leu Gly Phe Tyr Asp Val Asp Met Lys Tyr Val Val 
    690                 695                 700                 


Glu Pro Gly Gln Phe Lys Val Trp Val Gly Thr Asn Ser Ala Glu Gly 
705                 710                 715                 720 


Leu Glu Gly Glu Phe Arg Leu 
                725         


<210> 419
<211> 2277
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 419
atgtccgata ttcacgcgct cgtagccagt atgaccttag aagaaaaagc ggcgctctgc     60

accggagcca gcgcctggac gacaacgccg gtcaaacggc tcaacctgcc cgaactgctc    120

gtctccgatg ggccgcacgg catccgccgc atcgccgacg tttacgccat ggcccagcaa    180

agcctgcccg ccacctgctt ccccaccgcc tccagcctgg ccgccacctg ggatggtgat    240

ttgctgttcc agatggggca ggcgttggcc gaagaggcca tcgccctggg cgtggacgtc    300

atcctcggcc ccggcgtcaa catgaaacgc tcgcccctct gcggccgcaa ctttgaatac    360

ttctccgaag acccattcca ggcgggggcg atggccgcca gcctgatcaa aggcattcag    420

agcaagggcg tcggcacgtc gctgaagcat tacgccgcca acaaccagga atttgaacgc    480

ttttcgatca acgcccaggt ggacgaacgg acgctgcgcg agatttattt accggcgttt    540

gaaacggccg ttacccaggc ccaaccctgg accgtcatgt gttcctacaa caaaatcaac    600

ggaacctacg gctccgaaca cacccaactg ctcagcgaca tcctcaaaaa agagtggggc    660

ttccagggct tcgtcgtctc cgactggggc gctgtccacg accgggtggc cgccctcaaa    720

gccgggctag acctggaaat gcccggcccc aaacagacgc gggtcaacgc cgtcatcgag    780

gcggtgcgca acggcgaact ggacgaagcg acgctggacg aagccgtccg gcgcatcttg    840

cgcgtcaccc tctgcgccgc gcaaacgccc aagggcggcg agtttgacgc cgccgcccac    900

cacgccctgg cccgcaaagt ggccgccgag ggcatggtcc tcctcaagaa caacggcctg    960

ctgcccctgc aaaacccgca gcacatcgcc gtcatcggcc gcgccgcgca aaaagcccac   1020

ttccagggcg gcggcagctc gcacatcaac cccacccagg tagacgtgcc cttcagcgaa   1080

ctgcaaaagc tggccgacaa cgccgaactg agcttcgccg ccggctatcc cgaagacgac   1140

agccgggacc aggcgctcat tgacgaagcg accgccatcg cccaaaccgc cgatgtggcc   1200

ctgctctaca tcgctctgcc ctccttcaag gaatcggaag ggtacgaccg accagacctg   1260

gacctgacgg cgcagcaggt ggccttgatt caggcggtaa cggccgttca acccaacacc   1320

gtcgtcatcc tcaacaacgg cgccccggtc gtcatgggcg aatggcttga tggggcggcg   1380

gccgtgctgg aagcgtggat gatgggtcag gcgggcggcg gagccatcgc cgacgtcctg   1440

tttggtcggg tcaaccccag cggcaaactg gccgaaacct acccccaccg cttaaccgac   1500

acccccgcct atctcaactt ccccggcgaa aacggcgtgg tgcgctacgg cgaagggctg   1560

ttcatcggct accgctacta cgacgccaaa gagatgcccg tcctgttccc ctttggctac   1620

ggcctcagct ataccagctt cgcctatagc aatttgcgcg tctcgaccga cagcttccgc   1680

gacgtagacg gcctcatcgt ctcggtagac gtgaccaaca ccggcgcggt ggcgggcaag   1740

gaagtggtgc agctctacgt tcgtgaccag gaggcgcggc tggtgcggcc ggtcaaagaa   1800

ctcaaaggct tcgccaaagt cgccctacaa ccgggcgaga cccaaaccgt caccatcccg   1860

ctcgacttcc gcgccttcgc ctactatgat cccgcctatc ggcaatggat cgccgaagag   1920

gggcaattcg acattctggt gggcgcgtcg gcgaccgata ttcgctgcca aacggccgtt   1980

accctgcact ccaccgccca gttgcccacc atcctccacg aagaatccac catccgccac   2040

tggttcaacg acccggccgg gaaaaccatt ctccagccca tgttcgccga actcatggcc   2100

aacggcccct tcagccagga cgaatccggc caggacgcca tcggcatgga cacgctcagt   2160

ttcctgatgg atttgcccct gcgcagcttc ctccatttcc aggaaggctc gctcacccag   2220

cccgccgacg acatcgccga catgctgctg gagcaggtgc gcggcgtgaa gcactaa      2277

<210> 420
<211> 758
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (28)...(248)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (312)...(524)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (203)...(206)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (217)...(234)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<400> 420
Met Ser Asp Ile His Ala Leu Val Ala Ser Met Thr Leu Glu Glu Lys 
1               5                   10                  15      


Ala Ala Leu Cys Thr Gly Ala Ser Ala Trp Thr Thr Thr Pro Val Lys 
            20                  25                  30          


Arg Leu Asn Leu Pro Glu Leu Leu Val Ser Asp Gly Pro His Gly Ile 
        35                  40                  45              


Arg Arg Ile Ala Asp Val Tyr Ala Met Ala Gln Gln Ser Leu Pro Ala 
    50                  55                  60                  


Thr Cys Phe Pro Thr Ala Ser Ser Leu Ala Ala Thr Trp Asp Gly Asp 
65                  70                  75                  80  


Leu Leu Phe Gln Met Gly Gln Ala Leu Ala Glu Glu Ala Ile Ala Leu 
                85                  90                  95      


Gly Val Asp Val Ile Leu Gly Pro Gly Val Asn Met Lys Arg Ser Pro 
            100                 105                 110         


Leu Cys Gly Arg Asn Phe Glu Tyr Phe Ser Glu Asp Pro Phe Gln Ala 
        115                 120                 125             


Gly Ala Met Ala Ala Ser Leu Ile Lys Gly Ile Gln Ser Lys Gly Val 
    130                 135                 140                 


Gly Thr Ser Leu Lys His Tyr Ala Ala Asn Asn Gln Glu Phe Glu Arg 
145                 150                 155                 160 


Phe Ser Ile Asn Ala Gln Val Asp Glu Arg Thr Leu Arg Glu Ile Tyr 
                165                 170                 175     


Leu Pro Ala Phe Glu Thr Ala Val Thr Gln Ala Gln Pro Trp Thr Val 
            180                 185                 190         


Met Cys Ser Tyr Asn Lys Ile Asn Gly Thr Tyr Gly Ser Glu His Thr 
        195                 200                 205             


Gln Leu Leu Ser Asp Ile Leu Lys Lys Glu Trp Gly Phe Gln Gly Phe 
    210                 215                 220                 


Val Val Ser Asp Trp Gly Ala Val His Asp Arg Val Ala Ala Leu Lys 
225                 230                 235                 240 


Ala Gly Leu Asp Leu Glu Met Pro Gly Pro Lys Gln Thr Arg Val Asn 
                245                 250                 255     


Ala Val Ile Glu Ala Val Arg Asn Gly Glu Leu Asp Glu Ala Thr Leu 
            260                 265                 270         


Asp Glu Ala Val Arg Arg Ile Leu Arg Val Thr Leu Cys Ala Ala Gln 
        275                 280                 285             


Thr Pro Lys Gly Gly Glu Phe Asp Ala Ala Ala His His Ala Leu Ala 
    290                 295                 300                 


Arg Lys Val Ala Ala Glu Gly Met Val Leu Leu Lys Asn Asn Gly Leu 
305                 310                 315                 320 


Leu Pro Leu Gln Asn Pro Gln His Ile Ala Val Ile Gly Arg Ala Ala 
                325                 330                 335     


Gln Lys Ala His Phe Gln Gly Gly Gly Ser Ser His Ile Asn Pro Thr 
            340                 345                 350         


Gln Val Asp Val Pro Phe Ser Glu Leu Gln Lys Leu Ala Asp Asn Ala 
        355                 360                 365             


Glu Leu Ser Phe Ala Ala Gly Tyr Pro Glu Asp Asp Ser Arg Asp Gln 
    370                 375                 380                 


Ala Leu Ile Asp Glu Ala Thr Ala Ile Ala Gln Thr Ala Asp Val Ala 
385                 390                 395                 400 


Leu Leu Tyr Ile Ala Leu Pro Ser Phe Lys Glu Ser Glu Gly Tyr Asp 
                405                 410                 415     


Arg Pro Asp Leu Asp Leu Thr Ala Gln Gln Val Ala Leu Ile Gln Ala 
            420                 425                 430         


Val Thr Ala Val Gln Pro Asn Thr Val Val Ile Leu Asn Asn Gly Ala 
        435                 440                 445             


Pro Val Val Met Gly Glu Trp Leu Asp Gly Ala Ala Ala Val Leu Glu 
    450                 455                 460                 


Ala Trp Met Met Gly Gln Ala Gly Gly Gly Ala Ile Ala Asp Val Leu 
465                 470                 475                 480 


Phe Gly Arg Val Asn Pro Ser Gly Lys Leu Ala Glu Thr Tyr Pro His 
                485                 490                 495     


Arg Leu Thr Asp Thr Pro Ala Tyr Leu Asn Phe Pro Gly Glu Asn Gly 
            500                 505                 510         


Val Val Arg Tyr Gly Glu Gly Leu Phe Ile Gly Tyr Arg Tyr Tyr Asp 
        515                 520                 525             


Ala Lys Glu Met Pro Val Leu Phe Pro Phe Gly Tyr Gly Leu Ser Tyr 
    530                 535                 540                 


Thr Ser Phe Ala Tyr Ser Asn Leu Arg Val Ser Thr Asp Ser Phe Arg 
545                 550                 555                 560 


Asp Val Asp Gly Leu Ile Val Ser Val Asp Val Thr Asn Thr Gly Ala 
                565                 570                 575     


Val Ala Gly Lys Glu Val Val Gln Leu Tyr Val Arg Asp Gln Glu Ala 
            580                 585                 590         


Arg Leu Val Arg Pro Val Lys Glu Leu Lys Gly Phe Ala Lys Val Ala 
        595                 600                 605             


Leu Gln Pro Gly Glu Thr Gln Thr Val Thr Ile Pro Leu Asp Phe Arg 
    610                 615                 620                 


Ala Phe Ala Tyr Tyr Asp Pro Ala Tyr Arg Gln Trp Ile Ala Glu Glu 
625                 630                 635                 640 


Gly Gln Phe Asp Ile Leu Val Gly Ala Ser Ala Thr Asp Ile Arg Cys 
                645                 650                 655     


Gln Thr Ala Val Thr Leu His Ser Thr Ala Gln Leu Pro Thr Ile Leu 
            660                 665                 670         


His Glu Glu Ser Thr Ile Arg His Trp Phe Asn Asp Pro Ala Gly Lys 
        675                 680                 685             


Thr Ile Leu Gln Pro Met Phe Ala Glu Leu Met Ala Asn Gly Pro Phe 
    690                 695                 700                 


Ser Gln Asp Glu Ser Gly Gln Asp Ala Ile Gly Met Asp Thr Leu Ser 
705                 710                 715                 720 


Phe Leu Met Asp Leu Pro Leu Arg Ser Phe Leu His Phe Gln Glu Gly 
                725                 730                 735     


Ser Leu Thr Gln Pro Ala Asp Asp Ile Ala Asp Met Leu Leu Glu Gln 
            740                 745                 750         


Val Arg Gly Val Lys His 
        755             


<210> 421
<211> 2253
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 421
atgcaagcag tgtcctggcc atcgctgacc agcgcggtac ccaaacaatc cgaaacagaa     60

gcacagattg atgagttgct ggcgcgcatg accctggagg aaaaggtcgg ccagctgatc    120

cagccggaac tgcgccatgt cacgccggaa gacgtcaggg aatttcatct gggttcggta    180

ctcaacggcg gcggttcgtt tcccaatggc gatcagtacg ccccggtcag cgactgggtc    240

gcggtggccg acggcttcta tgaagcctcc gtggacaccc ggggaggccg taccggtatt    300

cccatcatgt ggggcacgga cgcggtgcat ggcctgggca atgtcatcgg cgcgaccctg    360

tttccccaca atatcgccct gggcgccgcc cgcgatcccg aactgattcg ggccatcggc    420

gaggtgaccg cgaaggaaat agccattacc ggcctggact ggaacttctc gcccaccgtg    480

gcggttgccc gggatgagcg ctggggccgg acctacgagg cgtactccga ggacccggaa    540

atcgtggcgc agtacgcggg cgagatggtc aaaggcctgc agggtgagcc ggggaccggg    600

gagtttctgc gcgactggcg cgtggtggcg acggcgaaac attttattgc cgatggcggc    660

accattgacg gtatagacag aggcgacaat accgacccgg aagaaaagct ccgggatatt    720

cacggcccgg gctacttcag tgccatcgag gcgggcgtgc aggtggtgat ggcctccttc    780

agcagttggc acggcgagcc catgcacggc cacaaatacc tgttgaccga tgtcctcaag    840

gggcaactgg gctttgatgg cgtggtgctg ggcgactgga gcggccacgg gtttattccc    900

ggctgtaccg ccctcgattg tccggatgcc ctgctggcgg gcctggacgt gtatatgatt    960

ccggacccgg agtggaagca gctttactac aacctgattg accaggtgcg ggggggaatt   1020

attccccagg cgcgcctgga cgatgcggtg tgccgcatgc tgcgggtgaa aatgcgcgcg   1080

ggcatgttcg acaaggccag gccctcccga cgcccctggg ccaaccgggc cgacaggctg   1140

ggctcgcccg agcaccgggc cgtggcccgc cgggcggtgc gtgaatccct ggtgatgctg   1200

aaaaaccgcg gcaacctgct gcccctggcc cccaaccagc gggtgctggt ggccggggac   1260

ggcgcccaca atatcggcaa gcaggcgggt ggctggagcg tgacctggca ggggacgggt   1320

accaccaaag aggattttcc cggcgccacc accatcttcg agggcatcgc gcaggtggtg   1380

aacgccgccg ggggcgaggc ggtgttgagc ggggacggca gctttgacca gcgcccggat   1440

gtggccattg tggtgttcgg ggaagacccc tacgccgaga tgcaggggga tatggccaac   1500

accctgtaca agccgggcga cgactcggat ctggcgctgc tgcgccggtt gcgggcccag   1560

gagattccgg tggtggccct gtttatcacc ggccggcccc tgtgggtgaa ccgcgagctc   1620

aacgcctccg acgcctttgt ggtgatctgg caaccgggca ccgaaggtgg cggtgtggcc   1680

gatgtcctct tcggtgacag tgaggggcgg gctcgtcacc ccatgcgggg gcgcctgacg   1740

tttacctggc ccaggcgccc ggatcagggg cccgtcaatc gcggcgacga agattatgac   1800

ccgctgttcc cctatggttt tggcctgggc tatggcgacc cggatacgct ggggcaattg   1860

ccggaagagg gcgtggcggt agcggcaacc cccgatgtgc tggatatctt ctatcgccgg   1920

cccatggggc cctggcagtt ggagatcgag ggccggctca acgaccgggt ggccatgacc   1980

ggcagccgtg ccgacacctc cacggtcggc gtcgacgccg tggaccggga ggtgcaggag   2040

gatgcccgcc atgtggtctg gagcggcgag ggctttggca ggctggcgct ggcatccgcc   2100

aagcgcgtgg acctgagcga ctacctgggc gccaacgccg ccctggtgtt tgatgtcaag   2160

gtgcacaagg cgcccactcg taccctgtgg ctgcgtctgg cgagcgcctc ctgctgccac   2220

gccgatatcg acgtgaccga cgaattcttt gga                                2253

<210> 422
<211> 751
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (93)...(320)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (397)...(613)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (549)...(552)
<223> N-glycosylation site. Prosite id = PS00001

<400> 422
Met Gln Ala Val Ser Trp Pro Ser Leu Thr Ser Ala Val Pro Lys Gln 
1               5                   10                  15      


Ser Glu Thr Glu Ala Gln Ile Asp Glu Leu Leu Ala Arg Met Thr Leu 
            20                  25                  30          


Glu Glu Lys Val Gly Gln Leu Ile Gln Pro Glu Leu Arg His Val Thr 
        35                  40                  45              


Pro Glu Asp Val Arg Glu Phe His Leu Gly Ser Val Leu Asn Gly Gly 
    50                  55                  60                  


Gly Ser Phe Pro Asn Gly Asp Gln Tyr Ala Pro Val Ser Asp Trp Val 
65                  70                  75                  80  


Ala Val Ala Asp Gly Phe Tyr Glu Ala Ser Val Asp Thr Arg Gly Gly 
                85                  90                  95      


Arg Thr Gly Ile Pro Ile Met Trp Gly Thr Asp Ala Val His Gly Leu 
            100                 105                 110         


Gly Asn Val Ile Gly Ala Thr Leu Phe Pro His Asn Ile Ala Leu Gly 
        115                 120                 125             


Ala Ala Arg Asp Pro Glu Leu Ile Arg Ala Ile Gly Glu Val Thr Ala 
    130                 135                 140                 


Lys Glu Ile Ala Ile Thr Gly Leu Asp Trp Asn Phe Ser Pro Thr Val 
145                 150                 155                 160 


Ala Val Ala Arg Asp Glu Arg Trp Gly Arg Thr Tyr Glu Ala Tyr Ser 
                165                 170                 175     


Glu Asp Pro Glu Ile Val Ala Gln Tyr Ala Gly Glu Met Val Lys Gly 
            180                 185                 190         


Leu Gln Gly Glu Pro Gly Thr Gly Glu Phe Leu Arg Asp Trp Arg Val 
        195                 200                 205             


Val Ala Thr Ala Lys His Phe Ile Ala Asp Gly Gly Thr Ile Asp Gly 
    210                 215                 220                 


Ile Asp Arg Gly Asp Asn Thr Asp Pro Glu Glu Lys Leu Arg Asp Ile 
225                 230                 235                 240 


His Gly Pro Gly Tyr Phe Ser Ala Ile Glu Ala Gly Val Gln Val Val 
                245                 250                 255     


Met Ala Ser Phe Ser Ser Trp His Gly Glu Pro Met His Gly His Lys 
            260                 265                 270         


Tyr Leu Leu Thr Asp Val Leu Lys Gly Gln Leu Gly Phe Asp Gly Val 
        275                 280                 285             


Val Leu Gly Asp Trp Ser Gly His Gly Phe Ile Pro Gly Cys Thr Ala 
    290                 295                 300                 


Leu Asp Cys Pro Asp Ala Leu Leu Ala Gly Leu Asp Val Tyr Met Ile 
305                 310                 315                 320 


Pro Asp Pro Glu Trp Lys Gln Leu Tyr Tyr Asn Leu Ile Asp Gln Val 
                325                 330                 335     


Arg Gly Gly Ile Ile Pro Gln Ala Arg Leu Asp Asp Ala Val Cys Arg 
            340                 345                 350         


Met Leu Arg Val Lys Met Arg Ala Gly Met Phe Asp Lys Ala Arg Pro 
        355                 360                 365             


Ser Arg Arg Pro Trp Ala Asn Arg Ala Asp Arg Leu Gly Ser Pro Glu 
    370                 375                 380                 


His Arg Ala Val Ala Arg Arg Ala Val Arg Glu Ser Leu Val Met Leu 
385                 390                 395                 400 


Lys Asn Arg Gly Asn Leu Leu Pro Leu Ala Pro Asn Gln Arg Val Leu 
                405                 410                 415     


Val Ala Gly Asp Gly Ala His Asn Ile Gly Lys Gln Ala Gly Gly Trp 
            420                 425                 430         


Ser Val Thr Trp Gln Gly Thr Gly Thr Thr Lys Glu Asp Phe Pro Gly 
        435                 440                 445             


Ala Thr Thr Ile Phe Glu Gly Ile Ala Gln Val Val Asn Ala Ala Gly 
    450                 455                 460                 


Gly Glu Ala Val Leu Ser Gly Asp Gly Ser Phe Asp Gln Arg Pro Asp 
465                 470                 475                 480 


Val Ala Ile Val Val Phe Gly Glu Asp Pro Tyr Ala Glu Met Gln Gly 
                485                 490                 495     


Asp Met Ala Asn Thr Leu Tyr Lys Pro Gly Asp Asp Ser Asp Leu Ala 
            500                 505                 510         


Leu Leu Arg Arg Leu Arg Ala Gln Glu Ile Pro Val Val Ala Leu Phe 
        515                 520                 525             


Ile Thr Gly Arg Pro Leu Trp Val Asn Arg Glu Leu Asn Ala Ser Asp 
    530                 535                 540                 


Ala Phe Val Val Ile Trp Gln Pro Gly Thr Glu Gly Gly Gly Val Ala 
545                 550                 555                 560 


Asp Val Leu Phe Gly Asp Ser Glu Gly Arg Ala Arg His Pro Met Arg 
                565                 570                 575     


Gly Arg Leu Thr Phe Thr Trp Pro Arg Arg Pro Asp Gln Gly Pro Val 
            580                 585                 590         


Asn Arg Gly Asp Glu Asp Tyr Asp Pro Leu Phe Pro Tyr Gly Phe Gly 
        595                 600                 605             


Leu Gly Tyr Gly Asp Pro Asp Thr Leu Gly Gln Leu Pro Glu Glu Gly 
    610                 615                 620                 


Val Ala Val Ala Ala Thr Pro Asp Val Leu Asp Ile Phe Tyr Arg Arg 
625                 630                 635                 640 


Pro Met Gly Pro Trp Gln Leu Glu Ile Glu Gly Arg Leu Asn Asp Arg 
                645                 650                 655     


Val Ala Met Thr Gly Ser Arg Ala Asp Thr Ser Thr Val Gly Val Asp 
            660                 665                 670         


Ala Val Asp Arg Glu Val Gln Glu Asp Ala Arg His Val Val Trp Ser 
        675                 680                 685             


Gly Glu Gly Phe Gly Arg Leu Ala Leu Ala Ser Ala Lys Arg Val Asp 
    690                 695                 700                 


Leu Ser Asp Tyr Leu Gly Ala Asn Ala Ala Leu Val Phe Asp Val Lys 
705                 710                 715                 720 


Val His Lys Ala Pro Thr Arg Thr Leu Trp Leu Arg Leu Ala Ser Ala 
                725                 730                 735     


Ser Cys Cys His Ala Asp Ile Asp Val Thr Asp Glu Phe Phe Gly 
            740                 745                 750     


<210> 423
<211> 1338
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 423
atgaatcacg ctgcacgccg ccgtacttta ctgggcctgg gcacggcact tgccggcgct     60

acgctgctgc cgcgcggcgc tgccgccgcc acggcacgcg gacctttccc ccagggcttc    120

ctgtggggcg cggccatcgc cggccaccaa gcggaaggcg acaacgtcgc gagcgacgcc    180

tggctgctgg agaacattca gccgacggaa ttcaaggagc cgtcgggcgc cgccgtcgat    240

cactaccgcc tgtacgacca ggacatcgcg accctcgcgt cgctgggcct gaataccttc    300

cgcttctcga tcgagtgggc gcgcgtggaa cccgtcgaag gcatgttctc ggtcgccgcg    360

ctggaacact atcgcgacgt gctgcagtcc tgccgccggc atcgggtgaa agcgatggtc    420

agcttcaacc acttcgtcac gcccgcgtgg ttcgcggcac ggggcggctg ggaaacggac    480

gggtcggcgc agctgtacgc ccgctactgc gacaaggttg cgcgccacct gggcgacctg    540

atcgattacg cgacgacgtt caacgaaccg aatctgccgc gcctgttgtt cggcatcccg    600

gggccgctgg ccggcatggc cgacaacccg cgcatgaagg cgatgctcgc gaaggcgggg    660

cagctggcgg gcaccggcaa gtggtcgtca tggatcttcg gcgacttcgc gcgcatcgag    720

gcgggcctgc tgcaggcgca cgcggccggc tacgcggcca tcaaggccgt gcgaccgcag    780

ctgccggtcg gcttctcgat cgccatcgcg gacgaccagg ccatcgacgg cggcgaggcg    840

atggtcgcgc gcaagcgggc gatcgcttac gagccgtggt atcgggcgat cgccgaacac    900

ggcgacttca tcggcgtgca gacctatacg cgcgagctga tcggccctga cggcgtgcgc    960

ccgccgccaa aagacgccac cttcacgtcg gcgcacatgg agtattaccc gcaggcgctg   1020

gaggcgacga tccgctacac ggcgcagcac gtgaagctgc cgatctacgt gacggaaaac   1080

ggcatctcga cggacgacga tacccagcgc atcgcgtaca tccgcacggc cgtgggaggt   1140

gttgcgaact gcctgaaaga cgggattccc gtgaagagct atatccactg gtcgctgctc   1200

gacaacttcg agtggatctt cggttacggc ccgcactacg gcctgatcgg cgtcgaccgc   1260

gcgacgatga aacgcaccgt caagccgagc gcgcgcgtgc tcgggaagat cgggcaggcg   1320

aacggcatcg cggcgtga                                                 1338

<210> 424
<211> 445
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (32)...(443)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (360)...(368)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 424
Met Asn His Ala Ala Arg Arg Arg Thr Leu Leu Gly Leu Gly Thr Ala 
1               5                   10                  15      


Leu Ala Gly Ala Thr Leu Leu Pro Arg Gly Ala Ala Ala Ala Thr Ala 
            20                  25                  30          


Arg Gly Pro Phe Pro Gln Gly Phe Leu Trp Gly Ala Ala Ile Ala Gly 
        35                  40                  45              


His Gln Ala Glu Gly Asp Asn Val Ala Ser Asp Ala Trp Leu Leu Glu 
    50                  55                  60                  


Asn Ile Gln Pro Thr Glu Phe Lys Glu Pro Ser Gly Ala Ala Val Asp 
65                  70                  75                  80  


His Tyr Arg Leu Tyr Asp Gln Asp Ile Ala Thr Leu Ala Ser Leu Gly 
                85                  90                  95      


Leu Asn Thr Phe Arg Phe Ser Ile Glu Trp Ala Arg Val Glu Pro Val 
            100                 105                 110         


Glu Gly Met Phe Ser Val Ala Ala Leu Glu His Tyr Arg Asp Val Leu 
        115                 120                 125             


Gln Ser Cys Arg Arg His Arg Val Lys Ala Met Val Ser Phe Asn His 
    130                 135                 140                 


Phe Val Thr Pro Ala Trp Phe Ala Ala Arg Gly Gly Trp Glu Thr Asp 
145                 150                 155                 160 


Gly Ser Ala Gln Leu Tyr Ala Arg Tyr Cys Asp Lys Val Ala Arg His 
                165                 170                 175     


Leu Gly Asp Leu Ile Asp Tyr Ala Thr Thr Phe Asn Glu Pro Asn Leu 
            180                 185                 190         


Pro Arg Leu Leu Phe Gly Ile Pro Gly Pro Leu Ala Gly Met Ala Asp 
        195                 200                 205             


Asn Pro Arg Met Lys Ala Met Leu Ala Lys Ala Gly Gln Leu Ala Gly 
    210                 215                 220                 


Thr Gly Lys Trp Ser Ser Trp Ile Phe Gly Asp Phe Ala Arg Ile Glu 
225                 230                 235                 240 


Ala Gly Leu Leu Gln Ala His Ala Ala Gly Tyr Ala Ala Ile Lys Ala 
                245                 250                 255     


Val Arg Pro Gln Leu Pro Val Gly Phe Ser Ile Ala Ile Ala Asp Asp 
            260                 265                 270         


Gln Ala Ile Asp Gly Gly Glu Ala Met Val Ala Arg Lys Arg Ala Ile 
        275                 280                 285             


Ala Tyr Glu Pro Trp Tyr Arg Ala Ile Ala Glu His Gly Asp Phe Ile 
    290                 295                 300                 


Gly Val Gln Thr Tyr Thr Arg Glu Leu Ile Gly Pro Asp Gly Val Arg 
305                 310                 315                 320 


Pro Pro Pro Lys Asp Ala Thr Phe Thr Ser Ala His Met Glu Tyr Tyr 
                325                 330                 335     


Pro Gln Ala Leu Glu Ala Thr Ile Arg Tyr Thr Ala Gln His Val Lys 
            340                 345                 350         


Leu Pro Ile Tyr Val Thr Glu Asn Gly Ile Ser Thr Asp Asp Asp Thr 
        355                 360                 365             


Gln Arg Ile Ala Tyr Ile Arg Thr Ala Val Gly Gly Val Ala Asn Cys 
    370                 375                 380                 


Leu Lys Asp Gly Ile Pro Val Lys Ser Tyr Ile His Trp Ser Leu Leu 
385                 390                 395                 400 


Asp Asn Phe Glu Trp Ile Phe Gly Tyr Gly Pro His Tyr Gly Leu Ile 
                405                 410                 415     


Gly Val Asp Arg Ala Thr Met Lys Arg Thr Val Lys Pro Ser Ala Arg 
            420                 425                 430         


Val Leu Gly Lys Ile Gly Gln Ala Asn Gly Ile Ala Ala 
        435                 440                 445 


<210> 425
<211> 1704
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 425
atggatttgc aggccaaacc gttttatctc gatacccgac aagaacaatg ggtccatgcg     60

acccgtgatg cgatgacttt ggaggagaag atcgggcagt tgttctgccc gattggcatc    120

acgtatgatg agagcgagct tgcaaccttg cttggtacgg tgcccattgg cggaatcatg    180

tttcgtccag gtccggccgc ccaagtgcag gcagctcacc gattcctgca agagcggagt    240

aagacacccc tacttttggc tgcgaaccta gaatcgggag ggattgggac agcttccgaa    300

ggtacgctgt ttgggaccca aatgcaggta gcagctaccg gtcaggtggc catggccgaa    360

acgttgggac gcattgctgg acgcgaggga agagccgtgg gtttgaactg ggcttttgca    420

ccagtaattg acatcgattg gaatttccgc aacccgataa caaacacacg cacctatggc    480

tcagatcccg acacggtgcg caaaatggga agtgcctata tgaaggcgct tcatgaagaa    540

ggcttggccg tgtcaatcaa acattttcct ggtgacggcg tggacgagcg agaccagcat    600

ttgctaccgt cggtcaatga tctatcacca gacgcctggc acgcatcgtt tggagccatc    660

tatcgagctc tcattgacca aggtgctcag actgtcatga tcggtcatat tctgcttccg    720

aaggtgcagc agtttttgcg accagagatg cgagacgagg atgtaatgcc agcgaccttg    780

gcccccgagc tcctgcagga ccttcttcgt gaggaactag ggtttaacgg catgatcgtg    840

accgacgcaa caccaatggc tggttttatg caaatgatgc cacgttctca ggctgttcct    900

cttagcatcg ctgccggatg cgatatgttt ttgttcaatc agaacttgga ggaagattac    960

cggtttatga tggacggcat tgccgatggt ttgctgaccg aagaaagggt agatgcggct   1020

gttacgagga tcttggccct gaaggcggcc ctcggactgc cagagcaaaa ggaaaccgga   1080

acgttggtcc cgggacccga agccttggat gtaatcggct gcgaggagca tgagcgctgg   1140

gcagaggaat gtgctgacca aagcgtgaca ttggttaaag atacccagga tctcctgccg   1200

ttatcaccaa aaaggcacag gcgcattctc ctttacgttc tcggcgatgc cgatgttcct   1260

ggtgcccatt caggcggtac gagtcgacat cctcagttca tcgagttaat ggagaaggcg   1320

ggttttaaaa taacggtttt cgatccagca gaaggactca ctttccggcg caaatctact   1380

gctgacattg tgggcgagta cgacgttgcc atttacttcg ctaatattgg cacgtacagc   1440

aaccaaaccg tgattcgagt aaactgggct ccaccgatgg gcgtggatgt acctcggttc   1500

attcatgaac tgccgactat gttcatttcc gttagtggtc cgtaccattt gcaggatgta   1560

ccacggatca agaccttcat caatggctat acgccaagcc cgcaggtagt gacggcggta   1620

gtggacaaga tcctcggtcg cagtgagttc aaaggtacga gtccagtaga tccctactgc   1680

ggtctctggg acacgtacct atag                                          1704

<210> 426
<211> 567
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (74)...(312)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> SITE
<222> (488)...(491)
<223> N-glycosylation site. Prosite id = PS00001

<400> 426
Met Asp Leu Gln Ala Lys Pro Phe Tyr Leu Asp Thr Arg Gln Glu Gln 
1               5                   10                  15      


Trp Val His Ala Thr Arg Asp Ala Met Thr Leu Glu Glu Lys Ile Gly 
            20                  25                  30          


Gln Leu Phe Cys Pro Ile Gly Ile Thr Tyr Asp Glu Ser Glu Leu Ala 
        35                  40                  45              


Thr Leu Leu Gly Thr Val Pro Ile Gly Gly Ile Met Phe Arg Pro Gly 
    50                  55                  60                  


Pro Ala Ala Gln Val Gln Ala Ala His Arg Phe Leu Gln Glu Arg Ser 
65                  70                  75                  80  


Lys Thr Pro Leu Leu Leu Ala Ala Asn Leu Glu Ser Gly Gly Ile Gly 
                85                  90                  95      


Thr Ala Ser Glu Gly Thr Leu Phe Gly Thr Gln Met Gln Val Ala Ala 
            100                 105                 110         


Thr Gly Gln Val Ala Met Ala Glu Thr Leu Gly Arg Ile Ala Gly Arg 
        115                 120                 125             


Glu Gly Arg Ala Val Gly Leu Asn Trp Ala Phe Ala Pro Val Ile Asp 
    130                 135                 140                 


Ile Asp Trp Asn Phe Arg Asn Pro Ile Thr Asn Thr Arg Thr Tyr Gly 
145                 150                 155                 160 


Ser Asp Pro Asp Thr Val Arg Lys Met Gly Ser Ala Tyr Met Lys Ala 
                165                 170                 175     


Leu His Glu Glu Gly Leu Ala Val Ser Ile Lys His Phe Pro Gly Asp 
            180                 185                 190         


Gly Val Asp Glu Arg Asp Gln His Leu Leu Pro Ser Val Asn Asp Leu 
        195                 200                 205             


Ser Pro Asp Ala Trp His Ala Ser Phe Gly Ala Ile Tyr Arg Ala Leu 
    210                 215                 220                 


Ile Asp Gln Gly Ala Gln Thr Val Met Ile Gly His Ile Leu Leu Pro 
225                 230                 235                 240 


Lys Val Gln Gln Phe Leu Arg Pro Glu Met Arg Asp Glu Asp Val Met 
                245                 250                 255     


Pro Ala Thr Leu Ala Pro Glu Leu Leu Gln Asp Leu Leu Arg Glu Glu 
            260                 265                 270         


Leu Gly Phe Asn Gly Met Ile Val Thr Asp Ala Thr Pro Met Ala Gly 
        275                 280                 285             


Phe Met Gln Met Met Pro Arg Ser Gln Ala Val Pro Leu Ser Ile Ala 
    290                 295                 300                 


Ala Gly Cys Asp Met Phe Leu Phe Asn Gln Asn Leu Glu Glu Asp Tyr 
305                 310                 315                 320 


Arg Phe Met Met Asp Gly Ile Ala Asp Gly Leu Leu Thr Glu Glu Arg 
                325                 330                 335     


Val Asp Ala Ala Val Thr Arg Ile Leu Ala Leu Lys Ala Ala Leu Gly 
            340                 345                 350         


Leu Pro Glu Gln Lys Glu Thr Gly Thr Leu Val Pro Gly Pro Glu Ala 
        355                 360                 365             


Leu Asp Val Ile Gly Cys Glu Glu His Glu Arg Trp Ala Glu Glu Cys 
    370                 375                 380                 


Ala Asp Gln Ser Val Thr Leu Val Lys Asp Thr Gln Asp Leu Leu Pro 
385                 390                 395                 400 


Leu Ser Pro Lys Arg His Arg Arg Ile Leu Leu Tyr Val Leu Gly Asp 
                405                 410                 415     


Ala Asp Val Pro Gly Ala His Ser Gly Gly Thr Ser Arg His Pro Gln 
            420                 425                 430         


Phe Ile Glu Leu Met Glu Lys Ala Gly Phe Lys Ile Thr Val Phe Asp 
        435                 440                 445             


Pro Ala Glu Gly Leu Thr Phe Arg Arg Lys Ser Thr Ala Asp Ile Val 
    450                 455                 460                 


Gly Glu Tyr Asp Val Ala Ile Tyr Phe Ala Asn Ile Gly Thr Tyr Ser 
465                 470                 475                 480 


Asn Gln Thr Val Ile Arg Val Asn Trp Ala Pro Pro Met Gly Val Asp 
                485                 490                 495     


Val Pro Arg Phe Ile His Glu Leu Pro Thr Met Phe Ile Ser Val Ser 
            500                 505                 510         


Gly Pro Tyr His Leu Gln Asp Val Pro Arg Ile Lys Thr Phe Ile Asn 
        515                 520                 525             


Gly Tyr Thr Pro Ser Pro Gln Val Val Thr Ala Val Val Asp Lys Ile 
    530                 535                 540                 


Leu Gly Arg Ser Glu Phe Lys Gly Thr Ser Pro Val Asp Pro Tyr Cys 
545                 550                 555                 560 


Gly Leu Trp Asp Thr Tyr Leu 
                565         


<210> 427
<211> 954
<212> DNA
<213> Thermotoga maritima MSB8

<400> 427
atgggtgttg atccttttga aaggaacaaa atattgggaa gaggcattaa tataggaaat     60

gcgcttgaag caccaaatga gggagactgg ggagtggtga taaaagatga gttcttcgac    120

attataaaag aagccggttt ctctcatgtt cgaattccaa taagatggag tacgcacgct    180

tacgcgtttc ctccttataa aatcatggat cgcttcttca aaagagtgga tgaagtgata    240

aacggagccc tgaaaagagg actggctgtt gttataaata ttcatcacta cgaggagtta    300

atgaatgatc cagaagaaca caaggaaaga tttcttgctc tttggaaaca aattgctgat    360

cgttataaag actatcccga aactctattt tttgaaattc tgaatgaacc tcacggaaat    420

cttactccgg aaaaatggaa tgaactgctt gaggaagctc taaaagttat aagatcaatt    480

gacaaaaagc acactataat tataggcaca gctgaatggg ggggtatatc tgcccttgaa    540

aaactgtctg tcccaaaatg ggaaaaaaat tctatagtta caattcacta ctacaatcct    600

ttcgaattta cccatcaagg agctgagtgg gtggaaggat ctgagaaatg gttgggaaga    660

aagtggggat ctccagatga tcagaaacat ttgatagaag aattcaattt tatagaagaa    720

tggtcaaaaa agaacaaaag accaatttac ataggtgagt ttggtgccta cagaaaagct    780

gaccttgaat caagaataaa atggacctcc tttgtcgttc gcgaaatgga gaaaaggaga    840

tggagctggg catactggga attttgttcc ggttttggtg tttatgatac tctgagaaaa    900

acctggaata aagatctttt agaagcttta ataggaggag atagcattga ataa          954

<210> 428
<211> 317
<212> PRT
<213> Thermotoga maritima MSB8

<220> 
<221> DOMAIN
<222> (19)...(296)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 428
Met Gly Val Asp Pro Phe Glu Arg Asn Lys Ile Leu Gly Arg Gly Ile 
1               5                   10                  15      


Asn Ile Gly Asn Ala Leu Glu Ala Pro Asn Glu Gly Asp Trp Gly Val 
            20                  25                  30          


Val Ile Lys Asp Glu Phe Phe Asp Ile Ile Lys Glu Ala Gly Phe Ser 
        35                  40                  45              


His Val Arg Ile Pro Ile Arg Trp Ser Thr His Ala Tyr Ala Phe Pro 
    50                  55                  60                  


Pro Tyr Lys Ile Met Asp Arg Phe Phe Lys Arg Val Asp Glu Val Ile 
65                  70                  75                  80  


Asn Gly Ala Leu Lys Arg Gly Leu Ala Val Val Ile Asn Ile His His 
                85                  90                  95      


Tyr Glu Glu Leu Met Asn Asp Pro Glu Glu His Lys Glu Arg Phe Leu 
            100                 105                 110         


Ala Leu Trp Lys Gln Ile Ala Asp Arg Tyr Lys Asp Tyr Pro Glu Thr 
        115                 120                 125             


Leu Phe Phe Glu Ile Leu Asn Glu Pro His Gly Asn Leu Thr Pro Glu 
    130                 135                 140                 


Lys Trp Asn Glu Leu Leu Glu Glu Ala Leu Lys Val Ile Arg Ser Ile 
145                 150                 155                 160 


Asp Lys Lys His Thr Ile Ile Ile Gly Thr Ala Glu Trp Gly Gly Ile 
                165                 170                 175     


Ser Ala Leu Glu Lys Leu Ser Val Pro Lys Trp Glu Lys Asn Ser Ile 
            180                 185                 190         


Val Thr Ile His Tyr Tyr Asn Pro Phe Glu Phe Thr His Gln Gly Ala 
        195                 200                 205             


Glu Trp Val Glu Gly Ser Glu Lys Trp Leu Gly Arg Lys Trp Gly Ser 
    210                 215                 220                 


Pro Asp Asp Gln Lys His Leu Ile Glu Glu Phe Asn Phe Ile Glu Glu 
225                 230                 235                 240 


Trp Ser Lys Lys Asn Lys Arg Pro Ile Tyr Ile Gly Glu Phe Gly Ala 
                245                 250                 255     


Tyr Arg Lys Ala Asp Leu Glu Ser Arg Ile Lys Trp Thr Ser Phe Val 
            260                 265                 270         


Val Arg Glu Met Glu Lys Arg Arg Trp Ser Trp Ala Tyr Trp Glu Phe 
        275                 280                 285             


Cys Ser Gly Phe Gly Val Tyr Asp Thr Leu Arg Lys Thr Trp Asn Lys 
    290                 295                 300                 


Asp Leu Leu Glu Ala Leu Ile Gly Gly Asp Ser Ile Glu 
305                 310                 315         


<210> 429
<211> 894
<212> DNA
<213> Thermotoga maritima MSB8

<400> 429
atggctcagt gggactttaa ttttgttaga atccctatgt gtcatcttct ctggtcagac     60

cggggcaacc catttattat cagagaagat ttttttgaga aaatcgatcg tgtaattttc    120

tggggagaga aatatggaat acatatatgt atttctcttc acagggcacc tggctattct    180

gttaacaagg aagtagaaga gaaaaccaat ctgtggaaag atgaaacagc tcaagaagcg    240

ttcattcatc actggtcttt tatcgcacgt cgttacaaag gaatttcttc cacacacctg    300

agttttaact taataaatga gcctccattt cctgatccac aaatcatgag tgttgaagat    360

cacaactctc ttatcaagag aactattaca gaaattcgaa aaatagatcc cgaaagatta    420

attataatag atggattagg ctatgggaat attccagtgg atgatttaac aattgagaat    480

acagtgcaat catgcagagg gtacattccc ttcagtgtta ctcattacaa agcggaatgg    540

gtggatagta aggactttcc tgttcctgag tggccaaatg gatggcattt tggggaatac    600

tggaacagag aaaagttatt ggaacattat ttaacgtgga taaaactcag acaaaaagga    660

atagaagtat tctgtggaga aatgggagct tacaacaaaa cacctcacga tgtggtttta    720

aaatggcttg aagatctttt agaaattttt aaaactttga acatagggtt tgccttatgg    780

aattttagag gtccttttgg tattttagat tcggaaagga aagacgttga atacgaagaa    840

tggtatggac ataaactgga taggaaaatg ttggaactat tgagaaaata ttag          894

<210> 430
<211> 297
<212> PRT
<213> Thermotoga maritima MSB8

<220> 
<221> DOMAIN
<222> (1)...(271)
<223> Cellulase (glycosyl hydrolase family 5)

<400> 430
Met Ala Gln Trp Asp Phe Asn Phe Val Arg Ile Pro Met Cys His Leu 
1               5                   10                  15      


Leu Trp Ser Asp Arg Gly Asn Pro Phe Ile Ile Arg Glu Asp Phe Phe 
            20                  25                  30          


Glu Lys Ile Asp Arg Val Ile Phe Trp Gly Glu Lys Tyr Gly Ile His 
        35                  40                  45              


Ile Cys Ile Ser Leu His Arg Ala Pro Gly Tyr Ser Val Asn Lys Glu 
    50                  55                  60                  


Val Glu Glu Lys Thr Asn Leu Trp Lys Asp Glu Thr Ala Gln Glu Ala 
65                  70                  75                  80  


Phe Ile His His Trp Ser Phe Ile Ala Arg Arg Tyr Lys Gly Ile Ser 
                85                  90                  95      


Ser Thr His Leu Ser Phe Asn Leu Ile Asn Glu Pro Pro Phe Pro Asp 
            100                 105                 110         


Pro Gln Ile Met Ser Val Glu Asp His Asn Ser Leu Ile Lys Arg Thr 
        115                 120                 125             


Ile Thr Glu Ile Arg Lys Ile Asp Pro Glu Arg Leu Ile Ile Ile Asp 
    130                 135                 140                 


Gly Leu Gly Tyr Gly Asn Ile Pro Val Asp Asp Leu Thr Ile Glu Asn 
145                 150                 155                 160 


Thr Val Gln Ser Cys Arg Gly Tyr Ile Pro Phe Ser Val Thr His Tyr 
                165                 170                 175     


Lys Ala Glu Trp Val Asp Ser Lys Asp Phe Pro Val Pro Glu Trp Pro 
            180                 185                 190         


Asn Gly Trp His Phe Gly Glu Tyr Trp Asn Arg Glu Lys Leu Leu Glu 
        195                 200                 205             


His Tyr Leu Thr Trp Ile Lys Leu Arg Gln Lys Gly Ile Glu Val Phe 
    210                 215                 220                 


Cys Gly Glu Met Gly Ala Tyr Asn Lys Thr Pro His Asp Val Val Leu 
225                 230                 235                 240 


Lys Trp Leu Glu Asp Leu Leu Glu Ile Phe Lys Thr Leu Asn Ile Gly 
                245                 250                 255     


Phe Ala Leu Trp Asn Phe Arg Gly Pro Phe Gly Ile Leu Asp Ser Glu 
            260                 265                 270         


Arg Lys Asp Val Glu Tyr Glu Glu Trp Tyr Gly His Lys Leu Asp Arg 
        275                 280                 285             


Lys Met Leu Glu Leu Leu Arg Lys Tyr 
    290                 295         


<210> 431
<211> 1230
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 431
gtgaccggca ccgtgccgac cggcttcctg tggggcgtgg cgacggcggg ccaccagaac     60

gagggagaca acgtcaccag cgacacgtgg ttcctggagc acgtgcggcc cacggtgttc    120

cgggagccgt ccggcgcggc gtgcggatcg ttccggctgt gggagaccga cctcgacctc    180

gtcgcggcga tggggctgac cgcgtaccgc ttctccgtcg agtgggcgcg ggtcgagccg    240

gaggagggcc ggttctccga cgctgcgctg gcgcactacg cggcggtggt cgacggctgc    300

ctggcccgcg ggctggcgcc gatcgtgacg ctgaaccact tcaccgcgcc gcactggttc    360

gcctgccggg gcggctggct ggacccggac gcgccgcagc tgttcgcgcg ctacacggac    420

cgggtgatgg cccggttcgg cgcccgcatg tcccacgtcg tcacgctgaa cgagccgaac    480

ctgtcccggg tgctggcctg gtccgggctg cccgacgtcg tcgccgagct ggagcgcgcg    540

acgctggcag cggcgtccgc cgcggcgggc gtgccgcgct accgggtcgg caacgtcgtg    600

ctgccggagg agtacgacgc gctggccgac ggcatggcgg cggcgcacgt cgcagcgaag    660

gcggtcgtca agcggcaccg gccggacctg ccggtcgggc tgtcgctcgc cgtcgtcgac    720

gaccaggtcg cgggcgacga cgcggcgata cgagatcgca agcggcgcga cgcctacggg    780

cgctggctgg acctcgtccg cgccgacgac ttcgtcgggg tgcagaacta cgagcgcgtc    840

gtgtacgacg cgcgcggccg ggtcgagcgc cccggcccgc gcaaccagat gggctcgctg    900

atcgagccgg gctcgctcgc cggggcggtg cggtacgtgc acgaggcgac cggccgtccc    960

gtgctggtga ccgagcacgg catcgccacc gccgacgaca cgcagcgggt ggcgttcctc   1020

gcgccggcga tcgaggggct gctggccgcc gccgccgacg gcgtgcccgt gctcgggtac   1080

tgccactgga cgctgctgga caacttcgag tggatcttcg gctacgagca ccagttgggc   1140

ttgcacgagg tcgaccgcgt caccctcgcg cgcaccccga agcccagcgc ccacgagtac   1200

gcccgcctcg tcgccaccca ccacccctga                                    1230

<210> 432
<211> 409
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (1)...(409)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (24)...(27)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (162)...(165)
<223> N-glycosylation site. Prosite id = PS00001

<400> 432
Met Thr Gly Thr Val Pro Thr Gly Phe Leu Trp Gly Val Ala Thr Ala 
1               5                   10                  15      


Gly His Gln Asn Glu Gly Asp Asn Val Thr Ser Asp Thr Trp Phe Leu 
            20                  25                  30          


Glu His Val Arg Pro Thr Val Phe Arg Glu Pro Ser Gly Ala Ala Cys 
        35                  40                  45              


Gly Ser Phe Arg Leu Trp Glu Thr Asp Leu Asp Leu Val Ala Ala Met 
    50                  55                  60                  


Gly Leu Thr Ala Tyr Arg Phe Ser Val Glu Trp Ala Arg Val Glu Pro 
65                  70                  75                  80  


Glu Glu Gly Arg Phe Ser Asp Ala Ala Leu Ala His Tyr Ala Ala Val 
                85                  90                  95      


Val Asp Gly Cys Leu Ala Arg Gly Leu Ala Pro Ile Val Thr Leu Asn 
            100                 105                 110         


His Phe Thr Ala Pro His Trp Phe Ala Cys Arg Gly Gly Trp Leu Asp 
        115                 120                 125             


Pro Asp Ala Pro Gln Leu Phe Ala Arg Tyr Thr Asp Arg Val Met Ala 
    130                 135                 140                 


Arg Phe Gly Ala Arg Met Ser His Val Val Thr Leu Asn Glu Pro Asn 
145                 150                 155                 160 


Leu Ser Arg Val Leu Ala Trp Ser Gly Leu Pro Asp Val Val Ala Glu 
                165                 170                 175     


Leu Glu Arg Ala Thr Leu Ala Ala Ala Ser Ala Ala Ala Gly Val Pro 
            180                 185                 190         


Arg Tyr Arg Val Gly Asn Val Val Leu Pro Glu Glu Tyr Asp Ala Leu 
        195                 200                 205             


Ala Asp Gly Met Ala Ala Ala His Val Ala Ala Lys Ala Val Val Lys 
    210                 215                 220                 


Arg His Arg Pro Asp Leu Pro Val Gly Leu Ser Leu Ala Val Val Asp 
225                 230                 235                 240 


Asp Gln Val Ala Gly Asp Asp Ala Ala Ile Arg Asp Arg Lys Arg Arg 
                245                 250                 255     


Asp Ala Tyr Gly Arg Trp Leu Asp Leu Val Arg Ala Asp Asp Phe Val 
            260                 265                 270         


Gly Val Gln Asn Tyr Glu Arg Val Val Tyr Asp Ala Arg Gly Arg Val 
        275                 280                 285             


Glu Arg Pro Gly Pro Arg Asn Gln Met Gly Ser Leu Ile Glu Pro Gly 
    290                 295                 300                 


Ser Leu Ala Gly Ala Val Arg Tyr Val His Glu Ala Thr Gly Arg Pro 
305                 310                 315                 320 


Val Leu Val Thr Glu His Gly Ile Ala Thr Ala Asp Asp Thr Gln Arg 
                325                 330                 335     


Val Ala Phe Leu Ala Pro Ala Ile Glu Gly Leu Leu Ala Ala Ala Ala 
            340                 345                 350         


Asp Gly Val Pro Val Leu Gly Tyr Cys His Trp Thr Leu Leu Asp Asn 
        355                 360                 365             


Phe Glu Trp Ile Phe Gly Tyr Glu His Gln Leu Gly Leu His Glu Val 
    370                 375                 380                 


Asp Arg Val Thr Leu Ala Arg Thr Pro Lys Pro Ser Ala His Glu Tyr 
385                 390                 395                 400 


Ala Arg Leu Val Ala Thr His His Pro 
                405                 


<210> 433
<211> 1212
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 433
atgtcttcga atcagtatga ttacatcata gttggagctg gtctttctgg cggaatttta     60

gctagattgt tagcagagag tttagacaaa aggatcttaa tcgtagacag aagaaatcac    120

atttcaggta acatatatga ttttgttgac tcatgtggta ttaaggttca aaagtatgga    180

ccacatgtat tccatacaaa ttctgacgat gtttataatt ttatttctaa atattgtgag    240

cctgtaaaat atcgtaccaa atgtgaagct gtcatagatg gaattagcac accatctcct    300

tttaatttta aaactatcga tcaattttac gataaagaaa aagcacaaat cttaaagaat    360

aagttgcagt cttactatcc aaatgtgcat tcagtgactg ttgttgatat gttgaattca    420

tctgattcag atatcaaaag tcacgctcaa tttttatttg ataaagatta caaactatat    480

acagcaaaac agtggaactt aagcccagat gaaattgatc cttctgtatt aaaaagagtt    540

ccaattgaac tgtcttatga tgatacttat tttcatgata aatatgaatt catgcctaag    600

gatggatttc tcgaattcta taattgttta gttagccata aaaacattga aattaagact    660

aacatagaag cgcttgaaca tatatcattt gatgaagctg aacattctgt tatgtgggat    720

gataggttag taaacttgat ttatactggt gctattgatg agttgtttca gtgtaaattt    780

ggagttcttc cttatagatc cttgtgtttt aaatacgctc atccgaagac atgctcttat    840

caaaatgtgt ctattgttgc ttaccctcag gttgaaggct acacacgtat aactgagtac    900

actaagatgc cttaccagga ttgtaatgga attacaacaa ttgcttatga gtatcccatt    960

gagtatcaag ctaattttga aaatgataat aaatcatcag atgtaactga gccatactac   1020

cctgttttaa cagaaaatag tcaaaagata tttagcctct ataaaagtta tgccgataga   1080

tttaaaaatt taaccttatg tgggcgattg gctgacttca aatactacaa tatggatcaa   1140

gttgtattaa gagcttttga tgtatacgaa tcattaagga aaaaagaaaa tgttgcaaaa   1200

gtgggaattt aa                                                       1212

<210> 434
<211> 403
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (155)...(375)
<223> UDP-galactopyranose mutase

<220> 
<221> SITE
<222> (141)...(144)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (210)...(216)
<223> Immunoglobulins and major histocompatibility complex proteins signature. Prosite id = PS00290

<220> 
<221> SITE
<222> (286)...(289)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (335)...(338)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (368)...(371)
<223> N-glycosylation site. Prosite id = PS00001

<400> 434
Met Ser Ser Asn Gln Tyr Asp Tyr Ile Ile Val Gly Ala Gly Leu Ser 
1               5                   10                  15      


Gly Gly Ile Leu Ala Arg Leu Leu Ala Glu Ser Leu Asp Lys Arg Ile 
            20                  25                  30          


Leu Ile Val Asp Arg Arg Asn His Ile Ser Gly Asn Ile Tyr Asp Phe 
        35                  40                  45              


Val Asp Ser Cys Gly Ile Lys Val Gln Lys Tyr Gly Pro His Val Phe 
    50                  55                  60                  


His Thr Asn Ser Asp Asp Val Tyr Asn Phe Ile Ser Lys Tyr Cys Glu 
65                  70                  75                  80  


Pro Val Lys Tyr Arg Thr Lys Cys Glu Ala Val Ile Asp Gly Ile Ser 
                85                  90                  95      


Thr Pro Ser Pro Phe Asn Phe Lys Thr Ile Asp Gln Phe Tyr Asp Lys 
            100                 105                 110         


Glu Lys Ala Gln Ile Leu Lys Asn Lys Leu Gln Ser Tyr Tyr Pro Asn 
        115                 120                 125             


Val His Ser Val Thr Val Val Asp Met Leu Asn Ser Ser Asp Ser Asp 
    130                 135                 140                 


Ile Lys Ser His Ala Gln Phe Leu Phe Asp Lys Asp Tyr Lys Leu Tyr 
145                 150                 155                 160 


Thr Ala Lys Gln Trp Asn Leu Ser Pro Asp Glu Ile Asp Pro Ser Val 
                165                 170                 175     


Leu Lys Arg Val Pro Ile Glu Leu Ser Tyr Asp Asp Thr Tyr Phe His 
            180                 185                 190         


Asp Lys Tyr Glu Phe Met Pro Lys Asp Gly Phe Leu Glu Phe Tyr Asn 
        195                 200                 205             


Cys Leu Val Ser His Lys Asn Ile Glu Ile Lys Thr Asn Ile Glu Ala 
    210                 215                 220                 


Leu Glu His Ile Ser Phe Asp Glu Ala Glu His Ser Val Met Trp Asp 
225                 230                 235                 240 


Asp Arg Leu Val Asn Leu Ile Tyr Thr Gly Ala Ile Asp Glu Leu Phe 
                245                 250                 255     


Gln Cys Lys Phe Gly Val Leu Pro Tyr Arg Ser Leu Cys Phe Lys Tyr 
            260                 265                 270         


Ala His Pro Lys Thr Cys Ser Tyr Gln Asn Val Ser Ile Val Ala Tyr 
        275                 280                 285             


Pro Gln Val Glu Gly Tyr Thr Arg Ile Thr Glu Tyr Thr Lys Met Pro 
    290                 295                 300                 


Tyr Gln Asp Cys Asn Gly Ile Thr Thr Ile Ala Tyr Glu Tyr Pro Ile 
305                 310                 315                 320 


Glu Tyr Gln Ala Asn Phe Glu Asn Asp Asn Lys Ser Ser Asp Val Thr 
                325                 330                 335     


Glu Pro Tyr Tyr Pro Val Leu Thr Glu Asn Ser Gln Lys Ile Phe Ser 
            340                 345                 350         


Leu Tyr Lys Ser Tyr Ala Asp Arg Phe Lys Asn Leu Thr Leu Cys Gly 
        355                 360                 365             


Arg Leu Ala Asp Phe Lys Tyr Tyr Asn Met Asp Gln Val Val Leu Arg 
    370                 375                 380                 


Ala Phe Asp Val Tyr Glu Ser Leu Arg Lys Lys Glu Asn Val Ala Lys 
385                 390                 395                 400 


Val Gly Ile 
            


<210> 435
<211> 1200
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 435
atgttttccg tcaatcgcca tgattttggg ccgaacttca ctttcggcgt cgccaccgcc     60

agctatcaga tcgagggcaa cgccaccggc gagggccgcg gttcctcgat ctgggatacg    120

ttttcggcca ctcccggtaa cgtcgagggc ggcgacaccg gcgccgttgc cgatgaccat    180

tataaccgct gggaggaaga cctcgacctc atccgcgacg gcggctttga cgcctaccgc    240

ttctcggtag cgtggccgcg gctgctgccc gagggtatcg gcgccatcaa ccaggccggc    300

atcgacttct acgaccgcct gatcgacggg atgctggcgc gtggcatcaa gccattcatg    360

acgctgtatc attgggatct gccgtcggcg ctgcaggaca agggcggctg gatgaaccgc    420

gatatcgccg gctggctcgg cgaatacgct gccctctgcg gcaagcattt cggtgatcgg    480

gtcgccgcca ccgccaccat caacgagccg tggtgcgtgg ccttcctcag ccatttcctc    540

ggcattcatg cccccggcct gcgcgacatg cgcgccgcgg cccgcgccat gcaccatgtg    600

ctttatgccc acggccacgc cgtcgccgcg ctgcgcgccg agggcgtcaa gaacatcggc    660

atcgtcacca atctgcagaa gtgcgagccg gtatcgggga gcgacgccga ccgcgaggcg    720

accgacctct tcgacggcat cttcaacggc tggtatctgg gcggcctcta caaggggcag    780

tacccggcca acgtagtgaa aatgttcgaa aaatacctgc ccgccggctt cgagcgcgac    840

atggacaagg tctcgacgcc gctcgactgg gccggcgtca actattattc gcgcacgctc    900

gttgccgccg atccgagcgg ccccaccggc ttcaagaccg tcgagggcaa gctggagaag    960

accgatatcg gctgggagat ctatccccag ggcctcaccg acctgctggt ccgggtgtcg   1020

cgcgactaca ccaaggtgcc gatctacgtc accgagaacg gcatggcgga agtcgacggc   1080

gagacggatc cgcgccgcgt cacctattat gaagatcacc tcaaggcgct gctcgccgcc   1140

cgcgccgccg gcgtcgacgt gcgcggctat ttcgcctggt cgctgatgga caatttctag   1200

<210> 436
<211> 399
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (5)...(399)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (12)...(15)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (27)...(30)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (353)...(361)
<223> Glycosyl hydrolases family 1 active site. Prosite id = PS00572

<400> 436
Met Phe Ser Val Asn Arg His Asp Phe Gly Pro Asn Phe Thr Phe Gly 
1               5                   10                  15      


Val Ala Thr Ala Ser Tyr Gln Ile Glu Gly Asn Ala Thr Gly Glu Gly 
            20                  25                  30          


Arg Gly Ser Ser Ile Trp Asp Thr Phe Ser Ala Thr Pro Gly Asn Val 
        35                  40                  45              


Glu Gly Gly Asp Thr Gly Ala Val Ala Asp Asp His Tyr Asn Arg Trp 
    50                  55                  60                  


Glu Glu Asp Leu Asp Leu Ile Arg Asp Gly Gly Phe Asp Ala Tyr Arg 
65                  70                  75                  80  


Phe Ser Val Ala Trp Pro Arg Leu Leu Pro Glu Gly Ile Gly Ala Ile 
                85                  90                  95      


Asn Gln Ala Gly Ile Asp Phe Tyr Asp Arg Leu Ile Asp Gly Met Leu 
            100                 105                 110         


Ala Arg Gly Ile Lys Pro Phe Met Thr Leu Tyr His Trp Asp Leu Pro 
        115                 120                 125             


Ser Ala Leu Gln Asp Lys Gly Gly Trp Met Asn Arg Asp Ile Ala Gly 
    130                 135                 140                 


Trp Leu Gly Glu Tyr Ala Ala Leu Cys Gly Lys His Phe Gly Asp Arg 
145                 150                 155                 160 


Val Ala Ala Thr Ala Thr Ile Asn Glu Pro Trp Cys Val Ala Phe Leu 
                165                 170                 175     


Ser His Phe Leu Gly Ile His Ala Pro Gly Leu Arg Asp Met Arg Ala 
            180                 185                 190         


Ala Ala Arg Ala Met His His Val Leu Tyr Ala His Gly His Ala Val 
        195                 200                 205             


Ala Ala Leu Arg Ala Glu Gly Val Lys Asn Ile Gly Ile Val Thr Asn 
    210                 215                 220                 


Leu Gln Lys Cys Glu Pro Val Ser Gly Ser Asp Ala Asp Arg Glu Ala 
225                 230                 235                 240 


Thr Asp Leu Phe Asp Gly Ile Phe Asn Gly Trp Tyr Leu Gly Gly Leu 
                245                 250                 255     


Tyr Lys Gly Gln Tyr Pro Ala Asn Val Val Lys Met Phe Glu Lys Tyr 
            260                 265                 270         


Leu Pro Ala Gly Phe Glu Arg Asp Met Asp Lys Val Ser Thr Pro Leu 
        275                 280                 285             


Asp Trp Ala Gly Val Asn Tyr Tyr Ser Arg Thr Leu Val Ala Ala Asp 
    290                 295                 300                 


Pro Ser Gly Pro Thr Gly Phe Lys Thr Val Glu Gly Lys Leu Glu Lys 
305                 310                 315                 320 


Thr Asp Ile Gly Trp Glu Ile Tyr Pro Gln Gly Leu Thr Asp Leu Leu 
                325                 330                 335     


Val Arg Val Ser Arg Asp Tyr Thr Lys Val Pro Ile Tyr Val Thr Glu 
            340                 345                 350         


Asn Gly Met Ala Glu Val Asp Gly Glu Thr Asp Pro Arg Arg Val Thr 
        355                 360                 365             


Tyr Tyr Glu Asp His Leu Lys Ala Leu Leu Ala Ala Arg Ala Ala Gly 
    370                 375                 380                 


Val Asp Val Arg Gly Tyr Phe Ala Trp Ser Leu Met Asp Asn Phe 
385                 390                 395                 


<210> 437
<211> 1443
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 437
atgacgacct tcaacgtcag tgccgtggca accgccccgg cccctactgc ctccacgacc     60

cgacccgccg ccgcatcggc aggttgcgcc cccggtggac tgcccaccgc ggcggcccgc    120

cagttccccg ccgacttcgt ctgggcggtc gccaccagcg ccttccagat cgaaggcgcg    180

gccgaccggg atggcaaggg cccgtccatc tgggacacct tctgccgcca gcccggcgcc    240

attgccgaca acagccatgg cgacatcgcc tgcgaacact acgaccgctg ggaagccgac    300

ctggacctga tccagagcct gggcgcccag gcctatcgct tctccatctc ctggccgcgg    360

gtgcgtgcca ccggcgacgg cccgtggaat gaagccggcc tggcttttta cgagaaactg    420

gtggacggca tcaatgcgcg cggcatgaag gcctatgtga cgctcaatca ctgggacctg    480

ccgcaagcgc tgcaggacat cggcggctgg ggaaaccgcg ccaccgtgga ccgctttgtt    540

gaatatgccg aagccatcgg ccgccggatc ggccacaagg tggcctccat cgcgacgcac    600

aacgagcctt gggtggtggc gcagctgggc catgaggtgg gcatcttcgc gcctggcctg    660

aaggaccgcc gcctcgcggc ccaggtctcg caccacctgc tgctcagcca cggccgcgcc    720

gtgcggcgcc tgcgtgccct ggagctgccg gcctcgctgg gcatcgtgct caacctctcg    780

ccgatctatc ccgccacgga caccccggaa gaccgggcca aggcccgcct cgaagacggc    840

aagctgcgcc ggtggtacat ggacccgctg ttcaagggcc actatccgca ggatgtgctg    900

gatcacctgg gcgatgacgc accgcaggtg caagacggcg acatggccga cattcagcag    960

ccgatcgatt tcgtgggggt gaactactac tcccgcggca tggccagcgc cgacaacagc   1020

ttcgattcaa agaccagcgg cctgccgctg accgccatgg gctgggaggt ctatccccag   1080

ggcctgaccg acctgctggt ctggctgcac cgcgactatc ccgaagcgaa gcggctgtac   1140

gtgaccgaaa acggcggcgc cttccctgat gtcgtgggcg ccgacggccg cgtgcacgat   1200

gccgaccgca ccagttatct ggacacccac atcgccgccg tcggagacgc catcgcccag   1260

ggcgtgccga tgggcggcta catggtgtgg agcctgctcg acaacttcga atgggcctcc   1320

ggctacgaga agcgcttcgg catcgtccac gtggactacg ccacgcagaa gcgcaccccc   1380

aaggacagcg ccctggcctt ccgcgatttc gtgcgcggac tcaagccggc ggcccaggac   1440

tga                                                                 1443

<210> 438
<211> 480
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (38)...(476)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (5)...(8)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (46)...(60)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (261)...(264)
<223> N-glycosylation site. Prosite id = PS00001

<400> 438
Met Thr Thr Phe Asn Val Ser Ala Val Ala Thr Ala Pro Ala Pro Thr 
1               5                   10                  15      


Ala Ser Thr Thr Arg Pro Ala Ala Ala Ser Ala Gly Cys Ala Pro Gly 
            20                  25                  30          


Gly Leu Pro Thr Ala Ala Ala Arg Gln Phe Pro Ala Asp Phe Val Trp 
        35                  40                  45              


Ala Val Ala Thr Ser Ala Phe Gln Ile Glu Gly Ala Ala Asp Arg Asp 
    50                  55                  60                  


Gly Lys Gly Pro Ser Ile Trp Asp Thr Phe Cys Arg Gln Pro Gly Ala 
65                  70                  75                  80  


Ile Ala Asp Asn Ser His Gly Asp Ile Ala Cys Glu His Tyr Asp Arg 
                85                  90                  95      


Trp Glu Ala Asp Leu Asp Leu Ile Gln Ser Leu Gly Ala Gln Ala Tyr 
            100                 105                 110         


Arg Phe Ser Ile Ser Trp Pro Arg Val Arg Ala Thr Gly Asp Gly Pro 
        115                 120                 125             


Trp Asn Glu Ala Gly Leu Ala Phe Tyr Glu Lys Leu Val Asp Gly Ile 
    130                 135                 140                 


Asn Ala Arg Gly Met Lys Ala Tyr Val Thr Leu Asn His Trp Asp Leu 
145                 150                 155                 160 


Pro Gln Ala Leu Gln Asp Ile Gly Gly Trp Gly Asn Arg Ala Thr Val 
                165                 170                 175     


Asp Arg Phe Val Glu Tyr Ala Glu Ala Ile Gly Arg Arg Ile Gly His 
            180                 185                 190         


Lys Val Ala Ser Ile Ala Thr His Asn Glu Pro Trp Val Val Ala Gln 
        195                 200                 205             


Leu Gly His Glu Val Gly Ile Phe Ala Pro Gly Leu Lys Asp Arg Arg 
    210                 215                 220                 


Leu Ala Ala Gln Val Ser His His Leu Leu Leu Ser His Gly Arg Ala 
225                 230                 235                 240 


Val Arg Arg Leu Arg Ala Leu Glu Leu Pro Ala Ser Leu Gly Ile Val 
                245                 250                 255     


Leu Asn Leu Ser Pro Ile Tyr Pro Ala Thr Asp Thr Pro Glu Asp Arg 
            260                 265                 270         


Ala Lys Ala Arg Leu Glu Asp Gly Lys Leu Arg Arg Trp Tyr Met Asp 
        275                 280                 285             


Pro Leu Phe Lys Gly His Tyr Pro Gln Asp Val Leu Asp His Leu Gly 
    290                 295                 300                 


Asp Asp Ala Pro Gln Val Gln Asp Gly Asp Met Ala Asp Ile Gln Gln 
305                 310                 315                 320 


Pro Ile Asp Phe Val Gly Val Asn Tyr Tyr Ser Arg Gly Met Ala Ser 
                325                 330                 335     


Ala Asp Asn Ser Phe Asp Ser Lys Thr Ser Gly Leu Pro Leu Thr Ala 
            340                 345                 350         


Met Gly Trp Glu Val Tyr Pro Gln Gly Leu Thr Asp Leu Leu Val Trp 
        355                 360                 365             


Leu His Arg Asp Tyr Pro Glu Ala Lys Arg Leu Tyr Val Thr Glu Asn 
    370                 375                 380                 


Gly Gly Ala Phe Pro Asp Val Val Gly Ala Asp Gly Arg Val His Asp 
385                 390                 395                 400 


Ala Asp Arg Thr Ser Tyr Leu Asp Thr His Ile Ala Ala Val Gly Asp 
                405                 410                 415     


Ala Ile Ala Gln Gly Val Pro Met Gly Gly Tyr Met Val Trp Ser Leu 
            420                 425                 430         


Leu Asp Asn Phe Glu Trp Ala Ser Gly Tyr Glu Lys Arg Phe Gly Ile 
        435                 440                 445             


Val His Val Asp Tyr Ala Thr Gln Lys Arg Thr Pro Lys Asp Ser Ala 
    450                 455                 460                 


Leu Ala Phe Arg Asp Phe Val Arg Gly Leu Lys Pro Ala Ala Gln Asp 
465                 470                 475                 480 


<210> 439
<211> 1254
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 439
atgactcaca aaaccaagtc gattgcatcg ctttccctca tccttatgct gctggctgtg     60

ccgttggcgc tggcgtcgtc ggtaccggtg gccgaaccgg aagacgttgg cttttccgcc    120

gagcggctgc aaaggataca cgagttggtc gaccgccatc tggaggcggg gagtttcgcc    180

ggcgctgtca ctctggtggc gcggcacggc catattgcgc atttcgaggc ccacggcctg    240

atggacctcg aaacgagaaa gccgatggcg aaggatgcca ttttccgcat catgtcgatg    300

acgaagcctg tggtcggcgt cgccgtcctg atgctgatgg aagaaggaaa agttcgcctc    360

aacgacccct tgtcgagatt catacccgag cttaaagacc tgaaggtcgc tgtcgttcag    420

ggcgagagtg cggccggagc tgaatctcgc ttttacacaa tcccggcgga ccgtgaaatt    480

acggttcggg atctgctgac tcacacatcc ggcttcgtca gcggcggcac cagcaatcgc    540

gaagcccagc gcgccagcgg cggcattcga ccgaacgaca cgctggccac gtacctgccg    600

cggctggcag cggctccgct cgagttccag ccgggaacgc gctgggccta tagcgcggcg    660

gcgggattcg acgcgctggc acgcgtggta gaggtcgcct ccgggcagac cttcgacgaa    720

tttgcgcggc aacgcttatt cgaaccgctt ggaatgaagg acaccttctt ctacgccgcc    780

gatccgccgg tggaccgctt tgcaacgctg tatcgacgga ctgaaagcgg attggagaag    840

cagcaagatc cgaatttcat gaacggcaca tacctttccg ggggaggagg actgttcagc    900

accgccgaag actatcttca gttcgggcag atgctggtca atggcggcga gttgaacgga    960

acacggctgc tcagtccgag gaccgtcgag ttgatgcggt ccgtctacgc tcccgacagc   1020

ctgccgggac ggccagcggg cgaaggctat ggcctgagcg tccgcgtgat caacgattcg   1080

attcaaagag ggaccatgct ctccgacggc gccttcggtt ggagcggcgc ctttgggacc   1140

cacttctggg tcgatccgaa ggagaaggtc gtcggtattt ttatgaccca aacctcgacg   1200

ccggcaatcc ggccggatct ggagaatgcc gtgatgcagg ccatcatgga gtag         1254

<210> 440
<211> 417
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (46)...(417)
<223> Beta-lactamase

<220> 
<221> SITE
<222> (194)...(197)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (292)...(295)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (323)...(326)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (363)...(366)
<223> N-glycosylation site. Prosite id = PS00001

<400> 440
Met Thr His Lys Thr Lys Ser Ile Ala Ser Leu Ser Leu Ile Leu Met 
1               5                   10                  15      


Leu Leu Ala Val Pro Leu Ala Leu Ala Ser Ser Val Pro Val Ala Glu 
            20                  25                  30          


Pro Glu Asp Val Gly Phe Ser Ala Glu Arg Leu Gln Arg Ile His Glu 
        35                  40                  45              


Leu Val Asp Arg His Leu Glu Ala Gly Ser Phe Ala Gly Ala Val Thr 
    50                  55                  60                  


Leu Val Ala Arg His Gly His Ile Ala His Phe Glu Ala His Gly Leu 
65                  70                  75                  80  


Met Asp Leu Glu Thr Arg Lys Pro Met Ala Lys Asp Ala Ile Phe Arg 
                85                  90                  95      


Ile Met Ser Met Thr Lys Pro Val Val Gly Val Ala Val Leu Met Leu 
            100                 105                 110         


Met Glu Glu Gly Lys Val Arg Leu Asn Asp Pro Leu Ser Arg Phe Ile 
        115                 120                 125             


Pro Glu Leu Lys Asp Leu Lys Val Ala Val Val Gln Gly Glu Ser Ala 
    130                 135                 140                 


Ala Gly Ala Glu Ser Arg Phe Tyr Thr Ile Pro Ala Asp Arg Glu Ile 
145                 150                 155                 160 


Thr Val Arg Asp Leu Leu Thr His Thr Ser Gly Phe Val Ser Gly Gly 
                165                 170                 175     


Thr Ser Asn Arg Glu Ala Gln Arg Ala Ser Gly Gly Ile Arg Pro Asn 
            180                 185                 190         


Asp Thr Leu Ala Thr Tyr Leu Pro Arg Leu Ala Ala Ala Pro Leu Glu 
        195                 200                 205             


Phe Gln Pro Gly Thr Arg Trp Ala Tyr Ser Ala Ala Ala Gly Phe Asp 
    210                 215                 220                 


Ala Leu Ala Arg Val Val Glu Val Ala Ser Gly Gln Thr Phe Asp Glu 
225                 230                 235                 240 


Phe Ala Arg Gln Arg Leu Phe Glu Pro Leu Gly Met Lys Asp Thr Phe 
                245                 250                 255     


Phe Tyr Ala Ala Asp Pro Pro Val Asp Arg Phe Ala Thr Leu Tyr Arg 
            260                 265                 270         


Arg Thr Glu Ser Gly Leu Glu Lys Gln Gln Asp Pro Asn Phe Met Asn 
        275                 280                 285             


Gly Thr Tyr Leu Ser Gly Gly Gly Gly Leu Phe Ser Thr Ala Glu Asp 
    290                 295                 300                 


Tyr Leu Gln Phe Gly Gln Met Leu Val Asn Gly Gly Glu Leu Asn Gly 
305                 310                 315                 320 


Thr Arg Leu Leu Ser Pro Arg Thr Val Glu Leu Met Arg Ser Val Tyr 
                325                 330                 335     


Ala Pro Asp Ser Leu Pro Gly Arg Pro Ala Gly Glu Gly Tyr Gly Leu 
            340                 345                 350         


Ser Val Arg Val Ile Asn Asp Ser Ile Gln Arg Gly Thr Met Leu Ser 
        355                 360                 365             


Asp Gly Ala Phe Gly Trp Ser Gly Ala Phe Gly Thr His Phe Trp Val 
    370                 375                 380                 


Asp Pro Lys Glu Lys Val Val Gly Ile Phe Met Thr Gln Thr Ser Thr 
385                 390                 395                 400 


Pro Ala Ile Arg Pro Asp Leu Glu Asn Ala Val Met Gln Ala Ile Met 
                405                 410                 415     


Glu 
    

<210> 441
<211> 2025
<212> DNA
<213> Thermotoga maritima MSB8

<400> 441
gtggactaca ggatgtgctg gctggagtac agaggtttac cagctgatgt cgccggaaaa     60

ctcaaagact ggttttcaag tgtttccatt ctggaacccg gttcttcagt tttgaaagac    120

gagatcagaa gattttctga aagatcgatt ggcatcactc ccagatttta ttccagaccc    180

ttgaagaaag aaaaatacat catggtggga cgtttggaat cccttcccat caagcttgat    240

gtgaatcttg gtgaggaagg tttcatgctg agaacgattg agtggaatgg ttcgaaaatt    300

ctgcttgtaa ctggcgaaac aaagaaggcg cttgtttacg ggatttttga tttgatgaag    360

agaataagac ttggtgaaga tatcgagaag atgaacgtcc tggcgaagcc gaaagcgaaa    420

tttcgtatgc ttaaccactg ggacaacctc gatggaacga tcgagagagg atacgcaggg    480

aactccattt ttttcaaaga caacagaatt atcataaatc agagaacgaa agactacgcc    540

agacttttag cctcaatagg tataaacggt gttgttataa acaacgtgaa cgtcaaaaag    600

cgagaagttt acttgataga ctctatctac cttaaaaaat tgaaaaagtt ggctgatatc    660

ttcagagagt atggaattaa gatctatcta agtataaact tcgcttctcc tgtttatctg    720

ggagggctgg atacggccga tcccctcgac gaaagagtgg cgcgctggtg gagagaaaaa    780

gcgaggggaa tatacgatta tattccagat tttggaggat ttcttgtcaa agccgattct    840

gagttcaatc ctggaccgca catgtttgga agaacgcatg cagaaggggc aaacatgctt    900

gcaagggctc tggcaccgtt cggtggagta gtaatatgga gagcttttgt ttacaactgc    960

ctccaggact ggagagatta caagacggac agagcaaagg cggcttatga caatttcaag   1020

cctcttgatg ggcagttcga cgacaacgtg atcattcaaa taaagtatgg tcctatggat   1080

tttcaggtga gagaacccgt caatccgctt ttcggaggaa tggagaagac aaatcaaatt   1140

cttgagcttc aaatcacgca ggaatacaca ggtcagcaga ttcatctgtg ttttttagga   1200

accctctgga aggagatact ggaattcgac acgtttgcga agggtgaggg ttcttacgtg   1260

aagagaatcg tggatggaac tctctttgac cgtgaaaaca acggttttgc cggagtatca   1320

aacgttggtg acagtgtgaa ctggacgggc cacgatctcg ctcaggcgaa tttatacgcg   1380

ttcggaagac ttgcgtggaa tcccgatgaa gaaatcgaaa ggatagttga agaatggata   1440

aaactcactt tcggcgatga cgagaaggtt ctggaaaatg tttcatacat gttgatgaaa   1500

tcccacagga cgtatgagaa atacaccact ccgttcgggt tgggatggat ggtgaatcct   1560

ggtcaccact acggtccaaa tccggaggga tacgaatatt cgaaatgggg tacttaccac   1620

agagccaact gggaagcgat cggtgtcgat agaacttcca gaggcactgg ttacaccctt   1680

caataccatt caccgtggaa agagatatac gatgatatca acacgtgtcc cgaggatctt   1740

cttcttttct tccacagggt acgatacgat catcgtttga aatccggaaa gacactcctt   1800

cagacgatgt acgatctcca ctttgaaggg gtggaggaag tagaagaatt cataaagaaa   1860

tgggaggaac tgaaagacag ggtatcacca gatatttttg agagagtgaa agagcgtctt   1920

catatgcaac tcgaacacgc gaaggagtgg cgtgatgtaa tcaacacata cttttacagg   1980

agaacgggta tccctgatga gaagggaaga aagatatatc cgtga                   2025

<210> 442
<211> 674
<212> PRT
<213> Thermotoga maritima MSB8

<220> 
<221> DOMAIN
<222> (6)...(118)
<223> Glycosyl hydrolase family 67 N-terminus

<220> 
<221> DOMAIN
<222> (119)...(448)
<223> Glycosyl hydrolase family 67 middle domain

<220> 
<221> DOMAIN
<222> (449)...(673)
<223> Glycosyl hydrolase family 67 C-terminus

<220> 
<221> SITE
<222> (97)...(100)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (453)...(456)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (500)...(503)
<223> N-glycosylation site. Prosite id = PS00001

<400> 442
Met Asp Tyr Arg Met Cys Trp Leu Glu Tyr Arg Gly Leu Pro Ala Asp 
1               5                   10                  15      


Val Ala Gly Lys Leu Lys Asp Trp Phe Ser Ser Val Ser Ile Leu Glu 
            20                  25                  30          


Pro Gly Ser Ser Val Leu Lys Asp Glu Ile Arg Arg Phe Ser Glu Arg 
        35                  40                  45              


Ser Ile Gly Ile Thr Pro Arg Phe Tyr Ser Arg Pro Leu Lys Lys Glu 
    50                  55                  60                  


Lys Tyr Ile Met Val Gly Arg Leu Glu Ser Leu Pro Ile Lys Leu Asp 
65                  70                  75                  80  


Val Asn Leu Gly Glu Glu Gly Phe Met Leu Arg Thr Ile Glu Trp Asn 
                85                  90                  95      


Gly Ser Lys Ile Leu Leu Val Thr Gly Glu Thr Lys Lys Ala Leu Val 
            100                 105                 110         


Tyr Gly Ile Phe Asp Leu Met Lys Arg Ile Arg Leu Gly Glu Asp Ile 
        115                 120                 125             


Glu Lys Met Asn Val Leu Ala Lys Pro Lys Ala Lys Phe Arg Met Leu 
    130                 135                 140                 


Asn His Trp Asp Asn Leu Asp Gly Thr Ile Glu Arg Gly Tyr Ala Gly 
145                 150                 155                 160 


Asn Ser Ile Phe Phe Lys Asp Asn Arg Ile Ile Ile Asn Gln Arg Thr 
                165                 170                 175     


Lys Asp Tyr Ala Arg Leu Leu Ala Ser Ile Gly Ile Asn Gly Val Val 
            180                 185                 190         


Ile Asn Asn Val Asn Val Lys Lys Arg Glu Val Tyr Leu Ile Asp Ser 
        195                 200                 205             


Ile Tyr Leu Lys Lys Leu Lys Lys Leu Ala Asp Ile Phe Arg Glu Tyr 
    210                 215                 220                 


Gly Ile Lys Ile Tyr Leu Ser Ile Asn Phe Ala Ser Pro Val Tyr Leu 
225                 230                 235                 240 


Gly Gly Leu Asp Thr Ala Asp Pro Leu Asp Glu Arg Val Ala Arg Trp 
                245                 250                 255     


Trp Arg Glu Lys Ala Arg Gly Ile Tyr Asp Tyr Ile Pro Asp Phe Gly 
            260                 265                 270         


Gly Phe Leu Val Lys Ala Asp Ser Glu Phe Asn Pro Gly Pro His Met 
        275                 280                 285             


Phe Gly Arg Thr His Ala Glu Gly Ala Asn Met Leu Ala Arg Ala Leu 
    290                 295                 300                 


Ala Pro Phe Gly Gly Val Val Ile Trp Arg Ala Phe Val Tyr Asn Cys 
305                 310                 315                 320 


Leu Gln Asp Trp Arg Asp Tyr Lys Thr Asp Arg Ala Lys Ala Ala Tyr 
                325                 330                 335     


Asp Asn Phe Lys Pro Leu Asp Gly Gln Phe Asp Asp Asn Val Ile Ile 
            340                 345                 350         


Gln Ile Lys Tyr Gly Pro Met Asp Phe Gln Val Arg Glu Pro Val Asn 
        355                 360                 365             


Pro Leu Phe Gly Gly Met Glu Lys Thr Asn Gln Ile Leu Glu Leu Gln 
    370                 375                 380                 


Ile Thr Gln Glu Tyr Thr Gly Gln Gln Ile His Leu Cys Phe Leu Gly 
385                 390                 395                 400 


Thr Leu Trp Lys Glu Ile Leu Glu Phe Asp Thr Phe Ala Lys Gly Glu 
                405                 410                 415     


Gly Ser Tyr Val Lys Arg Ile Val Asp Gly Thr Leu Phe Asp Arg Glu 
            420                 425                 430         


Asn Asn Gly Phe Ala Gly Val Ser Asn Val Gly Asp Ser Val Asn Trp 
        435                 440                 445             


Thr Gly His Asp Leu Ala Gln Ala Asn Leu Tyr Ala Phe Gly Arg Leu 
    450                 455                 460                 


Ala Trp Asn Pro Asp Glu Glu Ile Glu Arg Ile Val Glu Glu Trp Ile 
465                 470                 475                 480 


Lys Leu Thr Phe Gly Asp Asp Glu Lys Val Leu Glu Asn Val Ser Tyr 
                485                 490                 495     


Met Leu Met Lys Ser His Arg Thr Tyr Glu Lys Tyr Thr Thr Pro Phe 
            500                 505                 510         


Gly Leu Gly Trp Met Val Asn Pro Gly His His Tyr Gly Pro Asn Pro 
        515                 520                 525             


Glu Gly Tyr Glu Tyr Ser Lys Trp Gly Thr Tyr His Arg Ala Asn Trp 
    530                 535                 540                 


Glu Ala Ile Gly Val Asp Arg Thr Ser Arg Gly Thr Gly Tyr Thr Leu 
545                 550                 555                 560 


Gln Tyr His Ser Pro Trp Lys Glu Ile Tyr Asp Asp Ile Asn Thr Cys 
                565                 570                 575     


Pro Glu Asp Leu Leu Leu Phe Phe His Arg Val Arg Tyr Asp His Arg 
            580                 585                 590         


Leu Lys Ser Gly Lys Thr Leu Leu Gln Thr Met Tyr Asp Leu His Phe 
        595                 600                 605             


Glu Gly Val Glu Glu Val Glu Glu Phe Ile Lys Lys Trp Glu Glu Leu 
    610                 615                 620                 


Lys Asp Arg Val Ser Pro Asp Ile Phe Glu Arg Val Lys Glu Arg Leu 
625                 630                 635                 640 


His Met Gln Leu Glu His Ala Lys Glu Trp Arg Asp Val Ile Asn Thr 
                645                 650                 655     


Tyr Phe Tyr Arg Arg Thr Gly Ile Pro Asp Glu Lys Gly Arg Lys Ile 
            660                 665                 670         


Tyr Pro 
        

<210> 443
<211> 1077
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 443
atgaacttca gtctcaggaa ggctgcagcg gcgctggctt gcgtcgcggg cctgtatgca     60

tcatcggcgg gcgctcagac ctgcctgacc aacaaccaga ccggcaacaa cggcgggtac    120

tactactcgt tctggaagga cagcggcaac gtcaccttct gcctgcagtc cggcgggcga    180

tacacgtccc agtggagcaa cgtcaacaac tgggtgggcg gcaagggctg gaacccgggt    240

gggcgacgca ccgtcaccta ttccggcacc tacaacccca atggcaattc gtacctgacc    300

ctgtacggct ggaccacgaa tccactggtc gagtactaca tcgtcgacag ctggggttcc    360

tggcgcccac cgggctcggg atacatgggc acggtcacca gcgatggcgg cacctacgac    420

atctatcgca cgcagcgtgt gaaccagcct tccatcatcg gcaccgcgac gttctaccaa    480

tactggagcg tgcggcaatc gaagcgcgtg ggtggcacca tcacctcggg caatcacttc    540

gatgcctggg cctcgctggg catgaacctc ggcacgcaca actacatggt gatggccacc    600

gagggctacc agagcagcgg cagctcggac atcacggtgg gcagcggcag ttcgtcgtcg    660

agcagcagct cgtccagcag tagcagctcg tcgtccagta gcagcagcag ttcttcgtcc    720

agcagcagcg gtggcggcgg caccaagagc ttcaccgtgc gcgcacgcgg cacggcgggt    780

ggcgagtcca tcaccttgcg ggtgaacaac cagaacgtgc agacctggac gctgggcacc    840

agcatgcaga actacacggc gtccacctcg ctgagcggcg gcatcacggt ggccttcacc    900

aacgacggcg gcaaccgcga cgtccaggtg gattacatca tcgtgaatgg ccagacgcgc    960

cagtccgagg cgcagaccta caacaccggc ctgtatgcca atggccgctg cggtggtggc   1020

tctaacagcg agtggatgca ctgcaacggc gccatcggct acggcaacac gccctag      1077

<210> 444
<211> 358
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (35)...(213)
<223> Glycosyl hydrolases family 11

<220> 
<221> SITE
<222> (2)...(5)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (32)...(35)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (50)...(53)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (109)...(119)
<223> Glycosyl hydrolases family 11 active site signature 1. Prosite id = PS00776

<220> 
<221> SITE
<222> (201)...(212)
<223> Glycosyl hydrolases family 11 active site signature 2. Prosite id = PS00777

<220> 
<221> SITE
<222> (288)...(291)
<223> N-glycosylation site. Prosite id = PS00001

<400> 444
Met Asn Phe Ser Leu Arg Lys Ala Ala Ala Ala Leu Ala Cys Val Ala 
1               5                   10                  15      


Gly Leu Tyr Ala Ser Ser Ala Gly Ala Gln Thr Cys Leu Thr Asn Asn 
            20                  25                  30          


Gln Thr Gly Asn Asn Gly Gly Tyr Tyr Tyr Ser Phe Trp Lys Asp Ser 
        35                  40                  45              


Gly Asn Val Thr Phe Cys Leu Gln Ser Gly Gly Arg Tyr Thr Ser Gln 
    50                  55                  60                  


Trp Ser Asn Val Asn Asn Trp Val Gly Gly Lys Gly Trp Asn Pro Gly 
65                  70                  75                  80  


Gly Arg Arg Thr Val Thr Tyr Ser Gly Thr Tyr Asn Pro Asn Gly Asn 
                85                  90                  95      


Ser Tyr Leu Thr Leu Tyr Gly Trp Thr Thr Asn Pro Leu Val Glu Tyr 
            100                 105                 110         


Tyr Ile Val Asp Ser Trp Gly Ser Trp Arg Pro Pro Gly Ser Gly Tyr 
        115                 120                 125             


Met Gly Thr Val Thr Ser Asp Gly Gly Thr Tyr Asp Ile Tyr Arg Thr 
    130                 135                 140                 


Gln Arg Val Asn Gln Pro Ser Ile Ile Gly Thr Ala Thr Phe Tyr Gln 
145                 150                 155                 160 


Tyr Trp Ser Val Arg Gln Ser Lys Arg Val Gly Gly Thr Ile Thr Ser 
                165                 170                 175     


Gly Asn His Phe Asp Ala Trp Ala Ser Leu Gly Met Asn Leu Gly Thr 
            180                 185                 190         


His Asn Tyr Met Val Met Ala Thr Glu Gly Tyr Gln Ser Ser Gly Ser 
        195                 200                 205             


Ser Asp Ile Thr Val Gly Ser Gly Ser Ser Ser Ser Ser Ser Ser Ser 
    210                 215                 220                 


Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser Ser 
225                 230                 235                 240 


Ser Ser Ser Gly Gly Gly Gly Thr Lys Ser Phe Thr Val Arg Ala Arg 
                245                 250                 255     


Gly Thr Ala Gly Gly Glu Ser Ile Thr Leu Arg Val Asn Asn Gln Asn 
            260                 265                 270         


Val Gln Thr Trp Thr Leu Gly Thr Ser Met Gln Asn Tyr Thr Ala Ser 
        275                 280                 285             


Thr Ser Leu Ser Gly Gly Ile Thr Val Ala Phe Thr Asn Asp Gly Gly 
    290                 295                 300                 


Asn Arg Asp Val Gln Val Asp Tyr Ile Ile Val Asn Gly Gln Thr Arg 
305                 310                 315                 320 


Gln Ser Glu Ala Gln Thr Tyr Asn Thr Gly Leu Tyr Ala Asn Gly Arg 
                325                 330                 335     


Cys Gly Gly Gly Ser Asn Ser Glu Trp Met His Cys Asn Gly Ala Ile 
            340                 345                 350         


Gly Tyr Gly Asn Thr Pro 
        355             

<210> 445
<211> 1605
<212> DNA
<213> Agaricus bisporus ATCC 62489

<400> 445
atgtctgctg cactgtctta tcgcatttac aagaatgctc tcctcttcac tgccttcttg     60

actgccgctc gggcacagca agttggcacc tcaaccgctg aggttcaccc gtctttgacc    120

tggtcgcaat gtaccagcgg tggaagctgc accacgcaga gcggctcggt tgtcatcgat    180

gccaactggc gatgggtcca caacgtcggt ggctctacca actgctacac cggtaacgaa    240

tgggacacca cgctttgccc tgacgatgtt acctgcgcga ccaactgcgc tttggacggt    300

gccacatacg aagctaccta cggtgtaacc accagcggtg atgagcttcg tctgaacttt    360

gttactactg cctcgcagaa gaacattgga tctcgtctgt acctgatgga ggatgacacc    420

acctaccaga tattcgatct gctgaatcag gaattcacct tcgacgtgga cgtctctaac    480

cttccctgtg gtttgaacgg tgccctctac tttgtcgcta tggactctga cggtggcatg    540

gctcgcttcc ccgcaaacaa ggccggagcc caatatggaa ccggttactg tgactctcag    600

tgcccccggg acctcaaatt catcgatggt gaggccaact gcgatggctg ggttccttcc    660

acaaccgatg taaactctgg tattggtaac cacggttcct gctgcgcgga aatggatatc    720

tgggaggcta actcgatctc gaacgcggtc actcctcacc cttgtgaaac tccaacccag    780

actatgtgcg aggaagatgc ttgtggtggt acttacagca ctgatcgcta cggtggcact    840

tgcgatcctg atggatgtga tttcaaccct taccgcatgg gtaacaccac attcttcggt    900

cctgacatga ccgtcgatac caactccgtc ttcactgttg tgactcagtt catcaccgat    960

gatggcacct ccacgggtac tctgagtgag atcaagcggt tctacgtgca ggacggcgtc   1020

gtcatcccca actcggcctc taccatcagt ggtgtgagtg gtaactctat tacctcagac   1080

ttctgcactg cccagaagac tgccttcggt gatgaggacg ttttcgacga gcgcggaggg   1140

ttggcgggtc ttggtgccgg tctcgctgac ggcatggtcc tggtcatgag tctctgggat   1200

gaccactatg ccgacatgct ctggctcgac agcatctacc caacaaacga tacctccacc   1260

acccccggtg ccgctcgtgg tacctgtgac atctcatctg gagtcccagc taccattgaa   1320

gactccgacg caagcgccta tgtcatctac tcaaacatta aggttggtcc gattggctct   1380

accttctcgt cgggtacctc cagttctagc actagctcga gcactacttc caagactact   1440

tccacatcca ccaagaccac ctctaccact accaccacgg ctgctagcac cactggtgct   1500

gctcactacg cccagtgtgg tggatctgga tggactggtg ccactgcttg tgccagcccc   1560

tacacctgca ccgcgcagaa tgcctactac tctcagtgcc tataa                   1605

<210> 446
<211> 534
<212> PRT
<213> Agaricus bisporus ATCC 62489

<220> 
<221> DOMAIN
<222> (27)...(464)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (502)...(530)
<223> Fungal cellulose binding domain

<220> 
<221> SITE
<222> (299)...(302)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (422)...(425)
<223> N-glycosylation site. Prosite id = PS00001

<400> 446
Met Ser Ala Ala Leu Ser Tyr Arg Ile Tyr Lys Asn Ala Leu Leu Phe 
1               5                   10                  15      


Thr Ala Phe Leu Thr Ala Ala Arg Ala Gln Gln Val Gly Thr Ser Thr 
            20                  25                  30          


Ala Glu Val His Pro Ser Leu Thr Trp Ser Gln Cys Thr Ser Gly Gly 
        35                  40                  45              


Ser Cys Thr Thr Gln Ser Gly Ser Val Val Ile Asp Ala Asn Trp Arg 
    50                  55                  60                  


Trp Val His Asn Val Gly Gly Ser Thr Asn Cys Tyr Thr Gly Asn Glu 
65                  70                  75                  80  


Trp Asp Thr Thr Leu Cys Pro Asp Asp Val Thr Cys Ala Thr Asn Cys 
                85                  90                  95      


Ala Leu Asp Gly Ala Thr Tyr Glu Ala Thr Tyr Gly Val Thr Thr Ser 
            100                 105                 110         


Gly Asp Glu Leu Arg Leu Asn Phe Val Thr Thr Ala Ser Gln Lys Asn 
        115                 120                 125             


Ile Gly Ser Arg Leu Tyr Leu Met Glu Asp Asp Thr Thr Tyr Gln Ile 
    130                 135                 140                 


Phe Asp Leu Leu Asn Gln Glu Phe Thr Phe Asp Val Asp Val Ser Asn 
145                 150                 155                 160 


Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val Ala Met Asp Ser 
                165                 170                 175     


Asp Gly Gly Met Ala Arg Phe Pro Ala Asn Lys Ala Gly Ala Gln Tyr 
            180                 185                 190         


Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp Leu Lys Phe Ile 
        195                 200                 205             


Asp Gly Glu Ala Asn Cys Asp Gly Trp Val Pro Ser Thr Thr Asp Val 
    210                 215                 220                 


Asn Ser Gly Ile Gly Asn His Gly Ser Cys Cys Ala Glu Met Asp Ile 
225                 230                 235                 240 


Trp Glu Ala Asn Ser Ile Ser Asn Ala Val Thr Pro His Pro Cys Glu 
                245                 250                 255     


Thr Pro Thr Gln Thr Met Cys Glu Glu Asp Ala Cys Gly Gly Thr Tyr 
            260                 265                 270         


Ser Thr Asp Arg Tyr Gly Gly Thr Cys Asp Pro Asp Gly Cys Asp Phe 
        275                 280                 285             


Asn Pro Tyr Arg Met Gly Asn Thr Thr Phe Phe Gly Pro Asp Met Thr 
    290                 295                 300                 


Val Asp Thr Asn Ser Val Phe Thr Val Val Thr Gln Phe Ile Thr Asp 
305                 310                 315                 320 


Asp Gly Thr Ser Thr Gly Thr Leu Ser Glu Ile Lys Arg Phe Tyr Val 
                325                 330                 335     


Gln Asp Gly Val Val Ile Pro Asn Ser Ala Ser Thr Ile Ser Gly Val 
            340                 345                 350         


Ser Gly Asn Ser Ile Thr Ser Asp Phe Cys Thr Ala Gln Lys Thr Ala 
        355                 360                 365             


Phe Gly Asp Glu Asp Val Phe Asp Glu Arg Gly Gly Leu Ala Gly Leu 
    370                 375                 380                 


Gly Ala Gly Leu Ala Asp Gly Met Val Leu Val Met Ser Leu Trp Asp 
385                 390                 395                 400 


Asp His Tyr Ala Asp Met Leu Trp Leu Asp Ser Ile Tyr Pro Thr Asn 
                405                 410                 415     


Asp Thr Ser Thr Thr Pro Gly Ala Ala Arg Gly Thr Cys Asp Ile Ser 
            420                 425                 430         


Ser Gly Val Pro Ala Thr Ile Glu Asp Ser Asp Ala Ser Ala Tyr Val 
        435                 440                 445             


Ile Tyr Ser Asn Ile Lys Val Gly Pro Ile Gly Ser Thr Phe Ser Ser 
    450                 455                 460                 


Gly Thr Ser Ser Ser Ser Thr Ser Ser Ser Thr Thr Ser Lys Thr Thr 
465                 470                 475                 480 


Ser Thr Ser Thr Lys Thr Thr Ser Thr Thr Thr Thr Thr Ala Ala Ser 
                485                 490                 495     


Thr Thr Gly Ala Ala His Tyr Ala Gln Cys Gly Gly Ser Gly Trp Thr 
            500                 505                 510         


Gly Ala Thr Ala Cys Ala Ser Pro Tyr Thr Cys Thr Ala Gln Asn Ala 
        515                 520                 525             


Tyr Tyr Ser Gln Cys Leu 
    530                 

<210> 447
<211> 1200
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 447
atgatcgttg gattctcgtt tatgctgctg cttcctttag ggatgacgaa tgcattggca     60

aaaacggaac cagcgtacgc taaaaagccg cgaatcagcg cattgcacgc ccctcaattg    120

gatcagcgct acaaagattc cttcactatt ggggcggccg ttgaacctta tcagttgcaa    180

aacgaaaaag acgtccaaat gctgaaacgc cattttaaca gcattgtcgc tgagaacgtt    240

atgaaaccga tcaacatcca acccgaagaa ggaaagttca attttgctga ggcggatcaa    300

atcgtccgat ttgctaaaaa acatcatatg gatattcgtt tccatacact cgtttggcac    360

agccaagtac ctcaatggtt ctttcttgac aaggaaggca agccgatggt caatgaaacg    420

gatccggcaa agcgcgaaca aaataaacag ctgttactga aacggctcga aatccatatt    480

aaaacgattg tcgaacggta taaagacgac atcaaatatt gggacgtcgt gaacgaggta    540

gtcggggatg atggaaaatt gcgcaattcc ccgtggtatc aaatcgccgg catcgattat    600

atcaaggtag cattccaaac ggcgagaaca tatggcggca acaagattaa actgtacatc    660

aacgattaca ataccgaagt ggaaccgaag cgaagcgctc tttataactt agtgaaacaa    720

ttaaaagaag aaggcgttcc cattgacggg attggccacc agtcccacat ccaaattggc    780

tggccttctg aagaagaaat cgaaaaaacg atcaacatgt ttgccgatct agggttagac    840

aatcaaatta cggagctgga tgtgagcatg tacggctggc cgccgcgcgc ctacccgtcg    900

tatgacgcca ttccggaaca aaagtttttg gaccaagcgg ctcgctatga ccgattgttt    960

aagctgtacg aaaaacttgg cgataaaatc agcaacgtca ccttctgggg catcgccgac   1020

aaccatacgt ggctcgacag ccgtgcggat gtgtactatg acgccaacgg gaatgttgtg   1080

gttgacccga acgctccgta cgcaaaagtg gaaaaaggga aaggaaaaga tgcgccgttt   1140

ctgttcgacc ccgaatacca cgtaaaacct gcgtattggg ccattatcga tcataagtga   1200

<210> 448
<211> 399
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(20)

<220> 
<221> DOMAIN
<222> (39)...(399)
<223> Glycosyl hydrolase family 10

<220> 
<221> SITE
<222> (140)...(143)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (282)...(292)
<223> Glycosyl hydrolases family 10 active site. Prosite id = PS00591

<220> 
<221> SITE
<222> (337)...(340)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (346)...(349)
<223> N-glycosylation site. Prosite id = PS00001

<400> 448
Met Ile Val Gly Phe Ser Phe Met Leu Leu Leu Pro Leu Gly Met Thr 
1               5                   10                  15      


Asn Ala Leu Ala Lys Thr Glu Pro Ala Tyr Ala Lys Lys Pro Arg Ile 
            20                  25                  30          


Ser Ala Leu His Ala Pro Gln Leu Asp Gln Arg Tyr Lys Asp Ser Phe 
        35                  40                  45              


Thr Ile Gly Ala Ala Val Glu Pro Tyr Gln Leu Gln Asn Glu Lys Asp 
    50                  55                  60                  


Val Gln Met Leu Lys Arg His Phe Asn Ser Ile Val Ala Glu Asn Val 
65                  70                  75                  80  


Met Lys Pro Ile Asn Ile Gln Pro Glu Glu Gly Lys Phe Asn Phe Ala 
                85                  90                  95      


Glu Ala Asp Gln Ile Val Arg Phe Ala Lys Lys His His Met Asp Ile 
            100                 105                 110         


Arg Phe His Thr Leu Val Trp His Ser Gln Val Pro Gln Trp Phe Phe 
        115                 120                 125             


Leu Asp Lys Glu Gly Lys Pro Met Val Asn Glu Thr Asp Pro Ala Lys 
    130                 135                 140                 


Arg Glu Gln Asn Lys Gln Leu Leu Leu Lys Arg Leu Glu Ile His Ile 
145                 150                 155                 160 


Lys Thr Ile Val Glu Arg Tyr Lys Asp Asp Ile Lys Tyr Trp Asp Val 
                165                 170                 175     


Val Asn Glu Val Val Gly Asp Asp Gly Lys Leu Arg Asn Ser Pro Trp 
            180                 185                 190         


Tyr Gln Ile Ala Gly Ile Asp Tyr Ile Lys Val Ala Phe Gln Thr Ala 
        195                 200                 205             


Arg Thr Tyr Gly Gly Asn Lys Ile Lys Leu Tyr Ile Asn Asp Tyr Asn 
    210                 215                 220                 


Thr Glu Val Glu Pro Lys Arg Ser Ala Leu Tyr Asn Leu Val Lys Gln 
225                 230                 235                 240 


Leu Lys Glu Glu Gly Val Pro Ile Asp Gly Ile Gly His Gln Ser His 
                245                 250                 255     


Ile Gln Ile Gly Trp Pro Ser Glu Glu Glu Ile Glu Lys Thr Ile Asn 
            260                 265                 270         


Met Phe Ala Asp Leu Gly Leu Asp Asn Gln Ile Thr Glu Leu Asp Val 
        275                 280                 285             


Ser Met Tyr Gly Trp Pro Pro Arg Ala Tyr Pro Ser Tyr Asp Ala Ile 
    290                 295                 300                 


Pro Glu Gln Lys Phe Leu Asp Gln Ala Ala Arg Tyr Asp Arg Leu Phe 
305                 310                 315                 320 


Lys Leu Tyr Glu Lys Leu Gly Asp Lys Ile Ser Asn Val Thr Phe Trp 
                325                 330                 335     


Gly Ile Ala Asp Asn His Thr Trp Leu Asp Ser Arg Ala Asp Val Tyr 
            340                 345                 350         


Tyr Asp Ala Asn Gly Asn Val Val Val Asp Pro Asn Ala Pro Tyr Ala 
        355                 360                 365             


Lys Val Glu Lys Gly Lys Gly Lys Asp Ala Pro Phe Leu Phe Asp Pro 
    370                 375                 380                 


Glu Tyr His Val Lys Pro Ala Tyr Trp Ala Ile Ile Asp His Lys 
385                 390                 395                 

<210> 449
<211> 1350
<212> DNA
<213> Cochliobolus heterostrophus ATCC 48331

<400> 449
atgagatttc cttcaatttt tactgctgtt ttattcgcag catcctccgc attagctgct     60

ccagtcaaca ctacaacaga agatgaaacg gcacaaattc cggctgaagc tgtcatcggt    120

tactcagatt tagaagggga tttcgatgtt gctgttttgc cattttccaa cagcacaaat    180

aacgggttat tgtttataaa tactactatt gccagcattg ctgctaaaga agaaggggta    240

tctctcgaga aaagagaggc tttccccaaa ccagagccca gcaataccac cgtcaaccca    300

tggattggca aggaccgcta tgttgtcgaa agctacggca agaagcttga ccaaaccatt    360

gcttccttcc tatcaaagaa tgactccctc aacgcagcac gaacccgcac tgtggctaag    420

aagacatcca ccttcgtatg ggttacatcg cgcgctggac ttagccagat ccccgaagcc    480

attcaacaag cccgccgtca acaaaagggc cgaaagagga tgattgtcgg ccttgtcctt    540

tacaatctcc ccgaccgtga ctgctctgct ggagagtctg ccggagagct cagctccgcc    600

aaagacggcc tgaacatcta caagcacgaa ttcgtcgaca agtacgcaca gctcgtatcc    660

gaggccaagg atctcgactt cgccattgtt ctcgagcccg actcgctcgg caacgctgtc    720

accaaccaag gtattccctt ctgcgccaac gccaccccca tctacgagca aggtatcgct    780

tacgccattg ccaagctgca gttccccaat gtatccctat acatggatgc tgctcatgga    840

ggctggcttg gatgggccga caacctgaaa cccacagccc agatctttgc cagagttgtc    900

gaagcggcta agaagatcaa cccagcggcc aagatccgtg gttactctac caacgtatcc    960

aactacaacc ctttcaacgc caaggtccgc gagaactaca ccgaatggtc tccatcgtgg   1020

gatgaaagcc actacgctac ctctcttgga gcggcaatgg cagctgaggg catgccaacc   1080

aacttcatca tcgaccaggg ccgtgttgcc ctccccggtg ctcgcaagga gtggggcgag   1140

tggtgcaacg tttcgccctc tggctttgga ctgcgccctg gagctgcacc aaacaacacc   1200

aacgtcgact cgattgtctg gattaagccc ggtggtgaga gtgacggagc ctgcggtctg   1260

gctggtgcgc ctgttgctgg tgcttggttc gacgactacg ttcagatgct cgtcaagaac   1320

gccgaccctc ctcttgagcc tacctactag                                    1350

<210> 450
<211> 449
<212> PRT
<213> Cochliobolus heterostrophus ATCC 48331

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (1)...(86)
<223> Mating factor alpha precursor N-terminus

<220> 
<221> DOMAIN
<222> (121)...(431)
<223> Glycosyl hydrolases family 6

<220> 
<221> SITE
<222> (23)...(26)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (57)...(60)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (68)...(71)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (96)...(99)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (128)...(131)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (231)...(240)
<223> Glycosyl hydrolases family 6 signature 2. Prosite id = PS00656

<220> 
<221> SITE
<222> (274)...(277)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (322)...(325)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (337)...(340)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (404)...(407)
<223> N-glycosylation site. Prosite id = PS00001

<400> 450
Met Arg Phe Pro Ser Ile Phe Thr Ala Val Leu Phe Ala Ala Ser Ser 
1               5                   10                  15      


Ala Leu Ala Ala Pro Val Asn Thr Thr Thr Glu Asp Glu Thr Ala Gln 
            20                  25                  30          


Ile Pro Ala Glu Ala Val Ile Gly Tyr Ser Asp Leu Glu Gly Asp Phe 
        35                  40                  45              


Asp Val Ala Val Leu Pro Phe Ser Asn Ser Thr Asn Asn Gly Leu Leu 
    50                  55                  60                  


Phe Ile Asn Thr Thr Ile Ala Ser Ile Ala Ala Lys Glu Glu Gly Val 
65                  70                  75                  80  


Ser Leu Glu Lys Arg Glu Ala Phe Pro Lys Pro Glu Pro Ser Asn Thr 
                85                  90                  95      


Thr Val Asn Pro Trp Ile Gly Lys Asp Arg Tyr Val Val Glu Ser Tyr 
            100                 105                 110         


Gly Lys Lys Leu Asp Gln Thr Ile Ala Ser Phe Leu Ser Lys Asn Asp 
        115                 120                 125             


Ser Leu Asn Ala Ala Arg Thr Arg Thr Val Ala Lys Lys Thr Ser Thr 
    130                 135                 140                 


Phe Val Trp Val Thr Ser Arg Ala Gly Leu Ser Gln Ile Pro Glu Ala 
145                 150                 155                 160 


Ile Gln Gln Ala Arg Arg Gln Gln Lys Gly Arg Lys Arg Met Ile Val 
                165                 170                 175     


Gly Leu Val Leu Tyr Asn Leu Pro Asp Arg Asp Cys Ser Ala Gly Glu 
            180                 185                 190         


Ser Ala Gly Glu Leu Ser Ser Ala Lys Asp Gly Leu Asn Ile Tyr Lys 
        195                 200                 205             


His Glu Phe Val Asp Lys Tyr Ala Gln Leu Val Ser Glu Ala Lys Asp 
    210                 215                 220                 


Leu Asp Phe Ala Ile Val Leu Glu Pro Asp Ser Leu Gly Asn Ala Val 
225                 230                 235                 240 


Thr Asn Gln Gly Ile Pro Phe Cys Ala Asn Ala Thr Pro Ile Tyr Glu 
                245                 250                 255     


Gln Gly Ile Ala Tyr Ala Ile Ala Lys Leu Gln Phe Pro Asn Val Ser 
            260                 265                 270         


Leu Tyr Met Asp Ala Ala His Gly Gly Trp Leu Gly Trp Ala Asp Asn 
        275                 280                 285             


Leu Lys Pro Thr Ala Gln Ile Phe Ala Arg Val Val Glu Ala Ala Lys 
    290                 295                 300                 


Lys Ile Asn Pro Ala Ala Lys Ile Arg Gly Tyr Ser Thr Asn Val Ser 
305                 310                 315                 320 


Asn Tyr Asn Pro Phe Asn Ala Lys Val Arg Glu Asn Tyr Thr Glu Trp 
                325                 330                 335     


Ser Pro Ser Trp Asp Glu Ser His Tyr Ala Thr Ser Leu Gly Ala Ala 
            340                 345                 350         


Met Ala Ala Glu Gly Met Pro Thr Asn Phe Ile Ile Asp Gln Gly Arg 
        355                 360                 365             


Val Ala Leu Pro Gly Ala Arg Lys Glu Trp Gly Glu Trp Cys Asn Val 
    370                 375                 380                 


Ser Pro Ser Gly Phe Gly Leu Arg Pro Gly Ala Ala Pro Asn Asn Thr 
385                 390                 395                 400 


Asn Val Asp Ser Ile Val Trp Ile Lys Pro Gly Gly Glu Ser Asp Gly 
                405                 410                 415     


Ala Cys Gly Leu Ala Gly Ala Pro Val Ala Gly Ala Trp Phe Asp Asp 
            420                 425                 430         


Tyr Val Gln Met Leu Val Lys Asn Ala Asp Pro Pro Leu Glu Pro Thr 
        435                 440                 445             


Tyr 
    

<210> 451
<211> 1545
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 451
atgtatcgcg tcatcgcaac cgcctcggct cttattgcca ctgctcgggc tcaacaggtc     60

tgctctttga ccaccgagac caagcctgcc ttgacctggt ccaagtgtac atccagcggc    120

tgcagcgatg tcaagggctc cgttggtatt gatgccaact ggcgatggac tcaccagact    180

tctggatcta ccaactgtta caccggaaac aagtgggaca cctccgtctg cactgatggt    240

aagacctgcg ccgagaagtg ctgtcttgat ggcgccgact atgctggcac ctacggaatc    300

acttccagcg gcaaccagct cagtcttgga ttcgttaccc agggtcccta cagcaagaac    360

atcggcagcc gaacatacct tatggagaac gagaacacct accagatgtt ccagcttctg    420

ggtaacgagt tcacctttga cgtcgatgtc tctggtatcg gctgcggtct gaacggtgca    480

ctctacttcg tcagcatgga cgaggatggt ggcaaggcca ggtactccgg aaacaaggcc    540

ggagccaagt acggaactgg ctactgtgat gctcaatgcc ctcgtgatgt caagttcatc    600

aacggagttg ccaactctga aggctggaag ccctctgaca gtgatgctaa cgctggtgtt    660

ggcaatctgg gcacctgctg ccccgagatg gatatctggg aggccaactc catctcaacc    720

gccttcactc ctcatccttg caccaagctc acgcagcact cttgcactgg cgactcttgt    780

ggtggaacct actctagtga ccgatatggc ggtacttgcg atgccgatgg ttgtgacttc    840

aacgcctacc gccagggcaa caagaccttc tacggtcctg gatccaactt caacgttgat    900

accaccaaga agatgactgt cgtcactcag ttccacaagg gcagcaacgg acgtctttct    960

gagatcaccc gtctgtatgt ccagaacggc aaggtcattg ccaactccga gtccaagatt   1020

tcaggcaacc ccggtagctc tctcacctct gacttctgct ccaagcagaa gagcgtcttt   1080

ggcgatatcg atgacttctc taagaagggt ggctggaacg gcatgagcga tgctctctcc   1140

gcccccatgg ttcttgttat gtctctctgg cacgaccacc actccaacat gctctggctc   1200

gactctacct acccaactga ctctaccaag gtcggatccc agcgaggttc ttgcgctacc   1260

acctctggca agccctccga ccttgagcga gatgttccca actccaaggt ttccttctcc   1320

aacatcaagt tcggtcccat cggaagcacc tacaagagcg acggcaccac ccccaacccc   1380

cctgccagca gcagcaccac tggttcttcc actcccacca acccccctgc cggcagcgtc   1440

gaccagtggg gacagtgcgg tggccagaac tacagcggcc ccacgacctg caagtctccc   1500

ttcacctgca agaagatcaa cgacttctac tcccagtgcc agtaa                   1545

<210> 452
<211> 514
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (19)...(453)
<223> Glycosyl hydrolase family 7

<220> 
<221> DOMAIN
<222> (482)...(510)
<223> Fungal cellulose binding domain

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (493)...(520)
<223> Cellulose-binding domain, fungal type. Prosite id = PS00562

<220> 
<221> SITE
<222> (497)...(500)
<223> N-glycosylation site. Prosite id = PS00001

<400> 452
Met Tyr Arg Val Ile Ala Thr Ala Ser Ala Leu Ile Ala Thr Ala Arg 
1               5                   10                  15      


Ala Gln Gln Val Cys Ser Leu Thr Thr Glu Thr Lys Pro Ala Leu Thr 
            20                  25                  30          


Trp Ser Lys Cys Thr Ser Ser Gly Cys Ser Asp Val Lys Gly Ser Val 
        35                  40                  45              


Gly Ile Asp Ala Asn Trp Arg Trp Thr His Gln Thr Ser Gly Ser Thr 
    50                  55                  60                  


Asn Cys Tyr Thr Gly Asn Lys Trp Asp Thr Ser Val Cys Thr Asp Gly 
65                  70                  75                  80  


Lys Thr Cys Ala Glu Lys Cys Cys Leu Asp Gly Ala Asp Tyr Ala Gly 
                85                  90                  95      


Thr Tyr Gly Ile Thr Ser Ser Gly Asn Gln Leu Ser Leu Gly Phe Val 
            100                 105                 110         


Thr Gln Gly Pro Tyr Ser Lys Asn Ile Gly Ser Arg Thr Tyr Leu Met 
        115                 120                 125             


Glu Asn Glu Asn Thr Tyr Gln Met Phe Gln Leu Leu Gly Asn Glu Phe 
    130                 135                 140                 


Thr Phe Asp Val Asp Val Ser Gly Ile Gly Cys Gly Leu Asn Gly Ala 
145                 150                 155                 160 


Leu Tyr Phe Val Ser Met Asp Glu Asp Gly Gly Lys Ala Arg Tyr Ser 
                165                 170                 175     


Gly Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ala Gln 
            180                 185                 190         


Cys Pro Arg Asp Val Lys Phe Ile Asn Gly Val Ala Asn Ser Glu Gly 
        195                 200                 205             


Trp Lys Pro Ser Asp Ser Asp Ala Asn Ala Gly Val Gly Asn Leu Gly 
    210                 215                 220                 


Thr Cys Cys Pro Glu Met Asp Ile Trp Glu Ala Asn Ser Ile Ser Thr 
225                 230                 235                 240 


Ala Phe Thr Pro His Pro Cys Thr Lys Leu Thr Gln His Ser Cys Thr 
                245                 250                 255     


Gly Asp Ser Cys Gly Gly Thr Tyr Ser Ser Asp Arg Tyr Gly Gly Thr 
            260                 265                 270         


Cys Asp Ala Asp Gly Cys Asp Phe Asn Ala Tyr Arg Gln Gly Asn Lys 
        275                 280                 285             


Thr Phe Tyr Gly Pro Gly Ser Asn Phe Asn Val Asp Thr Thr Lys Lys 
    290                 295                 300                 


Met Thr Val Val Thr Gln Phe His Lys Gly Ser Asn Gly Arg Leu Ser 
305                 310                 315                 320 


Glu Ile Thr Arg Leu Tyr Val Gln Asn Gly Lys Val Ile Ala Asn Ser 
                325                 330                 335     


Glu Ser Lys Ile Ser Gly Asn Pro Gly Ser Ser Leu Thr Ser Asp Phe 
            340                 345                 350         


Cys Ser Lys Gln Lys Ser Val Phe Gly Asp Ile Asp Asp Phe Ser Lys 
        355                 360                 365             


Lys Gly Gly Trp Asn Gly Met Ser Asp Ala Leu Ser Ala Pro Met Val 
    370                 375                 380                 


Leu Val Met Ser Leu Trp His Asp His His Ser Asn Met Leu Trp Leu 
385                 390                 395                 400 


Asp Ser Thr Tyr Pro Thr Asp Ser Thr Lys Val Gly Ser Gln Arg Gly 
                405                 410                 415     


Ser Cys Ala Thr Thr Ser Gly Lys Pro Ser Asp Leu Glu Arg Asp Val 
            420                 425                 430         


Pro Asn Ser Lys Val Ser Phe Ser Asn Ile Lys Phe Gly Pro Ile Gly 
        435                 440                 445             


Ser Thr Tyr Lys Ser Asp Gly Thr Thr Pro Asn Pro Pro Ala Ser Ser 
    450                 455                 460                 


Ser Thr Thr Gly Ser Ser Thr Pro Thr Asn Pro Pro Ala Gly Ser Val 
465                 470                 475                 480 


Asp Gln Trp Gly Gln Cys Gly Gly Gln Asn Tyr Ser Gly Pro Thr Thr 
                485                 490                 495     


Cys Lys Ser Pro Phe Thr Cys Lys Lys Ile Asn Asp Phe Tyr Ser Gln 
            500                 505                 510         


Cys Gln 
        

<210> 453
<211> 2094
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 453
atggacagag aacaagcaaa acggaaggcg gaagcgctgg tcggtcaaat gacggtggag     60

gaggccgctt cgcagctgcg gttcgacgcg ccggccatcg aaagactgaa cataccggca    120

tacaactggt ggaacgaggg actgcacggc ctggccagaa gcggtgtcgc cacgagcttt    180

ccgcaggcga tcgccatggc cgccgctttc gacccggaca tgatggaacg ggtgggcgac    240

gtcatcgcaa cggaagcccg cgccagatac aatacctcat ccgtccgcgg cgaccgcggt    300

atctacaagg gcctgaccat ctggtctccg aacgtgaaca tcttccgcga tccgagatgg    360

ggacgcggtc acgagaccta cggagaggac ccgtatctga cctcccgcct gggcgtcggc    420

ttcgtcaaag gcttgcaggg cgagggggaa acgcggaagg tcgcggcatg cgcgaagcat    480

ttcgccgtcc actccgggcc ggaggacctg aggcacaggt tcaacgcgac cgtctccgag    540

aaggacctgt acgagacgta tctgcccgcc tttaaggcgc tcgtccggga agcggacgtc    600

gagaccgtca tgggcgccta caaccgcacg aacggggaac cgtgctgcgg cagtaagaag    660

ctgttaaggg atattttgag agacgaatgg ggcttccgcg gccacgtcgt cagcgattgc    720

tgggccgtga aggatttcca tgagaaccat aaggttacgg agggaccgga gcagtccgtc    780

aagctcgctt tggagaacgg ctgcgacgtc aattgcggct gcacgtttca gttcgtgatg    840

agtgcgtacg cgaagggact catctcggag gaaaccatcc gtacatccgc cgtcaggttg    900

tttaccacaa ggtatcttct gggcatgatg gaggggagcg cgttcgaccg tatcggcatc    960

gaagcgctgg aagcgaagga gcatctgtcc gttgccgggg aagccgccga aaagggatgc   1020

gtgctgctga aaaacaacgg cttgctcccg ctcgatgccg gaagcatcaa aaagatcggc   1080

gtcatcggac cgaacgcgaa cagcagggcg gcgctcatcg gcaactacca cgggacctcc   1140

tcccgctacg tcaccgttct cgagggcata caggactatc tcggcgatga cgtgcgtgtg   1200

ctgtattccg agggctgtga tatcagtaac gacaaggtgg agcccctggc gcaggacggc   1260

gacagactgt ccgaagcgca ggccgtggcg gacgcgtccg acgttgtcgt gctcgtggtc   1320

ggactgaatg agaatctcga gggcgaagag ggcgacgcgg gcaaccagta cgtctccggc   1380

gacaagcagg acctcctgct gccggccccg cagagaaagc tgatgcaggc ggtctttgcc   1440

tcaaagaagc cgacggtcat cgtactgatg gcgggcagct ccatcgacct ggaggaatac   1500

ggcgaaaagg cggatgcgat cttgctgccc tggtatccgg gcgcgcgcgg cggcaaaacg   1560

gtcgcggaca tcctcttcgg cagggtctcc ccgtccggca agcttccggt caccttctac   1620

cgcaacgaag atctcgaaaa gatccccgcg tttacggatt acggtatgaa aaaccagact   1680

taccgctatc tcgattttgc acccctctat ccgttcggct acggcctgac ctacggcgac   1740

tgtgccgtcg aaacggccga agcttgtaaa acgggcgata cggtccgggt aagggctacc   1800

gtaaagaatc tttccggcgt cgcgacgcag gaggtcatac aggtctatgc ccaaaatgag   1860

ggaagcgcta cggcgccccg caacccgcgg cttgtcggct tcaaacggat atcctgcccg   1920

ccgcagcgga gcgtaacgga agaatgggag tttaaggcgg agctgctgga taccgtggga   1980

gcggacggac gcccggtctt cgaggagcac gccacactgt acgtttcttt gggacaaccg   2040

gacgcgctga cggaaaaact gacgggacac gccgccatac ggatcggcat ctga         2094

<210> 454
<211> 697
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (30)...(273)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (340)...(580)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (92)...(95)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (177)...(180)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (211)...(214)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (566)...(569)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (612)...(615)
<223> N-glycosylation site. Prosite id = PS00001

<400> 454
Met Asp Arg Glu Gln Ala Lys Arg Lys Ala Glu Ala Leu Val Gly Gln 
1               5                   10                  15      


Met Thr Val Glu Glu Ala Ala Ser Gln Leu Arg Phe Asp Ala Pro Ala 
            20                  25                  30          


Ile Glu Arg Leu Asn Ile Pro Ala Tyr Asn Trp Trp Asn Glu Gly Leu 
        35                  40                  45              


His Gly Leu Ala Arg Ser Gly Val Ala Thr Ser Phe Pro Gln Ala Ile 
    50                  55                  60                  


Ala Met Ala Ala Ala Phe Asp Pro Asp Met Met Glu Arg Val Gly Asp 
65                  70                  75                  80  


Val Ile Ala Thr Glu Ala Arg Ala Arg Tyr Asn Thr Ser Ser Val Arg 
                85                  90                  95      


Gly Asp Arg Gly Ile Tyr Lys Gly Leu Thr Ile Trp Ser Pro Asn Val 
            100                 105                 110         


Asn Ile Phe Arg Asp Pro Arg Trp Gly Arg Gly His Glu Thr Tyr Gly 
        115                 120                 125             


Glu Asp Pro Tyr Leu Thr Ser Arg Leu Gly Val Gly Phe Val Lys Gly 
    130                 135                 140                 


Leu Gln Gly Glu Gly Glu Thr Arg Lys Val Ala Ala Cys Ala Lys His 
145                 150                 155                 160 


Phe Ala Val His Ser Gly Pro Glu Asp Leu Arg His Arg Phe Asn Ala 
                165                 170                 175     


Thr Val Ser Glu Lys Asp Leu Tyr Glu Thr Tyr Leu Pro Ala Phe Lys 
            180                 185                 190         


Ala Leu Val Arg Glu Ala Asp Val Glu Thr Val Met Gly Ala Tyr Asn 
        195                 200                 205             


Arg Thr Asn Gly Glu Pro Cys Cys Gly Ser Lys Lys Leu Leu Arg Asp 
    210                 215                 220                 


Ile Leu Arg Asp Glu Trp Gly Phe Arg Gly His Val Val Ser Asp Cys 
225                 230                 235                 240 


Trp Ala Val Lys Asp Phe His Glu Asn His Lys Val Thr Glu Gly Pro 
                245                 250                 255     


Glu Gln Ser Val Lys Leu Ala Leu Glu Asn Gly Cys Asp Val Asn Cys 
            260                 265                 270         


Gly Cys Thr Phe Gln Phe Val Met Ser Ala Tyr Ala Lys Gly Leu Ile 
        275                 280                 285             


Ser Glu Glu Thr Ile Arg Thr Ser Ala Val Arg Leu Phe Thr Thr Arg 
    290                 295                 300                 


Tyr Leu Leu Gly Met Met Glu Gly Ser Ala Phe Asp Arg Ile Gly Ile 
305                 310                 315                 320 


Glu Ala Leu Glu Ala Lys Glu His Leu Ser Val Ala Gly Glu Ala Ala 
                325                 330                 335     


Glu Lys Gly Cys Val Leu Leu Lys Asn Asn Gly Leu Leu Pro Leu Asp 
            340                 345                 350         


Ala Gly Ser Ile Lys Lys Ile Gly Val Ile Gly Pro Asn Ala Asn Ser 
        355                 360                 365             


Arg Ala Ala Leu Ile Gly Asn Tyr His Gly Thr Ser Ser Arg Tyr Val 
    370                 375                 380                 


Thr Val Leu Glu Gly Ile Gln Asp Tyr Leu Gly Asp Asp Val Arg Val 
385                 390                 395                 400 


Leu Tyr Ser Glu Gly Cys Asp Ile Ser Asn Asp Lys Val Glu Pro Leu 
                405                 410                 415     


Ala Gln Asp Gly Asp Arg Leu Ser Glu Ala Gln Ala Val Ala Asp Ala 
            420                 425                 430         


Ser Asp Val Val Val Leu Val Val Gly Leu Asn Glu Asn Leu Glu Gly 
        435                 440                 445             


Glu Glu Gly Asp Ala Gly Asn Gln Tyr Val Ser Gly Asp Lys Gln Asp 
    450                 455                 460                 


Leu Leu Leu Pro Ala Pro Gln Arg Lys Leu Met Gln Ala Val Phe Ala 
465                 470                 475                 480 


Ser Lys Lys Pro Thr Val Ile Val Leu Met Ala Gly Ser Ser Ile Asp 
                485                 490                 495     


Leu Glu Glu Tyr Gly Glu Lys Ala Asp Ala Ile Leu Leu Pro Trp Tyr 
            500                 505                 510         


Pro Gly Ala Arg Gly Gly Lys Thr Val Ala Asp Ile Leu Phe Gly Arg 
        515                 520                 525             


Val Ser Pro Ser Gly Lys Leu Pro Val Thr Phe Tyr Arg Asn Glu Asp 
    530                 535                 540                 


Leu Glu Lys Ile Pro Ala Phe Thr Asp Tyr Gly Met Lys Asn Gln Thr 
545                 550                 555                 560 


Tyr Arg Tyr Leu Asp Phe Ala Pro Leu Tyr Pro Phe Gly Tyr Gly Leu 
                565                 570                 575     


Thr Tyr Gly Asp Cys Ala Val Glu Thr Ala Glu Ala Cys Lys Thr Gly 
            580                 585                 590         


Asp Thr Val Arg Val Arg Ala Thr Val Lys Asn Leu Ser Gly Val Ala 
        595                 600                 605             


Thr Gln Glu Val Ile Gln Val Tyr Ala Gln Asn Glu Gly Ser Ala Thr 
    610                 615                 620                 


Ala Pro Arg Asn Pro Arg Leu Val Gly Phe Lys Arg Ile Ser Cys Pro 
625                 630                 635                 640 


Pro Gln Arg Ser Val Thr Glu Glu Trp Glu Phe Lys Ala Glu Leu Leu 
                645                 650                 655     


Asp Thr Val Gly Ala Asp Gly Arg Pro Val Phe Glu Glu His Ala Thr 
            660                 665                 670         


Leu Tyr Val Ser Leu Gly Gln Pro Asp Ala Leu Thr Glu Lys Leu Thr 
        675                 680                 685             


Gly His Ala Ala Ile Arg Ile Gly Ile 
    690                 695         


<210> 455
<211> 1482
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 455
atgaagcaca tctccatgct tgtcaacgcc agagacgtca ttgcccgcat ccgcccggag     60

atctacggcc acttcgccga gcatctgggc cgctgcatct acggcggcat cttcgtcggc    120

gaaaacagcc ccatccccaa tgtaaagggc atccgcaagg acgtcatcga cgccttccgc    180

cacattcagg tacccaccat ccgctggccc ggcggctgct tcgccgagga ataccactgg    240

caggacggcg tcggcccgct ggaaaaccgc aagaagatgg tcaacatcaa ctggggcgcg    300

gtggtggagg acaattcctt cggtacccac gagtacatgg ccctgtgcga gctgatcggc    360

tccaggccct acatcaacgg caacgtcggc tccggcacgg tgcaggaaat ggccgaatgg    420

gtagagtaca tgaccgccgt gggcacccct ctggccgagc agcggaaaga gaacggtcag    480

gacgaaccct ggaaggtgcc cttcctgggc gtgggcaacg agaactgggg ctgcggcggc    540

gaaatgaccg ccgagaccta cgccgccgaa ttccgccgct acgccgcctt ctgccgcgag    600

cacaacggca accacctctg ccgcatcgcc tgtggcccgt cctctgccga tttcggctgg    660

atggagacga tgatgaaggc catctcgaag ggctggggcg gttaccccct ggccggcggc    720

atcgacctgc attactatac catgcccatc cacccgaaga tggactccgc cacgaagttc    780

tccgacgagg agcactacgc caccatggac agcgccttct actgcgacga gctgctgacc    840

cgccacaccg agatcatgaa ccgctacgac ccggagaaca gggtgggcct cgtcatcggc    900

gaatggggct gctggcacga ggtggagccc ggcaccaacc ccggcttcct ctatcaacag    960

aacgccatgc gcgacgcgct ggtggccgcc atccacctca acatcttcaa ccgccacgca   1020

aagcgcgtca tgatggccaa ccttgcccag accgtcaacg tcctgcaggc catcctcctc   1080

accgacggcg accagctggt caaaaccccc acctattacg tcttcgacct gtacaaggcc   1140

catcaaaacg gcgccgccgt gtactgctac accgatgacg agaaggccgc cgggggctat   1200

aaggctccga tgatctcctc ctcctgctcg gtcaaggacg gcgtcatgac cctgacgctg   1260

gccaattgct ccctgacgga ggaggccgcc atcgactgcg acctctgcca cttcgccgct   1320

gcggaggcca ccgcccgcat cctcaccgcg gacgtgcgtg ccttcaacga ctttgaacac   1380

ccggaggacg tcgtcatcca ccccttcgac gtgaacctca ccggcggcag gctgaccctg   1440

accctgccgc cctgctccgt ggcggaggtc accctgcgct ga                      1482

<210> 456
<211> 493
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (300)...(486)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (428)...(431)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (479)...(482)
<223> N-glycosylation site. Prosite id = PS00001

<400> 456
Met Lys His Ile Ser Met Leu Val Asn Ala Arg Asp Val Ile Ala Arg 
1               5                   10                  15      


Ile Arg Pro Glu Ile Tyr Gly His Phe Ala Glu His Leu Gly Arg Cys 
            20                  25                  30          


Ile Tyr Gly Gly Ile Phe Val Gly Glu Asn Ser Pro Ile Pro Asn Val 
        35                  40                  45              


Lys Gly Ile Arg Lys Asp Val Ile Asp Ala Phe Arg His Ile Gln Val 
    50                  55                  60                  


Pro Thr Ile Arg Trp Pro Gly Gly Cys Phe Ala Glu Glu Tyr His Trp 
65                  70                  75                  80  


Gln Asp Gly Val Gly Pro Leu Glu Asn Arg Lys Lys Met Val Asn Ile 
                85                  90                  95      


Asn Trp Gly Ala Val Val Glu Asp Asn Ser Phe Gly Thr His Glu Tyr 
            100                 105                 110         


Met Ala Leu Cys Glu Leu Ile Gly Ser Arg Pro Tyr Ile Asn Gly Asn 
        115                 120                 125             


Val Gly Ser Gly Thr Val Gln Glu Met Ala Glu Trp Val Glu Tyr Met 
    130                 135                 140                 


Thr Ala Val Gly Thr Pro Leu Ala Glu Gln Arg Lys Glu Asn Gly Gln 
145                 150                 155                 160 


Asp Glu Pro Trp Lys Val Pro Phe Leu Gly Val Gly Asn Glu Asn Trp 
                165                 170                 175     


Gly Cys Gly Gly Glu Met Thr Ala Glu Thr Tyr Ala Ala Glu Phe Arg 
            180                 185                 190         


Arg Tyr Ala Ala Phe Cys Arg Glu His Asn Gly Asn His Leu Cys Arg 
        195                 200                 205             


Ile Ala Cys Gly Pro Ser Ser Ala Asp Phe Gly Trp Met Glu Thr Met 
    210                 215                 220                 


Met Lys Ala Ile Ser Lys Gly Trp Gly Gly Tyr Pro Leu Ala Gly Gly 
225                 230                 235                 240 


Ile Asp Leu His Tyr Tyr Thr Met Pro Ile His Pro Lys Met Asp Ser 
                245                 250                 255     


Ala Thr Lys Phe Ser Asp Glu Glu His Tyr Ala Thr Met Asp Ser Ala 
            260                 265                 270         


Phe Tyr Cys Asp Glu Leu Leu Thr Arg His Thr Glu Ile Met Asn Arg 
        275                 280                 285             


Tyr Asp Pro Glu Asn Arg Val Gly Leu Val Ile Gly Glu Trp Gly Cys 
    290                 295                 300                 


Trp His Glu Val Glu Pro Gly Thr Asn Pro Gly Phe Leu Tyr Gln Gln 
305                 310                 315                 320 


Asn Ala Met Arg Asp Ala Leu Val Ala Ala Ile His Leu Asn Ile Phe 
                325                 330                 335     


Asn Arg His Ala Lys Arg Val Met Met Ala Asn Leu Ala Gln Thr Val 
            340                 345                 350         


Asn Val Leu Gln Ala Ile Leu Leu Thr Asp Gly Asp Gln Leu Val Lys 
        355                 360                 365             


Thr Pro Thr Tyr Tyr Val Phe Asp Leu Tyr Lys Ala His Gln Asn Gly 
    370                 375                 380                 


Ala Ala Val Tyr Cys Tyr Thr Asp Asp Glu Lys Ala Ala Gly Gly Tyr 
385                 390                 395                 400 


Lys Ala Pro Met Ile Ser Ser Ser Cys Ser Val Lys Asp Gly Val Met 
                405                 410                 415     


Thr Leu Thr Leu Ala Asn Cys Ser Leu Thr Glu Glu Ala Ala Ile Asp 
            420                 425                 430         


Cys Asp Leu Cys His Phe Ala Ala Ala Glu Ala Thr Ala Arg Ile Leu 
        435                 440                 445             


Thr Ala Asp Val Arg Ala Phe Asn Asp Phe Glu His Pro Glu Asp Val 
    450                 455                 460                 


Val Ile His Pro Phe Asp Val Asn Leu Thr Gly Gly Arg Leu Thr Leu 
465                 470                 475                 480 


Thr Leu Pro Pro Cys Ser Val Ala Glu Val Thr Leu Arg 
                485                 490             


<210> 457
<211> 1401
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 457
atgatcaatc tttatggcgt ctttgacatc cggacctttg gggcccaacc ggacggagaa     60

acgccttcca ctgcggcgat tacggcggcc atcgaaactt gtgccgcggc cgggggagga    120

gtggtctaca tcccggccgg acggttcctc accggtcccc tccgcctcaa aagccacgtc    180

cggctccatc tcgaggccgg agcgcacttg ctctttagtc aggacccggc cgattatcct    240

gttctggaga cgaggtggga ggggaaggag gtcttgacct atgcacacca gatctacggc    300

gaggacctcg aaggggtcgc gattaccggt cgggggacca tcgacggccg gggcgagact    360

tggtggcgac tcttccgcgc caaagccttc acccatcccc gaccccgcct catcgccttt    420

acccgctgca aggacatcct gatagaagga gtaaccctcg tcaattcacc ggcctggacc    480

atcaatcctg tgatgtgcga gcgggtgacc atcgataagg tgactatcat caacccgccc    540

gactcgccca acaccgacgg gatcgacccc gattcctccc ggaacgtcta tatcactaac    600

tgctacattg acgtaggcga tgactgcatc gccatcaaag cgggccgaga ggactccctt    660

tatcggacgc cttgtgaaaa cattgtcatc gccaactgcc tcatgcgcca cggtcacggc    720

ggggtggtca tcggcagcga gaccagcggg ggtattcgca aggtagtcat taccaactgc    780

atcttcgagg acaccgaccg gggcattaga cttaagtccc ggcgcggacg cggcgggttc    840

gtcgaggacc tccgggcgac gaatattatc atggaaaagg tgctctgtcc cttcgtcctc    900

aacatgtact atgataccgg gggaggcgtg atcgacgagc gcgcgcatga cttagaaccc    960

catccggtaa gcgaggctac accctccttc cgccgcctct ccttcagtca cattactgcc   1020

cgggaagtgc aggccgccgc ggccttcctc tacggcctgc ccgaacagcc tctggaggac   1080

gtcttatttg acgatatctg gatagagctg gccgccgacg cttctcctgc ccgtccggcc   1140

atgatgcggg ccgtcccgcc catgagccaa ggtggtgtgc tctgctacgg tgcgcggcgg   1200

atctccttcc ggcacatgca cctccgcggg caccgcggtc cggccttcca gatcgaacgc   1260

gcggaggcgg tgcagttgat gggctgctcg accgacggca gtgaagaccc ccagcttgtc   1320

ttgggtcaag cggaggaggt caccatccgt gactgcacct ttaccgccca gcaggacccc   1380

gcaaaagaaa ggcaaaatta a                                             1401

<210> 458
<211> 466
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (34)...(390)
<223> Glycosyl hydrolases family 28

<220> 
<221> SITE
<222> (235)...(249)
<223> Polygalacturonase active site. Prosite id = PS00502

<400> 458
Met Ile Asn Leu Tyr Gly Val Phe Asp Ile Arg Thr Phe Gly Ala Gln 
1               5                   10                  15      


Pro Asp Gly Glu Thr Pro Ser Thr Ala Ala Ile Thr Ala Ala Ile Glu 
            20                  25                  30          


Thr Cys Ala Ala Ala Gly Gly Gly Val Val Tyr Ile Pro Ala Gly Arg 
        35                  40                  45              


Phe Leu Thr Gly Pro Leu Arg Leu Lys Ser His Val Arg Leu His Leu 
    50                  55                  60                  


Glu Ala Gly Ala His Leu Leu Phe Ser Gln Asp Pro Ala Asp Tyr Pro 
65                  70                  75                  80  


Val Leu Glu Thr Arg Trp Glu Gly Lys Glu Val Leu Thr Tyr Ala His 
                85                  90                  95      


Gln Ile Tyr Gly Glu Asp Leu Glu Gly Val Ala Ile Thr Gly Arg Gly 
            100                 105                 110         


Thr Ile Asp Gly Arg Gly Glu Thr Trp Trp Arg Leu Phe Arg Ala Lys 
        115                 120                 125             


Ala Phe Thr His Pro Arg Pro Arg Leu Ile Ala Phe Thr Arg Cys Lys 
    130                 135                 140                 


Asp Ile Leu Ile Glu Gly Val Thr Leu Val Asn Ser Pro Ala Trp Thr 
145                 150                 155                 160 


Ile Asn Pro Val Met Cys Glu Arg Val Thr Ile Asp Lys Val Thr Ile 
                165                 170                 175     


Ile Asn Pro Pro Asp Ser Pro Asn Thr Asp Gly Ile Asp Pro Asp Ser 
            180                 185                 190         


Ser Arg Asn Val Tyr Ile Thr Asn Cys Tyr Ile Asp Val Gly Asp Asp 
        195                 200                 205             


Cys Ile Ala Ile Lys Ala Gly Arg Glu Asp Ser Leu Tyr Arg Thr Pro 
    210                 215                 220                 


Cys Glu Asn Ile Val Ile Ala Asn Cys Leu Met Arg His Gly His Gly 
225                 230                 235                 240 


Gly Val Val Ile Gly Ser Glu Thr Ser Gly Gly Ile Arg Lys Val Val 
                245                 250                 255     


Ile Thr Asn Cys Ile Phe Glu Asp Thr Asp Arg Gly Ile Arg Leu Lys 
            260                 265                 270         


Ser Arg Arg Gly Arg Gly Gly Phe Val Glu Asp Leu Arg Ala Thr Asn 
        275                 280                 285             


Ile Ile Met Glu Lys Val Leu Cys Pro Phe Val Leu Asn Met Tyr Tyr 
    290                 295                 300                 


Asp Thr Gly Gly Gly Val Ile Asp Glu Arg Ala His Asp Leu Glu Pro 
305                 310                 315                 320 


His Pro Val Ser Glu Ala Thr Pro Ser Phe Arg Arg Leu Ser Phe Ser 
                325                 330                 335     


His Ile Thr Ala Arg Glu Val Gln Ala Ala Ala Ala Phe Leu Tyr Gly 
            340                 345                 350         


Leu Pro Glu Gln Pro Leu Glu Asp Val Leu Phe Asp Asp Ile Trp Ile 
        355                 360                 365             


Glu Leu Ala Ala Asp Ala Ser Pro Ala Arg Pro Ala Met Met Arg Ala 
    370                 375                 380                 


Val Pro Pro Met Ser Gln Gly Gly Val Leu Cys Tyr Gly Ala Arg Arg 
385                 390                 395                 400 


Ile Ser Phe Arg His Met His Leu Arg Gly His Arg Gly Pro Ala Phe 
                405                 410                 415     


Gln Ile Glu Arg Ala Glu Ala Val Gln Leu Met Gly Cys Ser Thr Asp 
            420                 425                 430         


Gly Ser Glu Asp Pro Gln Leu Val Leu Gly Gln Ala Glu Glu Val Thr 
        435                 440                 445             


Ile Arg Asp Cys Thr Phe Thr Ala Gln Gln Asp Pro Ala Lys Glu Arg 
    450                 455                 460                 


Gln Asn 
465     

<210> 459
<211> 1509
<212> DNA
<213> Bacillus licheniformis

<400> 459
atgactgtac acaaagcgaa gatgacgatt gacaaggaat ataaggtggc agagattgat     60

aagcgaattt acggctcctt tatcgagcat ctcggcagag ccgtttatga aggcatttat    120

gagcctgatc atcctgaagc tgacgaatca ggctttcgga aagatgtcat taaactagtc    180

agagaactaa aggtgccgtt tatcaggtat cccggcggaa actttgtatc tggatataac    240

tgggaggatg gagtcggacc tgtcgaacag cggccgacaa gacttgattt ggcgtgggcg    300

acaaccgagc cgaacttaat cggtacgaac gaatttatgg attgggcaaa gcttgtcggg    360

gcagaggtga atatggccgt caacctcgga acgagaggaa ttgatgccgc acgcaaccta    420

gtagaatatt gcaaccatcc gtcaggatcg tactacagcg acttgagaaa atcccacgga    480

tataaggaac cgcataaaat taaaacatgg tgcctcggca atgaaatgga cggtccatgg    540

caaatcggcc ataaaacagc ggccgaatac ggaagacttg ctgctgaagc cgcgaaggtg    600

atgaaatgga cggacccgtc gatcgagctt gtcgcctgcg gaagttcggg aagcggaatg    660

ccgacattca tcgactggga aacgaccgtg cttgaccata cgtacgaaca cgtcgagtac    720

atctcgcttc actcgtatta cggcaaccgc gacaatgatc ttccaaacta tttggcgaga    780

tcgctagaca tggatcactt tatcaaaacg gtcatctcag tctgcgacta tatgaaagcg    840

aaaaagaaaa gcaagaaaac gattcacctt tcatatgatg agtggaatgt ctggtatcac    900

tcgaatgaaa aagacaaaga agctgaacgc tgggcgaaag cgccgcacct tcttgaagac    960

atctacaact ttgaggatgc gcttctcgtc ggctgtatgc tgattacgat gctcaagcat   1020

gccgaccgcg tgaaaatcgc ctgtctggct cagcttgtca atgtcatcgc gccgattatg   1080

acagacaaag ggggagaagc atggcgtcag accattttct atccgtttat gcacgcttcc   1140

gtctacggca gaggaacggt actgcagacg gcggtatcgt ctccaaagta tgatgcgaaa   1200

gactttacgg atgtgccgta cttggagtcc gtgtctgttt tcaatgaaga agccgaggaa   1260

ttaaccgttt ttgccgtcaa ccgcgcaaca gacgccggcc ttgaaatgga agccgatatg   1320

agaagctttg aagggtacag tgtctctgag cacatcgttc tggaacacga agatcataaa   1380

gcgacgaatg aaaaagaccg caacaacgtc gttccgcaca gcggcggaga cgccaaagta   1440

tgtgacggca ggctgacggc tcaccttccg aagctttcct ggaatgtgat cagaatgaag   1500

aaacaataa                                                           1509

<210> 460
<211> 502
<212> PRT
<213> Bacillus licheniformis

<220> 
<221> DOMAIN
<222> (293)...(493)
<223> Alpha-L-arabinofuranosidase C-terminus

<400> 460
Met Thr Val His Lys Ala Lys Met Thr Ile Asp Lys Glu Tyr Lys Val 
1               5                   10                  15      


Ala Glu Ile Asp Lys Arg Ile Tyr Gly Ser Phe Ile Glu His Leu Gly 
            20                  25                  30          


Arg Ala Val Tyr Glu Gly Ile Tyr Glu Pro Asp His Pro Glu Ala Asp 
        35                  40                  45              


Glu Ser Gly Phe Arg Lys Asp Val Ile Lys Leu Val Arg Glu Leu Lys 
    50                  55                  60                  


Val Pro Phe Ile Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Tyr Asn 
65                  70                  75                  80  


Trp Glu Asp Gly Val Gly Pro Val Glu Gln Arg Pro Thr Arg Leu Asp 
                85                  90                  95      


Leu Ala Trp Ala Thr Thr Glu Pro Asn Leu Ile Gly Thr Asn Glu Phe 
            100                 105                 110         


Met Asp Trp Ala Lys Leu Val Gly Ala Glu Val Asn Met Ala Val Asn 
        115                 120                 125             


Leu Gly Thr Arg Gly Ile Asp Ala Ala Arg Asn Leu Val Glu Tyr Cys 
    130                 135                 140                 


Asn His Pro Ser Gly Ser Tyr Tyr Ser Asp Leu Arg Lys Ser His Gly 
145                 150                 155                 160 


Tyr Lys Glu Pro His Lys Ile Lys Thr Trp Cys Leu Gly Asn Glu Met 
                165                 170                 175     


Asp Gly Pro Trp Gln Ile Gly His Lys Thr Ala Ala Glu Tyr Gly Arg 
            180                 185                 190         


Leu Ala Ala Glu Ala Ala Lys Val Met Lys Trp Thr Asp Pro Ser Ile 
        195                 200                 205             


Glu Leu Val Ala Cys Gly Ser Ser Gly Ser Gly Met Pro Thr Phe Ile 
    210                 215                 220                 


Asp Trp Glu Thr Thr Val Leu Asp His Thr Tyr Glu His Val Glu Tyr 
225                 230                 235                 240 


Ile Ser Leu His Ser Tyr Tyr Gly Asn Arg Asp Asn Asp Leu Pro Asn 
                245                 250                 255     


Tyr Leu Ala Arg Ser Leu Asp Met Asp His Phe Ile Lys Thr Val Ile 
            260                 265                 270         


Ser Val Cys Asp Tyr Met Lys Ala Lys Lys Lys Ser Lys Lys Thr Ile 
        275                 280                 285             


His Leu Ser Tyr Asp Glu Trp Asn Val Trp Tyr His Ser Asn Glu Lys 
    290                 295                 300                 


Asp Lys Glu Ala Glu Arg Trp Ala Lys Ala Pro His Leu Leu Glu Asp 
305                 310                 315                 320 


Ile Tyr Asn Phe Glu Asp Ala Leu Leu Val Gly Cys Met Leu Ile Thr 
                325                 330                 335     


Met Leu Lys His Ala Asp Arg Val Lys Ile Ala Cys Leu Ala Gln Leu 
            340                 345                 350         


Val Asn Val Ile Ala Pro Ile Met Thr Asp Lys Gly Gly Glu Ala Trp 
        355                 360                 365             


Arg Gln Thr Ile Phe Tyr Pro Phe Met His Ala Ser Val Tyr Gly Arg 
    370                 375                 380                 


Gly Thr Val Leu Gln Thr Ala Val Ser Ser Pro Lys Tyr Asp Ala Lys 
385                 390                 395                 400 


Asp Phe Thr Asp Val Pro Tyr Leu Glu Ser Val Ser Val Phe Asn Glu 
                405                 410                 415     


Glu Ala Glu Glu Leu Thr Val Phe Ala Val Asn Arg Ala Thr Asp Ala 
            420                 425                 430         


Gly Leu Glu Met Glu Ala Asp Met Arg Ser Phe Glu Gly Tyr Ser Val 
        435                 440                 445             


Ser Glu His Ile Val Leu Glu His Glu Asp His Lys Ala Thr Asn Glu 
    450                 455                 460                 


Lys Asp Arg Asn Asn Val Val Pro His Ser Gly Gly Asp Ala Lys Val 
465                 470                 475                 480 


Cys Asp Gly Arg Leu Thr Ala His Leu Pro Lys Leu Ser Trp Asn Val 
                485                 490                 495     


Ile Arg Met Lys Lys Gln 
            500         


<210> 461
<211> 1503
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 461
atgaaaaaag cgcgaatgat tgtagacaaa gaatataaaa tcggtgaagt agataaacgg     60

atttatggct cgtttatcga acatatgggt cgtgcggtat atgaaggcat atacgagcct    120

gatcaccctg aagcggatga agatggattt agaaaagatg tccagtcgct gatcaaagaa    180

ttacaggttc ccatcatccg ctatccgggc ggaaactttt tatccggata caactgggag    240

gacggtgtcg gaccagtcga aaaccgcccg agacggcttg acttagcatg gcaaacgaca    300

gaaaccaatg aagtgggaac aaatgaattt ttatcttggg ccaaaaaggt gaacactgag    360

gtcaatatgg ccgtcaacct cggcacaaga ggcatagatg ccgcccgtaa tctcgttgaa    420

tactgcaacc acccgaaagg ctcttactgg agtgatttaa gaagatcgca tggctatgaa    480

cagccgtacg gcatcaaaac atggtgctta ggaaacgaaa tggatggacc gtggcagatc    540

ggccacaaaa cagctgatga atacggacgg cttgctgcag agacagcaaa ggtcatgaag    600

tgggttgacc catcaattga actcgttgcc tgcggcagct caaacagcgg tatgccgacc    660

tttatcgatt gggaagcgaa ggtgctggag catacgtatg agcatgtcga ctatatctct    720

cttcacactt actacggaaa ccgggataac aatctgccaa actacttggc ccgttccatg    780

gatttggatc attttatcaa atcagtcgct gcgacctgtg actatgtaaa agcaaaaaca    840

tgcagcaaga aaacaatcaa tctttctctg gatgaatgga acgtctggta ccactcaaat    900

gaggctgata aaaaagtcga gccgtggatc actgcgcgtc cgattttaga ggatatttac    960

aattttgaag atgccttatt agtcggctct ctgctcatta cgatgctgca gcacgcagac   1020

cgtgtgaaaa ttgcgtgtct tgcacagctt gtcaatgtca tcgcgccaat catgacggaa   1080

aaaggcggag aagcatggag acagccgatt ttctatccat acatgcatgc ttctgtttac   1140

ggaaggggcg agtcactgaa accgcttatt tcttctccta agtacgattg ttctgatttc   1200

actgatgtgc catatgttga tgctgctgtt gtgtactctg aagaggaaga aacactcact   1260

atttttgcgg taaacaaggc tgaggatcag atggagacgg agatttcgct cagaggcttt   1320

gaatcctacc aaatcgcaga gcacatcgta cttgagcatc aggatatcaa agcgacaaac   1380

cagcataaca gaaaaaatgt cgttccgcat tccaacggat catcgtctgt cagcgaaaat   1440

ggcttaactg gtcatttcac gccgctttcc tggaatgtga tccgcctgaa aaaacagtca   1500

taa                                                                 1503

<210> 462
<211> 500
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (291)...(490)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (479)...(482)
<223> N-glycosylation site. Prosite id = PS00001

<400> 462
Met Lys Lys Ala Arg Met Ile Val Asp Lys Glu Tyr Lys Ile Gly Glu 
1               5                   10                  15      


Val Asp Lys Arg Ile Tyr Gly Ser Phe Ile Glu His Met Gly Arg Ala 
            20                  25                  30          


Val Tyr Glu Gly Ile Tyr Glu Pro Asp His Pro Glu Ala Asp Glu Asp 
        35                  40                  45              


Gly Phe Arg Lys Asp Val Gln Ser Leu Ile Lys Glu Leu Gln Val Pro 
    50                  55                  60                  


Ile Ile Arg Tyr Pro Gly Gly Asn Phe Leu Ser Gly Tyr Asn Trp Glu 
65                  70                  75                  80  


Asp Gly Val Gly Pro Val Glu Asn Arg Pro Arg Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Gln Thr Thr Glu Thr Asn Glu Val Gly Thr Asn Glu Phe Leu Ser 
            100                 105                 110         


Trp Ala Lys Lys Val Asn Thr Glu Val Asn Met Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Ile Asp Ala Ala Arg Asn Leu Val Glu Tyr Cys Asn His 
    130                 135                 140                 


Pro Lys Gly Ser Tyr Trp Ser Asp Leu Arg Arg Ser His Gly Tyr Glu 
145                 150                 155                 160 


Gln Pro Tyr Gly Ile Lys Thr Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Ile Gly His Lys Thr Ala Asp Glu Tyr Gly Arg Leu Ala 
            180                 185                 190         


Ala Glu Thr Ala Lys Val Met Lys Trp Val Asp Pro Ser Ile Glu Leu 
        195                 200                 205             


Val Ala Cys Gly Ser Ser Asn Ser Gly Met Pro Thr Phe Ile Asp Trp 
    210                 215                 220                 


Glu Ala Lys Val Leu Glu His Thr Tyr Glu His Val Asp Tyr Ile Ser 
225                 230                 235                 240 


Leu His Thr Tyr Tyr Gly Asn Arg Asp Asn Asn Leu Pro Asn Tyr Leu 
                245                 250                 255     


Ala Arg Ser Met Asp Leu Asp His Phe Ile Lys Ser Val Ala Ala Thr 
            260                 265                 270         


Cys Asp Tyr Val Lys Ala Lys Thr Cys Ser Lys Lys Thr Ile Asn Leu 
        275                 280                 285             


Ser Leu Asp Glu Trp Asn Val Trp Tyr His Ser Asn Glu Ala Asp Lys 
    290                 295                 300                 


Lys Val Glu Pro Trp Ile Thr Ala Arg Pro Ile Leu Glu Asp Ile Tyr 
305                 310                 315                 320 


Asn Phe Glu Asp Ala Leu Leu Val Gly Ser Leu Leu Ile Thr Met Leu 
                325                 330                 335     


Gln His Ala Asp Arg Val Lys Ile Ala Cys Leu Ala Gln Leu Val Asn 
            340                 345                 350         


Val Ile Ala Pro Ile Met Thr Glu Lys Gly Gly Glu Ala Trp Arg Gln 
        355                 360                 365             


Pro Ile Phe Tyr Pro Tyr Met His Ala Ser Val Tyr Gly Arg Gly Glu 
    370                 375                 380                 


Ser Leu Lys Pro Leu Ile Ser Ser Pro Lys Tyr Asp Cys Ser Asp Phe 
385                 390                 395                 400 


Thr Asp Val Pro Tyr Val Asp Ala Ala Val Val Tyr Ser Glu Glu Glu 
                405                 410                 415     


Glu Thr Leu Thr Ile Phe Ala Val Asn Lys Ala Glu Asp Gln Met Glu 
            420                 425                 430         


Thr Glu Ile Ser Leu Arg Gly Phe Glu Ser Tyr Gln Ile Ala Glu His 
        435                 440                 445             


Ile Val Leu Glu His Gln Asp Ile Lys Ala Thr Asn Gln His Asn Arg 
    450                 455                 460                 


Lys Asn Val Val Pro His Ser Asn Gly Ser Ser Ser Val Ser Glu Asn 
465                 470                 475                 480 


Gly Leu Thr Gly His Phe Thr Pro Leu Ser Trp Asn Val Ile Arg Leu 
                485                 490                 495     


Lys Lys Gln Ser 
            500 


<210> 463
<211> 1509
<212> DNA
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<400> 463
atgaaaggaa tgatcagagg catgtctgaa catcaagcag tgattcaaac agatatcgta     60

aaaggaacca ttaacaaaaa tatatacggt cattttgctg agcatttagg aagagggatt    120

tatgagggga tctgggtcgg aacggactca gacatcccca atatcaacgg gatacgaaag    180

gacgtgctgg aggcgctcaa acagctgcac atccctgtcc tcaggtggcc gggcgggtgt    240

tttgcggacg aataccattg ggcaaacggt gtcggtgacc gtaagacaat gctgaacact    300

cactggggcg gtacaattga atcaaatgaa ttcggaacgc atgaatttat gatgctttgc    360

gagctgcttg aatgcgagcc atatatttgc ggcaatgtcg gaagcggaac cgttcaggaa    420

atggcggagt ggattgaata tatgacattt gaagaaggca cgccgatgtc agactggaga    480

aagcaaaatg gaagagaaga gccttggaag ctgaaatatt tcggcgtggg caatgaaaac    540

tggggctgcg gcggcaacat gcatcccgaa tactacgcag atctgtaccg gcgttttcag    600

acttatgtcc gcaattacag tgggaatgac atttataaaa ttgcaggcgg agcaaatgtg    660

gatgatttta attggacgga cgtgcttatg aaaaaagccg ctggcctgat ggacgggttg    720

agtcttcatt attacacgat tccgggggat ttctggaacg gcaaaggatc agccacagaa    780

ttcacggaag atgagtggtt tattacgatg aaaaaagcca aatacatcga tgaattgatt    840

caaaaacacg gcacgattat ggaccggtac gatccggagc agcgggtcgg gctgattatt    900

gatgaatggg gcacttggtt tgatcccgag ccaggcacga atcccggttt cttatatcag    960

caaaacacca ttcgtgatgc actggtggcg gcttctcatt tccacatttt ccatcagcat   1020

tgccgccggg tgcaaatggc caacatcgcc caaacagtaa acgttctgca agcgatgatt   1080

ttgactgagg gcgagcggat gcttttgaca ccgacgtacc atgtattcaa tatgtttaag   1140

gtgcaccagg acgcttctct tttagcaaca gagacaatgt ctgccgacta tgaatggaag   1200

ggtgaaacgc ttccgcaaat cagcatttca gcgtcgaaac aagctgaagg tgatgtcaat   1260

atcactattt gcaacatcga tcaccaaaac aaagcggagg cggaaatcga gctgagaggc   1320

ctacacaagg cagcggacca tcccggagtc attcttacgg cagaaaaaat gaatgcgcat   1380

aacacgtttg acgatcctca tcatgtcaaa ccggaatcct tcagacaata cacgctcagc   1440

aaaaacaaac tgaaagtaaa actcccgcca atgtcagtcg tcttacttac gctgcgtgct   1500

gattcttaa                                                           1509

<210> 464
<211> 502
<212> PRT
<213> Unknown

<220> 
<223> Obtained from an environmental sample

<220> 
<221> DOMAIN
<222> (301)...(492)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (208)...(211)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (227)...(230)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (426)...(429)
<223> N-glycosylation site. Prosite id = PS00001

<400> 464
Met Lys Gly Met Ile Arg Gly Met Ser Glu His Gln Ala Val Ile Gln 
1               5                   10                  15      


Thr Asp Ile Val Lys Gly Thr Ile Asn Lys Asn Ile Tyr Gly His Phe 
            20                  25                  30          


Ala Glu His Leu Gly Arg Gly Ile Tyr Glu Gly Ile Trp Val Gly Thr 
        35                  40                  45              


Asp Ser Asp Ile Pro Asn Ile Asn Gly Ile Arg Lys Asp Val Leu Glu 
    50                  55                  60                  


Ala Leu Lys Gln Leu His Ile Pro Val Leu Arg Trp Pro Gly Gly Cys 
65                  70                  75                  80  


Phe Ala Asp Glu Tyr His Trp Ala Asn Gly Val Gly Asp Arg Lys Thr 
                85                  90                  95      


Met Leu Asn Thr His Trp Gly Gly Thr Ile Glu Ser Asn Glu Phe Gly 
            100                 105                 110         


Thr His Glu Phe Met Met Leu Cys Glu Leu Leu Glu Cys Glu Pro Tyr 
        115                 120                 125             


Ile Cys Gly Asn Val Gly Ser Gly Thr Val Gln Glu Met Ala Glu Trp 
    130                 135                 140                 


Ile Glu Tyr Met Thr Phe Glu Glu Gly Thr Pro Met Ser Asp Trp Arg 
145                 150                 155                 160 


Lys Gln Asn Gly Arg Glu Glu Pro Trp Lys Leu Lys Tyr Phe Gly Val 
                165                 170                 175     


Gly Asn Glu Asn Trp Gly Cys Gly Gly Asn Met His Pro Glu Tyr Tyr 
            180                 185                 190         


Ala Asp Leu Tyr Arg Arg Phe Gln Thr Tyr Val Arg Asn Tyr Ser Gly 
        195                 200                 205             


Asn Asp Ile Tyr Lys Ile Ala Gly Gly Ala Asn Val Asp Asp Phe Asn 
    210                 215                 220                 


Trp Thr Asp Val Leu Met Lys Lys Ala Ala Gly Leu Met Asp Gly Leu 
225                 230                 235                 240 


Ser Leu His Tyr Tyr Thr Ile Pro Gly Asp Phe Trp Asn Gly Lys Gly 
                245                 250                 255     


Ser Ala Thr Glu Phe Thr Glu Asp Glu Trp Phe Ile Thr Met Lys Lys 
            260                 265                 270         


Ala Lys Tyr Ile Asp Glu Leu Ile Gln Lys His Gly Thr Ile Met Asp 
        275                 280                 285             


Arg Tyr Asp Pro Glu Gln Arg Val Gly Leu Ile Ile Asp Glu Trp Gly 
    290                 295                 300                 


Thr Trp Phe Asp Pro Glu Pro Gly Thr Asn Pro Gly Phe Leu Tyr Gln 
305                 310                 315                 320 


Gln Asn Thr Ile Arg Asp Ala Leu Val Ala Ala Ser His Phe His Ile 
                325                 330                 335     


Phe His Gln His Cys Arg Arg Val Gln Met Ala Asn Ile Ala Gln Thr 
            340                 345                 350         


Val Asn Val Leu Gln Ala Met Ile Leu Thr Glu Gly Glu Arg Met Leu 
        355                 360                 365             


Leu Thr Pro Thr Tyr His Val Phe Asn Met Phe Lys Val His Gln Asp 
    370                 375                 380                 


Ala Ser Leu Leu Ala Thr Glu Thr Met Ser Ala Asp Tyr Glu Trp Lys 
385                 390                 395                 400 


Gly Glu Thr Leu Pro Gln Ile Ser Ile Ser Ala Ser Lys Gln Ala Glu 
                405                 410                 415     


Gly Asp Val Asn Ile Thr Ile Cys Asn Ile Asp His Gln Asn Lys Ala 
            420                 425                 430         


Glu Ala Glu Ile Glu Leu Arg Gly Leu His Lys Ala Ala Asp His Pro 
        435                 440                 445             


Gly Val Ile Leu Thr Ala Glu Lys Met Asn Ala His Asn Thr Phe Asp 
    450                 455                 460                 


Asp Pro His His Val Lys Pro Glu Ser Phe Arg Gln Tyr Thr Leu Ser 
465                 470                 475                 480 


Lys Asn Lys Leu Lys Val Lys Leu Pro Pro Met Ser Val Val Leu Leu 
                485                 490                 495     


Thr Leu Arg Ala Asp Ser 
            500         


<210> 465
<211> 1503
<212> DNA
<213> Bacillus halodurans ATCC

<400> 465
atgacactta cagcaacaat ggttgtcgac aaatcgttta aaattggcga aattgataag     60

cgcatttatg gttcatttat tgagcaccta ggacgtgctg tttatgaagg aatttatgaa    120

cccggacacc ctgatgggga tgagcaaggg tttcgcaaag acgttatccg gctcgttcaa    180

gaactgcaag tgccactcgt acgctatcct ggcgggaatt ttgtatccgg ttacaactgg    240

gaggatgggg taggtcctgt ttccgaaagg ccaaagcggt tggatttagc gtggagaacg    300

acggagacga atgaaatagg gacaaatgaa tttgttgatt gggcgaaaaa ggttggggca    360

gaggtgaata tggctgtgaa cctcggctct cgcggggttg atgcagcccg taatcttgtt    420

gagtattgta accatccgtc tggttcttat tggagtgatc tacgcatctc ccatggatac    480

aaagatccgc ataatattaa aacatggtgt ttagggaatg agatggatgg tccttggcaa    540

atcgggcaaa aaacagcaga agaatacggt cgtgtagcgg cggaagcagg aaaagtgatg    600

aagctcgtag acccttccat agaactcgtt gcttgtggga gctccaacag taaaatggca    660

acgttcgccg attgggaagc aacggtctta gaccatacgt acgattatgt agactatatt    720

tcgctacata cttattacgg aaatcgtgat gatgatctag caaactatct tgctcagtcg    780

atggatatgg atgagttcat tcgctcagtg attgcgattg ctgattatgt gaaagcgaaa    840

aagcgaagca aaaaaacgat tcatctctca ttcgatgagt ggaatgtttg gttccactcc    900

aacgaagcgg atcggcaaat aactccgtgg agtgtagcac cgccattatt ggaagacatt    960

tatacatttg aagatgccct tcttgttgga agcatgctca tcacgttact gaagcatgcc   1020

gatcgtgtca aaattgcgtg tcttgctcaa ctcgtaaatg tcattgctcc gattatgact   1080

gaaaaaggtg ggccggcttg gaagcagacg attttttatc cgtacatgca cgcatcagta   1140

tacggacgtg gtgtggcttt acaagcgcaa atctcttcac ctaaatacga tagtaaagat   1200

tttacagacg ttccttattt ggatgcggcg gtggtgcatc tcgaggaagc cgaagaagtg   1260

acgatctttg cagttaataa acaccaaaca gaatcgctaa atttacaatg tgatatgcgt   1320

agctttgaag ggtatcacgt attggagcat attgtccttg aacatgaaaa tatgaaagcg   1380

acaaatcaag gacgagaaca ggtaacgcct catcacaatg gtgactccgc cattgatcaa   1440

gggcggctga cagcgaatct agcaaagcta tcttggaacg taattcggct agggaaaaaa   1500

taa                                                                 1503

<210> 466
<211> 500
<212> PRT
<213> Bacillus halodurans ATCC

<220> 
<221> DOMAIN
<222> (292)...(491)
<223> Alpha-L-arabinofuranosidase C-terminus

<400> 466
Met Thr Leu Thr Ala Thr Met Val Val Asp Lys Ser Phe Lys Ile Gly 
1               5                   10                  15      


Glu Ile Asp Lys Arg Ile Tyr Gly Ser Phe Ile Glu His Leu Gly Arg 
            20                  25                  30          


Ala Val Tyr Glu Gly Ile Tyr Glu Pro Gly His Pro Asp Gly Asp Glu 
        35                  40                  45              


Gln Gly Phe Arg Lys Asp Val Ile Arg Leu Val Gln Glu Leu Gln Val 
    50                  55                  60                  


Pro Leu Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Tyr Asn Trp 
65                  70                  75                  80  


Glu Asp Gly Val Gly Pro Val Ser Glu Arg Pro Lys Arg Leu Asp Leu 
                85                  90                  95      


Ala Trp Arg Thr Thr Glu Thr Asn Glu Ile Gly Thr Asn Glu Phe Val 
            100                 105                 110         


Asp Trp Ala Lys Lys Val Gly Ala Glu Val Asn Met Ala Val Asn Leu 
        115                 120                 125             


Gly Ser Arg Gly Val Asp Ala Ala Arg Asn Leu Val Glu Tyr Cys Asn 
    130                 135                 140                 


His Pro Ser Gly Ser Tyr Trp Ser Asp Leu Arg Ile Ser His Gly Tyr 
145                 150                 155                 160 


Lys Asp Pro His Asn Ile Lys Thr Trp Cys Leu Gly Asn Glu Met Asp 
                165                 170                 175     


Gly Pro Trp Gln Ile Gly Gln Lys Thr Ala Glu Glu Tyr Gly Arg Val 
            180                 185                 190         


Ala Ala Glu Ala Gly Lys Val Met Lys Leu Val Asp Pro Ser Ile Glu 
        195                 200                 205             


Leu Val Ala Cys Gly Ser Ser Asn Ser Lys Met Ala Thr Phe Ala Asp 
    210                 215                 220                 


Trp Glu Ala Thr Val Leu Asp His Thr Tyr Asp Tyr Val Asp Tyr Ile 
225                 230                 235                 240 


Ser Leu His Thr Tyr Tyr Gly Asn Arg Asp Asp Asp Leu Ala Asn Tyr 
                245                 250                 255     


Leu Ala Gln Ser Met Asp Met Asp Glu Phe Ile Arg Ser Val Ile Ala 
            260                 265                 270         


Ile Ala Asp Tyr Val Lys Ala Lys Lys Arg Ser Lys Lys Thr Ile His 
        275                 280                 285             


Leu Ser Phe Asp Glu Trp Asn Val Trp Phe His Ser Asn Glu Ala Asp 
    290                 295                 300                 


Arg Gln Ile Thr Pro Trp Ser Val Ala Pro Pro Leu Leu Glu Asp Ile 
305                 310                 315                 320 


Tyr Thr Phe Glu Asp Ala Leu Leu Val Gly Ser Met Leu Ile Thr Leu 
                325                 330                 335     


Leu Lys His Ala Asp Arg Val Lys Ile Ala Cys Leu Ala Gln Leu Val 
            340                 345                 350         


Asn Val Ile Ala Pro Ile Met Thr Glu Lys Gly Gly Pro Ala Trp Lys 
        355                 360                 365             


Gln Thr Ile Phe Tyr Pro Tyr Met His Ala Ser Val Tyr Gly Arg Gly 
    370                 375                 380                 


Val Ala Leu Gln Ala Gln Ile Ser Ser Pro Lys Tyr Asp Ser Lys Asp 
385                 390                 395                 400 


Phe Thr Asp Val Pro Tyr Leu Asp Ala Ala Val Val His Leu Glu Glu 
                405                 410                 415     


Ala Glu Glu Val Thr Ile Phe Ala Val Asn Lys His Gln Thr Glu Ser 
            420                 425                 430         


Leu Asn Leu Gln Cys Asp Met Arg Ser Phe Glu Gly Tyr His Val Leu 
        435                 440                 445             


Glu His Ile Val Leu Glu His Glu Asn Met Lys Ala Thr Asn Gln Gly 
    450                 455                 460                 


Arg Glu Gln Val Thr Pro His His Asn Gly Asp Ser Ala Ile Asp Gln 
465                 470                 475                 480 


Gly Arg Leu Thr Ala Asn Leu Ala Lys Leu Ser Trp Asn Val Ile Arg 
                485                 490                 495     


Leu Gly Lys Lys 
            500 


<210> 467
<211> 1455
<212> DNA
<213> Thermotoga maritima MSB8

<400> 467
atgtcctaca ggatagtggt ggatccaaaa gaagttgtca agccgattag cagacacatc     60

tacggtcatt tcacggaaca tctgggaagg tgtatctacg gcggaattta tgaagaaggt    120

tctccgctct ccgatgaaag gggtttcaga aaggacgttc tggaggctgt aaagaggata    180

aaagttccga acttgagatg gcccggtgga aacttcgtgt cgaactacca ctgggaagac    240

ggaataggtc ccaaagatca gaggcctgtt aggttcgatc tcgcctggca acaggaagag    300

acgaatagat ttggaacgga cgaattcatt gagtactgtc gtgagatagg agcagaacct    360

tacatcagta taaacatggg aactggaaca ctcgacgaag ctctccactg gcttgaatac    420

tgcaatggaa agggtaatac ctactacgct caactcagaa gaaagtacgg tcatccagaa    480

ccttacaacg taaagttctg gggaataggc aacgagatgt acggggaatg gcaggtaggc    540

cacatgacgg cggacgaata cgcaagagcc gccaaagaat acacgaaatg gatgaaggtt    600

ttcgacccta caattaaagc gatcgccgtg ggctgtgacg accccatatg gaatctcagg    660

gttcttcaag aagcaggtga tgtgattgac ttcatatcct accatttcta cacagggtcc    720

gacgattact acgaaacggt ctctacggtt taccttctca aagaaagact catcggagtg    780

aaaaagctca ttgatatggt ggatactgct agaaagagag gtgtcaaaat cgcccttgat    840

gaatggaacg tatggtacag agtgtccgat aacaagctcg aagaacctta cgatctcaaa    900

gatggtatct ttgcatgtgg agtgcttgta cttcttcaaa agatgagcga catagtccca    960

cttgccaatc tcgcacagct tgtaaacgcc cttggagcta tacacaccga gaaagacggt   1020

ctcattctca cacccgttta caaggctttt gaactcatcg tgaatcattc cggagaaaag   1080

cttgtcaaga cccatgttga atcggagact tacaacatag aaggagtcat gttcatcaac   1140

aaaatgcctt tctctgtcga gaacgcaccg ttccttgatg ccgccgcttc catctcagaa   1200

gatggcaaga aacttttcat cgctgttgta aactacagga aagaagacgc tttgaaggtt   1260

ccaatcagag tggaaggtct gggacagaaa aaagccaccg tttatacact cacaggtccg   1320

gacgtgaacg cgagaaacac catggaaaat ccgaacgtcg ttgatattac ctccgaaacc   1380

atcaccgttg acaccgaatt tgaacacacg tttaaaccat tctcttgcag tgtgattgag   1440

gtagaattgg agtaa                                                    1455

<210> 468
<211> 484
<212> PRT
<213> Thermotoga maritima MSB8

<220> 
<221> DOMAIN
<222> (280)...(475)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (360)...(363)
<223> N-glycosylation site. Prosite id = PS00001

<400> 468
Met Ser Tyr Arg Ile Val Val Asp Pro Lys Glu Val Val Lys Pro Ile 
1               5                   10                  15      


Ser Arg His Ile Tyr Gly His Phe Thr Glu His Leu Gly Arg Cys Ile 
            20                  25                  30          


Tyr Gly Gly Ile Tyr Glu Glu Gly Ser Pro Leu Ser Asp Glu Arg Gly 
        35                  40                  45              


Phe Arg Lys Asp Val Leu Glu Ala Val Lys Arg Ile Lys Val Pro Asn 
    50                  55                  60                  


Leu Arg Trp Pro Gly Gly Asn Phe Val Ser Asn Tyr His Trp Glu Asp 
65                  70                  75                  80  


Gly Ile Gly Pro Lys Asp Gln Arg Pro Val Arg Phe Asp Leu Ala Trp 
                85                  90                  95      


Gln Gln Glu Glu Thr Asn Arg Phe Gly Thr Asp Glu Phe Ile Glu Tyr 
            100                 105                 110         


Cys Arg Glu Ile Gly Ala Glu Pro Tyr Ile Ser Ile Asn Met Gly Thr 
        115                 120                 125             


Gly Thr Leu Asp Glu Ala Leu His Trp Leu Glu Tyr Cys Asn Gly Lys 
    130                 135                 140                 


Gly Asn Thr Tyr Tyr Ala Gln Leu Arg Arg Lys Tyr Gly His Pro Glu 
145                 150                 155                 160 


Pro Tyr Asn Val Lys Phe Trp Gly Ile Gly Asn Glu Met Tyr Gly Glu 
                165                 170                 175     


Trp Gln Val Gly His Met Thr Ala Asp Glu Tyr Ala Arg Ala Ala Lys 
            180                 185                 190         


Glu Tyr Thr Lys Trp Met Lys Val Phe Asp Pro Thr Ile Lys Ala Ile 
        195                 200                 205             


Ala Val Gly Cys Asp Asp Pro Ile Trp Asn Leu Arg Val Leu Gln Glu 
    210                 215                 220                 


Ala Gly Asp Val Ile Asp Phe Ile Ser Tyr His Phe Tyr Thr Gly Ser 
225                 230                 235                 240 


Asp Asp Tyr Tyr Glu Thr Val Ser Thr Val Tyr Leu Leu Lys Glu Arg 
                245                 250                 255     


Leu Ile Gly Val Lys Lys Leu Ile Asp Met Val Asp Thr Ala Arg Lys 
            260                 265                 270         


Arg Gly Val Lys Ile Ala Leu Asp Glu Trp Asn Val Trp Tyr Arg Val 
        275                 280                 285             


Ser Asp Asn Lys Leu Glu Glu Pro Tyr Asp Leu Lys Asp Gly Ile Phe 
    290                 295                 300                 


Ala Cys Gly Val Leu Val Leu Leu Gln Lys Met Ser Asp Ile Val Pro 
305                 310                 315                 320 


Leu Ala Asn Leu Ala Gln Leu Val Asn Ala Leu Gly Ala Ile His Thr 
                325                 330                 335     


Glu Lys Asp Gly Leu Ile Leu Thr Pro Val Tyr Lys Ala Phe Glu Leu 
            340                 345                 350         


Ile Val Asn His Ser Gly Glu Lys Leu Val Lys Thr His Val Glu Ser 
        355                 360                 365             


Glu Thr Tyr Asn Ile Glu Gly Val Met Phe Ile Asn Lys Met Pro Phe 
    370                 375                 380                 


Ser Val Glu Asn Ala Pro Phe Leu Asp Ala Ala Ala Ser Ile Ser Glu 
385                 390                 395                 400 


Asp Gly Lys Lys Leu Phe Ile Ala Val Val Asn Tyr Arg Lys Glu Asp 
                405                 410                 415     


Ala Leu Lys Val Pro Ile Arg Val Glu Gly Leu Gly Gln Lys Lys Ala 
            420                 425                 430         


Thr Val Tyr Thr Leu Thr Gly Pro Asp Val Asn Ala Arg Asn Thr Met 
        435                 440                 445             


Glu Asn Pro Asn Val Val Asp Ile Thr Ser Glu Thr Ile Thr Val Asp 
    450                 455                 460                 


Thr Glu Phe Glu His Thr Phe Lys Pro Phe Ser Cys Ser Val Ile Glu 
465                 470                 475                 480 


Val Glu Leu Glu 
                

<210> 469
<211> 1509
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 469
atgcaagaag ctaaaatcac tctcgaccgc gacttcgtca tcggccctat cgacccgcgc     60

ctctacggct ccttcctcga gcatttgggc cgggccatct acaccgggat ctacgaaccg    120

ggccacccca ccgccgacga actcggcttt cgccgggacg tcctggaact cgtgcgtgag    180

ctcggtgtcc ccatcgttcg ttacccgggc gggaacttcg tctccggcta taattgggag    240

gacggtgtgg gacccaggtc cgaacgacca cggcgcctgg acctgagctg gcggacggtg    300

gaaacaaacg aggtgggcct caacgaattc gccgcctggg ccaagaaggc cggggccgag    360

gtcatgctgg cggtgaacct cggcacgcgg ggcttggacg ccgcccgcaa cctggtcgag    420

tactgcaacc atcccggcgg gagctactgg agcgatctcc gccgggccca cggcgtggcc    480

gaaccgcacc gcatcaaggt ctggtgcctg gggaacgaga tggacgggcc ctggcagatc    540

gggcacaaaa cggcggagga atacggccgc ctggcctgcg agacggccaa ggtcatgcgg    600

tgggtggacc cctccatcga gctcgtggtc tgcgggagct cgggctggca gatgcccacc    660

ttcccctctt gggagatcac ggtcctggaa cacacctacg aacacgtgga ctacctttcg    720

cttcatactt acttcgggaa ccgcgacggc gatctggcca acttcctcgc ccaatcggtg    780

gggatggacc actacatccg cacggccatc gctgcttgtg actacgtcca ggcgaagaaa    840

agagggaaaa agcggattaa catctccttt gacgagtgga acgtctggta ccactccaac    900

gaggcggacc ggaagatcga accctggagt atcgccccgc cgcttttaga ggatatttat    960

aaccttgccg acgcccttgt ggtgggctgt atgttgatca ccctcctcaa acacgcggac   1020

cggataaaga tcgcctgcct ggcccagctc gtcaacgtca tcgcgccgat catgactatg   1080

aaggggggac cggcctggcg gcagaccatc ttctacccct tcctccacgc ctcccggtac   1140

ggtcacggcg tcgccttgca aacccaagtt cgggcgcctc tctacgacac caaggacttc   1200

gaagccgtgc cgctcctgga ggcggtggcc acgatggacg aggaagacgg ggaactcgcc   1260

atctttgccg tcaaccgttc ccaagaggaa gcgctggcgc tggaggtgga gatgcggggg   1320

atgagagcgg agtatctccc tcttgaacac ctcgtcctga ccgacgaaga cccccaggcg   1380

gcaaacacgg cggccgaacc ggaccggatt atgccacgcc gtcaggacgg cgaccgggtg   1440

gaagacggcc gcctcaagac cgtcctcccc aagctttcgt ggcataccat ccgcttgaaa   1500

aaagcatga                                                           1509

<210> 470
<211> 502
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(493)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (431)...(434)
<223> N-glycosylation site. Prosite id = PS00001

<400> 470
Met Gln Glu Ala Lys Ile Thr Leu Asp Arg Asp Phe Val Ile Gly Pro 
1               5                   10                  15      


Ile Asp Pro Arg Leu Tyr Gly Ser Phe Leu Glu His Leu Gly Arg Ala 
            20                  25                  30          


Ile Tyr Thr Gly Ile Tyr Glu Pro Gly His Pro Thr Ala Asp Glu Leu 
        35                  40                  45              


Gly Phe Arg Arg Asp Val Leu Glu Leu Val Arg Glu Leu Gly Val Pro 
    50                  55                  60                  


Ile Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Tyr Asn Trp Glu 
65                  70                  75                  80  


Asp Gly Val Gly Pro Arg Ser Glu Arg Pro Arg Arg Leu Asp Leu Ser 
                85                  90                  95      


Trp Arg Thr Val Glu Thr Asn Glu Val Gly Leu Asn Glu Phe Ala Ala 
            100                 105                 110         


Trp Ala Lys Lys Ala Gly Ala Glu Val Met Leu Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Leu Asp Ala Ala Arg Asn Leu Val Glu Tyr Cys Asn His 
    130                 135                 140                 


Pro Gly Gly Ser Tyr Trp Ser Asp Leu Arg Arg Ala His Gly Val Ala 
145                 150                 155                 160 


Glu Pro His Arg Ile Lys Val Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Ile Gly His Lys Thr Ala Glu Glu Tyr Gly Arg Leu Ala 
            180                 185                 190         


Cys Glu Thr Ala Lys Val Met Arg Trp Val Asp Pro Ser Ile Glu Leu 
        195                 200                 205             


Val Val Cys Gly Ser Ser Gly Trp Gln Met Pro Thr Phe Pro Ser Trp 
    210                 215                 220                 


Glu Ile Thr Val Leu Glu His Thr Tyr Glu His Val Asp Tyr Leu Ser 
225                 230                 235                 240 


Leu His Thr Tyr Phe Gly Asn Arg Asp Gly Asp Leu Ala Asn Phe Leu 
                245                 250                 255     


Ala Gln Ser Val Gly Met Asp His Tyr Ile Arg Thr Ala Ile Ala Ala 
            260                 265                 270         


Cys Asp Tyr Val Gln Ala Lys Lys Arg Gly Lys Lys Arg Ile Asn Ile 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Asn Glu Ala Asp Arg 
    290                 295                 300                 


Lys Ile Glu Pro Trp Ser Ile Ala Pro Pro Leu Leu Glu Asp Ile Tyr 
305                 310                 315                 320 


Asn Leu Ala Asp Ala Leu Val Val Gly Cys Met Leu Ile Thr Leu Leu 
                325                 330                 335     


Lys His Ala Asp Arg Ile Lys Ile Ala Cys Leu Ala Gln Leu Val Asn 
            340                 345                 350         


Val Ile Ala Pro Ile Met Thr Met Lys Gly Gly Pro Ala Trp Arg Gln 
        355                 360                 365             


Thr Ile Phe Tyr Pro Phe Leu His Ala Ser Arg Tyr Gly His Gly Val 
    370                 375                 380                 


Ala Leu Gln Thr Gln Val Arg Ala Pro Leu Tyr Asp Thr Lys Asp Phe 
385                 390                 395                 400 


Glu Ala Val Pro Leu Leu Glu Ala Val Ala Thr Met Asp Glu Glu Asp 
                405                 410                 415     


Gly Glu Leu Ala Ile Phe Ala Val Asn Arg Ser Gln Glu Glu Ala Leu 
            420                 425                 430         


Ala Leu Glu Val Glu Met Arg Gly Met Arg Ala Glu Tyr Leu Pro Leu 
        435                 440                 445             


Glu His Leu Val Leu Thr Asp Glu Asp Pro Gln Ala Ala Asn Thr Ala 
    450                 455                 460                 


Ala Glu Pro Asp Arg Ile Met Pro Arg Arg Gln Asp Gly Asp Arg Val 
465                 470                 475                 480 


Glu Asp Gly Arg Leu Lys Thr Val Leu Pro Lys Leu Ser Trp His Thr 
                485                 490                 495     


Ile Arg Leu Lys Lys Ala 
            500         

<210> 471
<211> 1509
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 471
atgaaacagg caagcatgat cctcgacaag gactatacca tcggtaagat cgatccccgt     60

atgtacggct cgttcatcga gcatctgggc cgcgcggtct acggcggcat ctacgagccc    120

ggccatccca ccgcggacga gaacggcttc cgtcgcgatg tgatggatat ggtcaggaat    180

ctgggcgtta ccatcgtgcg ctatcccggc ggcaacttcg tttccggctt caactgggag    240

gattccgtgg gtccccgcga gagccgtccg aagcgtctgg acctcgcctg gcagaccacc    300

gagaccaacg aagtaggcct gcatgaattt gtggagtggg cgcgcaagtc cggctccgaa    360

gtcatgtatg ccgtgaatct gggcacccgc ggccccgaag aggcccgcaa tgtggtggag    420

tacgccaacc acaagggcgg ctcctatctc tccgacctgc gcattaaaaa cggcatgaag    480

gaccccatgg gcatcaagct ctggtgcctc ggaaatgaga tggacggccc ctggcagatg    540

tgccacaaga ccgcttccga gtacggcaga accgcccacg aggccgcgaa gctcatgaag    600

tgggtcgatc cctccatcga gtgcgtggtc tgcggctctt ccggccacaa tatgcccacc    660

tacggcgact gggaatacga ggtgctttcc gagtgctacg acagcgtgga ttacgtttcc    720

ctccaccgct actacggaaa ccccaccaag gacactcccg gcttcctcgc ccgcaatatg    780

gacctggacg ccttcattaa ggaagtcgtg gccatctgcg atgcggtcaa gggcaagaag    840

cacggcaaaa agcagatcaa cctctccttc gacgagtgga acgtgtggta tcactcccac    900

gagcaggacc gggaaatcta caagcgtgac aagtggggaa gagcactgcc ccttctggag    960

gatgtgtaca acttcgagga cgccctgctg gcgggctcca tcctcatcac cttcctgcgc   1020

aacgccgacc gcgtcaaggt cgcctgcctc gcgcagctgg tcaacgtcat cgcccccatc   1080

atgacccgca acggcggcgg agtgtgggcg cagaccatct actggccctt cctgcacgcc   1140

tccaaatacg gccgcggaac ggcgctgaga gcactcgtaa gcagtccctc ctacgactgc   1200

aaggactatg aaaacgtgcc ctacgtggac gccaccgcca ctatggatga tgaaggcaac   1260

gtgaccatct tcgccgtgaa tcgctccatg gaggatgact tcgagctaac cgccgacctg   1320

cgctccttcg gctctctcaa ggcgggcgaa catatcctgc tccatcacga cgatgtgaac   1380

gcggtcaaca ccgaactcga tcctctgaat gtctccccga aacagggcga aaaagccaaa   1440

atcgacggcg gcagcatgag cgttaagctc cccgccctca gctggaacgt gatccgcctc   1500

accaaataa                                                           1509

<210> 472
<211> 502
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(494)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (426)...(429)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (433)...(436)
<223> N-glycosylation site. Prosite id = PS00001

<400> 472
Met Lys Gln Ala Ser Met Ile Leu Asp Lys Asp Tyr Thr Ile Gly Lys 
1               5                   10                  15      


Ile Asp Pro Arg Met Tyr Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Gly His Pro Thr Ala Asp Glu Asn 
        35                  40                  45              


Gly Phe Arg Arg Asp Val Met Asp Met Val Arg Asn Leu Gly Val Thr 
    50                  55                  60                  


Ile Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe Asn Trp Glu 
65                  70                  75                  80  


Asp Ser Val Gly Pro Arg Glu Ser Arg Pro Lys Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Gln Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Val Glu 
            100                 105                 110         


Trp Ala Arg Lys Ser Gly Ser Glu Val Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Pro Glu Glu Ala Arg Asn Val Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Lys Gly Gly Ser Tyr Leu Ser Asp Leu Arg Ile Lys Asn Gly Met Lys 
145                 150                 155                 160 


Asp Pro Met Gly Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Met Cys His Lys Thr Ala Ser Glu Tyr Gly Arg Thr Ala 
            180                 185                 190         


His Glu Ala Ala Lys Leu Met Lys Trp Val Asp Pro Ser Ile Glu Cys 
        195                 200                 205             


Val Val Cys Gly Ser Ser Gly His Asn Met Pro Thr Tyr Gly Asp Trp 
    210                 215                 220                 


Glu Tyr Glu Val Leu Ser Glu Cys Tyr Asp Ser Val Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Pro Thr Lys Asp Thr Pro Gly Phe Leu 
                245                 250                 255     


Ala Arg Asn Met Asp Leu Asp Ala Phe Ile Lys Glu Val Val Ala Ile 
            260                 265                 270         


Cys Asp Ala Val Lys Gly Lys Lys His Gly Lys Lys Gln Ile Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser His Glu Gln Asp Arg 
    290                 295                 300                 


Glu Ile Tyr Lys Arg Asp Lys Trp Gly Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Val Tyr Asn Phe Glu Asp Ala Leu Leu Ala Gly Ser Ile Leu Ile 
                325                 330                 335     


Thr Phe Leu Arg Asn Ala Asp Arg Val Lys Val Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Val 
        355                 360                 365             


Trp Ala Gln Thr Ile Tyr Trp Pro Phe Leu His Ala Ser Lys Tyr Gly 
    370                 375                 380                 


Arg Gly Thr Ala Leu Arg Ala Leu Val Ser Ser Pro Ser Tyr Asp Cys 
385                 390                 395                 400 


Lys Asp Tyr Glu Asn Val Pro Tyr Val Asp Ala Thr Ala Thr Met Asp 
                405                 410                 415     


Asp Glu Gly Asn Val Thr Ile Phe Ala Val Asn Arg Ser Met Glu Asp 
            420                 425                 430         


Asp Phe Glu Leu Thr Ala Asp Leu Arg Ser Phe Gly Ser Leu Lys Ala 
        435                 440                 445             


Gly Glu His Ile Leu Leu His His Asp Asp Val Asn Ala Val Asn Thr 
    450                 455                 460                 


Glu Leu Asp Pro Leu Asn Val Ser Pro Lys Gln Gly Glu Lys Ala Lys 
465                 470                 475                 480 


Ile Asp Gly Gly Ser Met Ser Val Lys Leu Pro Ala Leu Ser Trp Asn 
                485                 490                 495     


Val Ile Arg Leu Thr Lys 
            500         

<210> 473
<211> 1509
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 473
atgaaaaagg cgtcaatcat cattgacagg gattatataa cgggaaaaat cgacagacga     60

atctacggat cgtttatcga gcatcttggc cgtgcggtct acggaggaat ctatgaaccc    120

gggcacccac tggcggacga gctgggattt cgcagagacg tgatggaaag cgtgaaaaag    180

ctgggggttc cgattgtacg ctatccggga ggaaatttcg tttccggttt ccaatgggaa    240

gacagcgtgg gaccccgtgc atcccgtccg aaacgattgg atctggcatg gtttactacg    300

gagaccaatg aagtcggtct gcatgaattt gcaggatggg cggaaaaagc cggggcagaa    360

atgatgtatg ccgtcaacct cggaacccgc ggcccggaag aggcgcggga cgtcgtggaa    420

tatgcaaacc atacttccgg gagcctcttt tcggatatga gaattgccaa cggaaggaaa    480

gatccgttta atatcaagct gtggtgtctt ggaaacgaaa tggacggggc atggcagatg    540

ggacagaaga ctgcccggga atacggcaga acggccaacg aggctgctaa gatgatgaag    600

tgggttgacc cgaatattga ggtcgttgcc tgcggctcat ccagctccga atcaccgact    660

ttcggctcct gggagctgga aatgctggac gagtgctatg aaaacgtgga ttatgtctca    720

cttcaccgct actatggcaa cccaacagag gatacgccgg gattccttgc gcgcacgatg    780

gatatggatg attttatccg gagcgttgtt tccatgtgcg atgccgttaa ggcaaaaaaa    840

cgcagcaaac gcacgctgaa tctttctttc gatgagtgga acgtatggta tcattctgct    900

gagcaggata aggaaatctg gaaacgggat aagtggaacc gggcgcttcc tttattggaa    960

gatgtttata actttgaaga cgctcttctg gtcgggtcca tgctcattac cctcctccgc   1020

aacgcagaca gggtgaaggt tgcatgtctt gcgcagctcg tgaatgttat tgcaccgatc   1080

atgacccgta atggcggcgg ctgctgggcg cagacaatct actatccgtt catgcatgct   1140

tcccatttcg gtcagggaac ggcgctgaaa acattggtca atacgccgct gtatgattgc   1200

aaagactatg agggcgtgcc gctgatcgat gccgtggcga cagtagacga tgagggagac   1260

gtgacgttgt tctgcgtcaa ccgtgatatg actgaagatt ttgcactgga tattgacctt   1320

cgctcttttg gcaggctcac gattaaagag catattctgc tccatcatga tgatgtgaaa   1380

gctgtcaaca cagaggacaa tccgatgaat gtcgttccat gcgtcggtcc aggaggaaca   1440

atcgacggag gaaaggcaac tgtgagaatt ccggcgctga gctggaatgt gatccgattt   1500

gcaatataa                                                           1509

<210> 474
<211> 502
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(494)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (145)...(148)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 474
Met Lys Lys Ala Ser Ile Ile Ile Asp Arg Asp Tyr Ile Thr Gly Lys 
1               5                   10                  15      


Ile Asp Arg Arg Ile Tyr Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Gly His Pro Leu Ala Asp Glu Leu 
        35                  40                  45              


Gly Phe Arg Arg Asp Val Met Glu Ser Val Lys Lys Leu Gly Val Pro 
    50                  55                  60                  


Ile Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe Gln Trp Glu 
65                  70                  75                  80  


Asp Ser Val Gly Pro Arg Ala Ser Arg Pro Lys Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Phe Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Ala Gly 
            100                 105                 110         


Trp Ala Glu Lys Ala Gly Ala Glu Met Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Pro Glu Glu Ala Arg Asp Val Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Thr Ser Gly Ser Leu Phe Ser Asp Met Arg Ile Ala Asn Gly Arg Lys 
145                 150                 155                 160 


Asp Pro Phe Asn Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Ala Trp Gln Met Gly Gln Lys Thr Ala Arg Glu Tyr Gly Arg Thr Ala 
            180                 185                 190         


Asn Glu Ala Ala Lys Met Met Lys Trp Val Asp Pro Asn Ile Glu Val 
        195                 200                 205             


Val Ala Cys Gly Ser Ser Ser Ser Glu Ser Pro Thr Phe Gly Ser Trp 
    210                 215                 220                 


Glu Leu Glu Met Leu Asp Glu Cys Tyr Glu Asn Val Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Pro Thr Glu Asp Thr Pro Gly Phe Leu 
                245                 250                 255     


Ala Arg Thr Met Asp Met Asp Asp Phe Ile Arg Ser Val Val Ser Met 
            260                 265                 270         


Cys Asp Ala Val Lys Ala Lys Lys Arg Ser Lys Arg Thr Leu Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Ala Glu Gln Asp Lys 
    290                 295                 300                 


Glu Ile Trp Lys Arg Asp Lys Trp Asn Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Val Tyr Asn Phe Glu Asp Ala Leu Leu Val Gly Ser Met Leu Ile 
                325                 330                 335     


Thr Leu Leu Arg Asn Ala Asp Arg Val Lys Val Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Cys 
        355                 360                 365             


Trp Ala Gln Thr Ile Tyr Tyr Pro Phe Met His Ala Ser His Phe Gly 
    370                 375                 380                 


Gln Gly Thr Ala Leu Lys Thr Leu Val Asn Thr Pro Leu Tyr Asp Cys 
385                 390                 395                 400 


Lys Asp Tyr Glu Gly Val Pro Leu Ile Asp Ala Val Ala Thr Val Asp 
                405                 410                 415     


Asp Glu Gly Asp Val Thr Leu Phe Cys Val Asn Arg Asp Met Thr Glu 
            420                 425                 430         


Asp Phe Ala Leu Asp Ile Asp Leu Arg Ser Phe Gly Arg Leu Thr Ile 
        435                 440                 445             


Lys Glu His Ile Leu Leu His His Asp Asp Val Lys Ala Val Asn Thr 
    450                 455                 460                 


Glu Asp Asn Pro Met Asn Val Val Pro Cys Val Gly Pro Gly Gly Thr 
465                 470                 475                 480 


Ile Asp Gly Gly Lys Ala Thr Val Arg Ile Pro Ala Leu Ser Trp Asn 
                485                 490                 495     


Val Ile Arg Phe Ala Ile 
            500         

<210> 475
<211> 1512
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 475
atgcagagcg gcaagctgtc gatcgacccc gccttcgtcg tagcgcccgt caaccggcgg     60

gtcttcgggt cgttcgtcga gcacatgggt cggtgcgtgt acggcgggct gtacgagccc    120

gggcacccga ccgcggacga ggacggcctg cgcggcgacg tgctcgacct cgtgcgggag    180

atgggcgtca cggccgtgcg ctacccgggc ggcaacttcg tctcgggata ccgctgggag    240

gacggcgtcg gcccggtcga ggaccgcccg acccgcctgg acccggcgtg gaagacggtc    300

gagaccaacg cgttcgggct caacgagttc atgcgctggg cgcgcaaggc cgagatcgag    360

ccggtcatgg ccgtgaacct cggcacgcgc ggcatcgccg aggcgatcga gctgctcgag    420

tacgccaacc acccccaggg caccgcgctg tccgacctgc gcgtcgccca cggcgcgccg    480

gagccgcacg cgatccgcac gtggtgcctg ggcaacgaga tggacgggcc gtggcagctc    540

ggccacaaga cggccgagga gtacgggcgc ctggccgccg agacggcgcg cgccatgcgc    600

cagctcgagc cggacctcga gctcgtggcc tgcgggtcgt cgggccgggc gatctcgacc    660

ttcggcgcgt gggaggacac ggtcctcgag cacacgtacg acctggtcga ccacatctcg    720

gcgcacgcgt actacgagct cgacggcgac gaccaggcca gcttcctggc gtcgtcggtc    780

gacatggaca agttcatccg tgaggtcgtg gcgacggccg acgcggtggg tgcgcgcctg    840

aagtcgtcga agaagatcat gatctcgttc gacgagtgga acgtctggta caacaaggcg    900

ctcaccgagt cgggcctgcc gacggactgg acgcaggcgc cgcggctcag cgaggacgag    960

tacacgctgc tcgacgccgt cgtcgtcggg tcgctgctca tcacgctgct gcggcacagc   1020

gaccgcgtcg cgatcgcgtg ccaggcgcag ctcgtcaaca cgatcgcacc gatccgctcc   1080

gagcccggcg ggcccgcgtg gcggcagtcg atcttccacc ccttcgcgct caccgcgcag   1140

cacgcgcagg gccaggtgct ggacctgcgg gtcgacgcgc cgaccctgca gaccgccaag   1200

cacggcgagg tgtcggtgct cgactccgtc gccacctacg acgcgcagac gggccggctg   1260

gccgtgttcg tggtcaaccg tgacccctcg caggccgtcg cgttcgccac ggacctgcgg   1320

gcgttcggca ccgcgaccct caccgaggcg acggtcctgg ccggcgacga cgtgctcgcc   1380

gtcaacacgc aggccgatcc ggagcgcgtg acgccgcagc cgcacacgtc ggccacggtc   1440

gacggaacga ccctgcgcgc cgagctcccg gccgcgtcgt ggagcatgtt cctgctggac   1500

acgaccccct ga                                                       1512

<210> 476
<211> 503
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(493)
<223> Alpha-L-arabinofuranosidase C-terminus

<400> 476
Met Gln Ser Gly Lys Leu Ser Ile Asp Pro Ala Phe Val Val Ala Pro 
1               5                   10                  15      


Val Asn Arg Arg Val Phe Gly Ser Phe Val Glu His Met Gly Arg Cys 
            20                  25                  30          


Val Tyr Gly Gly Leu Tyr Glu Pro Gly His Pro Thr Ala Asp Glu Asp 
        35                  40                  45              


Gly Leu Arg Gly Asp Val Leu Asp Leu Val Arg Glu Met Gly Val Thr 
    50                  55                  60                  


Ala Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Tyr Arg Trp Glu 
65                  70                  75                  80  


Asp Gly Val Gly Pro Val Glu Asp Arg Pro Thr Arg Leu Asp Pro Ala 
                85                  90                  95      


Trp Lys Thr Val Glu Thr Asn Ala Phe Gly Leu Asn Glu Phe Met Arg 
            100                 105                 110         


Trp Ala Arg Lys Ala Glu Ile Glu Pro Val Met Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Ile Ala Glu Ala Ile Glu Leu Leu Glu Tyr Ala Asn His 
    130                 135                 140                 


Pro Gln Gly Thr Ala Leu Ser Asp Leu Arg Val Ala His Gly Ala Pro 
145                 150                 155                 160 


Glu Pro His Ala Ile Arg Thr Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Leu Gly His Lys Thr Ala Glu Glu Tyr Gly Arg Leu Ala 
            180                 185                 190         


Ala Glu Thr Ala Arg Ala Met Arg Gln Leu Glu Pro Asp Leu Glu Leu 
        195                 200                 205             


Val Ala Cys Gly Ser Ser Gly Arg Ala Ile Ser Thr Phe Gly Ala Trp 
    210                 215                 220                 


Glu Asp Thr Val Leu Glu His Thr Tyr Asp Leu Val Asp His Ile Ser 
225                 230                 235                 240 


Ala His Ala Tyr Tyr Glu Leu Asp Gly Asp Asp Gln Ala Ser Phe Leu 
                245                 250                 255     


Ala Ser Ser Val Asp Met Asp Lys Phe Ile Arg Glu Val Val Ala Thr 
            260                 265                 270         


Ala Asp Ala Val Gly Ala Arg Leu Lys Ser Ser Lys Lys Ile Met Ile 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr Asn Lys Ala Leu Thr Glu Ser 
    290                 295                 300                 


Gly Leu Pro Thr Asp Trp Thr Gln Ala Pro Arg Leu Ser Glu Asp Glu 
305                 310                 315                 320 


Tyr Thr Leu Leu Asp Ala Val Val Val Gly Ser Leu Leu Ile Thr Leu 
                325                 330                 335     


Leu Arg His Ser Asp Arg Val Ala Ile Ala Cys Gln Ala Gln Leu Val 
            340                 345                 350         


Asn Thr Ile Ala Pro Ile Arg Ser Glu Pro Gly Gly Pro Ala Trp Arg 
        355                 360                 365             


Gln Ser Ile Phe His Pro Phe Ala Leu Thr Ala Gln His Ala Gln Gly 
    370                 375                 380                 


Gln Val Leu Asp Leu Arg Val Asp Ala Pro Thr Leu Gln Thr Ala Lys 
385                 390                 395                 400 


His Gly Glu Val Ser Val Leu Asp Ser Val Ala Thr Tyr Asp Ala Gln 
                405                 410                 415     


Thr Gly Arg Leu Ala Val Phe Val Val Asn Arg Asp Pro Ser Gln Ala 
            420                 425                 430         


Val Ala Phe Ala Thr Asp Leu Arg Ala Phe Gly Thr Ala Thr Leu Thr 
        435                 440                 445             


Glu Ala Thr Val Leu Ala Gly Asp Asp Val Leu Ala Val Asn Thr Gln 
    450                 455                 460                 


Ala Asp Pro Glu Arg Val Thr Pro Gln Pro His Thr Ser Ala Thr Val 
465                 470                 475                 480 


Asp Gly Thr Thr Leu Arg Ala Glu Leu Pro Ala Ala Ser Trp Ser Met 
                485                 490                 495     


Phe Leu Leu Asp Thr Thr Pro 
            500             

<210> 477
<211> 1518
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 477
atgaaaaatg cttcaattgt tcttgataag gatttcgttg tcggagagat agacaaaagg     60

atctatggct cgttcatcga gcatctcggc cgagcggtct acggcggaat atatgaaccc    120

ggtcatcctc ttgcggacaa gcacggcttt cgcaaagatg ttctcgagag tgtgaagaag    180

ctcggtgttc ccatcgtgcg ttatcctggc ggcaatttcg tctccgggtt taattgggaa    240

gacagcgtgg gcccggtcag tcagcgcccg aatcgccttg acctcgcctg gttcacgacg    300

gagacgaacg aggtcggtct gcatgagttc gcagactggg cgaaggaagc cgggagcgac    360

cttatgtatg ccgtcaacct cggtacgcgt ggtccggaga gtgcgagaga cattgttgaa    420

tacgccaatc acccatccgg cagtctctat tcagacatgc gcatttccaa cggccgcaaa    480

gaccccttca atatcaaact ctggtgtctc ggcaatgaga tggacggccc ctggcagatg    540

ggccataaaa cggcttatga atacggccgg acggccaatg aagccgccaa gatgatgaag    600

tgggtcgacc cgtcaatcga atgcgttgcc tgcggctcat ctcacagtga gatgccgact    660

ttcggcgaat gggagtatac gatgctcggt gaatgctatg agaacgtgga ctacgtctcc    720

cttcaccggt actacggtaa tcctaccggt gacactccgg gcttcctcgc ccgagcgatg    780

gatatggacg acttcatcaa gagcgtgatt tccatctgcg atgccgtcaa gggcagaaag    840

cacagcaaga agcagatcaa cctgagtttt gacgagtgga acgtatggta tcactccagc    900

gagcaggata aagaaatctg gaagatggat aagtggaacc gtgctcttcc acttctggag    960

gacgtctaca actttgaaga cgcgctcctc gtcggctcca tgctcatcac tctcttacgt   1020

aatgctgacc gcgtgaaggt tgcatgtctg gcgcagcttg tcaacgtcat cgcgccgata   1080

atgacccgta acggcggagg ctgctgggca cagacaatct attatccctt catgcatgcc   1140

tcgaagtacg gccgtggaac ggcactcagg acgctcatct cctcccccgt ttatgactgc   1200

attgactatg aagccgtgcc atatatcgac tcagtcgcca cgatggacga cttcggcaat   1260

gtcactctct tctgcgtcaa ccgcgacctg gcagaggact tcagtctcag ccttgacctg   1320

cgctcattcg ggaagatgga acttgcagaa cacattctgc ttcaccacga cgacgtaaag   1380

gcggtgaata ccgaaacgaa tcccgaaaac gtcattccga cggccgggcc gggcggaaag   1440

gccgagagcg gaaggttcga gctccgcatc ccggctctca gttggaacgt catccgcttt   1500

acgccgagca agaagtaa                                                 1518

<210> 478
<211> 505
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(494)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (3)...(6)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (426)...(429)
<223> N-glycosylation site. Prosite id = PS00001

<400> 478
Met Lys Asn Ala Ser Ile Val Leu Asp Lys Asp Phe Val Val Gly Glu 
1               5                   10                  15      


Ile Asp Lys Arg Ile Tyr Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Gly His Pro Leu Ala Asp Lys His 
        35                  40                  45              


Gly Phe Arg Lys Asp Val Leu Glu Ser Val Lys Lys Leu Gly Val Pro 
    50                  55                  60                  


Ile Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe Asn Trp Glu 
65                  70                  75                  80  


Asp Ser Val Gly Pro Val Ser Gln Arg Pro Asn Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Phe Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Ala Asp 
            100                 105                 110         


Trp Ala Lys Glu Ala Gly Ser Asp Leu Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Pro Glu Ser Ala Arg Asp Ile Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Pro Ser Gly Ser Leu Tyr Ser Asp Met Arg Ile Ser Asn Gly Arg Lys 
145                 150                 155                 160 


Asp Pro Phe Asn Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Met Gly His Lys Thr Ala Tyr Glu Tyr Gly Arg Thr Ala 
            180                 185                 190         


Asn Glu Ala Ala Lys Met Met Lys Trp Val Asp Pro Ser Ile Glu Cys 
        195                 200                 205             


Val Ala Cys Gly Ser Ser His Ser Glu Met Pro Thr Phe Gly Glu Trp 
    210                 215                 220                 


Glu Tyr Thr Met Leu Gly Glu Cys Tyr Glu Asn Val Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Pro Thr Gly Asp Thr Pro Gly Phe Leu 
                245                 250                 255     


Ala Arg Ala Met Asp Met Asp Asp Phe Ile Lys Ser Val Ile Ser Ile 
            260                 265                 270         


Cys Asp Ala Val Lys Gly Arg Lys His Ser Lys Lys Gln Ile Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Ser Glu Gln Asp Lys 
    290                 295                 300                 


Glu Ile Trp Lys Met Asp Lys Trp Asn Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Val Tyr Asn Phe Glu Asp Ala Leu Leu Val Gly Ser Met Leu Ile 
                325                 330                 335     


Thr Leu Leu Arg Asn Ala Asp Arg Val Lys Val Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Cys 
        355                 360                 365             


Trp Ala Gln Thr Ile Tyr Tyr Pro Phe Met His Ala Ser Lys Tyr Gly 
    370                 375                 380                 


Arg Gly Thr Ala Leu Arg Thr Leu Ile Ser Ser Pro Val Tyr Asp Cys 
385                 390                 395                 400 


Ile Asp Tyr Glu Ala Val Pro Tyr Ile Asp Ser Val Ala Thr Met Asp 
                405                 410                 415     


Asp Phe Gly Asn Val Thr Leu Phe Cys Val Asn Arg Asp Leu Ala Glu 
            420                 425                 430         


Asp Phe Ser Leu Ser Leu Asp Leu Arg Ser Phe Gly Lys Met Glu Leu 
        435                 440                 445             


Ala Glu His Ile Leu Leu His His Asp Asp Val Lys Ala Val Asn Thr 
    450                 455                 460                 


Glu Thr Asn Pro Glu Asn Val Ile Pro Thr Ala Gly Pro Gly Gly Lys 
465                 470                 475                 480 


Ala Glu Ser Gly Arg Phe Glu Leu Arg Ile Pro Ala Leu Ser Trp Asn 
                485                 490                 495     


Val Ile Arg Phe Thr Pro Ser Lys Lys 
            500                 505 

<210> 479
<211> 1017
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 479
atgaagacgt tcatccttgc cgctgccgcg ttgggcgtcg ccatgccagg cgtcgcccaa     60

ccgcccatgc agagtgcgat gaaccgcagg atcgcaggcg atatcgcccc ggtccatgac    120

ccggtcatcg ctcgcgaagg agacacttat tacgtcttct ccaccggtgg atcgaaagaa    180

agcggcgggt tcatccccat tcgcacgtcc aaggatctta tccactggac tgcgcagagc    240

gcggcccttt cggccttgcc agactgggcg accaaggccg tcccgggctc gcgcgacctt    300

tgggcccctg acatatcctt cgcgaacggt cgttggcgcc tatattattc ggtgtcgacc    360

ttcggctcca atcactccgc catcggtctg gccacgagtc cgaccctcga ccccaaggct    420

cctggctacg gctggcgcga tgagggggtc gtggtacgct cgacgcggga cagcgatttc    480

aacgcgatcg atcccaattt cgtgatcgac cgcgaaggga gacattggct ttcgctgggc    540

agtttctgga gcgggctgaa gcttcttgca ctagacaagg agggcaaggt gcgaccagac    600

accgcaccgg tttcgattgc ccagcggcct gcgccggccg gcgccccggc cccggtcgag    660

gcgcccttca ttatcgaccg cggcggctat tattggctga tcgcgtccta cgattattgc    720

tgcaagggcg tgaacagcac ttactacacc gtgatcgggc ggtcgaagga cattaccggc    780

ccctatctcg gcaaggacgg aagttcgatg atgaaggggg gagggacaat ccttcttcga    840

gccgacctgc ccgaacagca gcaattccgc ggtccgggcc atgccggcgt gctgcgtgac    900

ggagagagag attacctcgt ctatcacgcc tatgatcgcg agaaaaaggg cgtgcctacc    960

ctacggatcg ctccgctgaa atggggcgcc gatggctggc ctgttgccga atattga      1017

<210> 480
<211> 338
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(19)

<220> 
<221> DOMAIN
<222> (31)...(334)
<223> Glycosyl hydrolases family 43

<220> 
<221> SITE
<222> (125)...(128)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (248)...(251)
<223> N-glycosylation site. Prosite id = PS00001

<400> 480
Met Lys Thr Phe Ile Leu Ala Ala Ala Ala Leu Gly Val Ala Met Pro 
1               5                   10                  15      


Gly Val Ala Gln Pro Pro Met Gln Ser Ala Met Asn Arg Arg Ile Ala 
            20                  25                  30          


Gly Asp Ile Ala Pro Val His Asp Pro Val Ile Ala Arg Glu Gly Asp 
        35                  40                  45              


Thr Tyr Tyr Val Phe Ser Thr Gly Gly Ser Lys Glu Ser Gly Gly Phe 
    50                  55                  60                  


Ile Pro Ile Arg Thr Ser Lys Asp Leu Ile His Trp Thr Ala Gln Ser 
65                  70                  75                  80  


Ala Ala Leu Ser Ala Leu Pro Asp Trp Ala Thr Lys Ala Val Pro Gly 
                85                  90                  95      


Ser Arg Asp Leu Trp Ala Pro Asp Ile Ser Phe Ala Asn Gly Arg Trp 
            100                 105                 110         


Arg Leu Tyr Tyr Ser Val Ser Thr Phe Gly Ser Asn His Ser Ala Ile 
        115                 120                 125             


Gly Leu Ala Thr Ser Pro Thr Leu Asp Pro Lys Ala Pro Gly Tyr Gly 
    130                 135                 140                 


Trp Arg Asp Glu Gly Val Val Val Arg Ser Thr Arg Asp Ser Asp Phe 
145                 150                 155                 160 


Asn Ala Ile Asp Pro Asn Phe Val Ile Asp Arg Glu Gly Arg His Trp 
                165                 170                 175     


Leu Ser Leu Gly Ser Phe Trp Ser Gly Leu Lys Leu Leu Ala Leu Asp 
            180                 185                 190         


Lys Glu Gly Lys Val Arg Pro Asp Thr Ala Pro Val Ser Ile Ala Gln 
        195                 200                 205             


Arg Pro Ala Pro Ala Gly Ala Pro Ala Pro Val Glu Ala Pro Phe Ile 
    210                 215                 220                 


Ile Asp Arg Gly Gly Tyr Tyr Trp Leu Ile Ala Ser Tyr Asp Tyr Cys 
225                 230                 235                 240 


Cys Lys Gly Val Asn Ser Thr Tyr Tyr Thr Val Ile Gly Arg Ser Lys 
                245                 250                 255     


Asp Ile Thr Gly Pro Tyr Leu Gly Lys Asp Gly Ser Ser Met Met Lys 
            260                 265                 270         


Gly Gly Gly Thr Ile Leu Leu Arg Ala Asp Leu Pro Glu Gln Gln Gln 
        275                 280                 285             


Phe Arg Gly Pro Gly His Ala Gly Val Leu Arg Asp Gly Glu Arg Asp 
    290                 295                 300                 


Tyr Leu Val Tyr His Ala Tyr Asp Arg Glu Lys Lys Gly Val Pro Thr 
305                 310                 315                 320 


Leu Arg Ile Ala Pro Leu Lys Trp Gly Ala Asp Gly Trp Pro Val Ala 
                325                 330                 335     


Glu Tyr 
        

<210> 481
<211> 1509
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 481
atgaaaaaag caaccatgat tatggacaag gattttgcca tcggcaagat cgacccccgc     60

atctatggct cctttatcga gcatctgggc cgtgccgtgt acggcggcat ctatgagccc    120

acccacccca ctgctgacga gaacggcttc cgtcaggacg tgatcgacat ggtgcgcaag    180

ttgaacgtgc cggtcacccg ctatcccggc ggcaacttcg tttccggctt caactgggaa    240

gacagcgttg gcccccggga tcagcggcct catcggctgg atctggcctg gttcaccacc    300

gagaccaatg aggtgggtct ccacgaattc gtggattggg ccaagaaggc caacaccgaa    360

gtgatgtacg ccgtcaatct gggcacccgt ggccccgatg cagcccgcaa tgtggtggaa    420

tacgccaacc acaagggcgg tagctactgg agcgacctgc gcatcaagaa cggtgccaag    480

gatcccttcg gcatcaagct gtggtgcctg ggcaacgaaa tggacggccc ctggcagatg    540

ggtcagaaga ccgcctacga atacggccgt gtggccagcg aagccggtaa gatgatgaag    600

tgggtggatc cctccatcga gctggtggcc tgcggttcct cctccttcca gatgccaacc    660

ttcggcacct gggagtacga gatgctcacc cagtgctacg atcagatcga ctatgtgagt    720

ctgcaccgct actacggcaa ccaaaccaac aacaccccag atttcctggc ccgcaacatg    780

gacctggacg gtttcatcaa aactgttgtg gccatctgcg atgccgtggg cggcgccaag    840

cactccaaga agaagatcaa cctgtccttt gacgagtgga acgtgtggta ccactccaac    900

gagcaggaca aggaagtgtg gaagcaggac aagtggaacc gtgccctgcc ccttctggag    960

gacatctaca acttcgagga tgcgctgctg gtgggcgcca tgctgatcac cttcctgaag   1020

aatgccgacc gtgtgaaggt ggcttgcctt gcccagctgg tgaacgtgat cgcgcccatc   1080

atgacccgca acggcggcgg cgtgtgggcc cagaccattt tctggcccct gatgcacgcc   1140

tccaagtacg gccgcggcac cgccctgcgc cccgtcattg acagccccac ctacgactgc   1200

tccgactatg agcaggtgcc gctggtggat ggcaccgcca ccctggggga cgacggctcc   1260

gtgaccatct tcgccgtgaa ccgtgacatg aacgaggata tcgtgctgaa tgccgacctg   1320

cgcggcttcg gcgatttgaa gatcgccgag catatcgtgc tgcaccacga cgacgtgaaa   1380

gccatcaaca ccgaagccaa ccccgacaac gtggctcccg ctgcgggcaa cggcggcatc   1440

atcggcggcg gacagctgga agtgaagctg cccagcctga gctggaatgt gatccgtttg   1500

gtgaagtaa                                                           1509

<210> 482
<211> 502
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(494)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (250)...(253)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 482
Met Lys Lys Ala Thr Met Ile Met Asp Lys Asp Phe Ala Ile Gly Lys 
1               5                   10                  15      


Ile Asp Pro Arg Ile Tyr Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Thr His Pro Thr Ala Asp Glu Asn 
        35                  40                  45              


Gly Phe Arg Gln Asp Val Ile Asp Met Val Arg Lys Leu Asn Val Pro 
    50                  55                  60                  


Val Thr Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe Asn Trp Glu 
65                  70                  75                  80  


Asp Ser Val Gly Pro Arg Asp Gln Arg Pro His Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Phe Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Val Asp 
            100                 105                 110         


Trp Ala Lys Lys Ala Asn Thr Glu Val Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Pro Asp Ala Ala Arg Asn Val Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Lys Gly Gly Ser Tyr Trp Ser Asp Leu Arg Ile Lys Asn Gly Ala Lys 
145                 150                 155                 160 


Asp Pro Phe Gly Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Met Gly Gln Lys Thr Ala Tyr Glu Tyr Gly Arg Val Ala 
            180                 185                 190         


Ser Glu Ala Gly Lys Met Met Lys Trp Val Asp Pro Ser Ile Glu Leu 
        195                 200                 205             


Val Ala Cys Gly Ser Ser Ser Phe Gln Met Pro Thr Phe Gly Thr Trp 
    210                 215                 220                 


Glu Tyr Glu Met Leu Thr Gln Cys Tyr Asp Gln Ile Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Gln Thr Asn Asn Thr Pro Asp Phe Leu 
                245                 250                 255     


Ala Arg Asn Met Asp Leu Asp Gly Phe Ile Lys Thr Val Val Ala Ile 
            260                 265                 270         


Cys Asp Ala Val Gly Gly Ala Lys His Ser Lys Lys Lys Ile Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Asn Glu Gln Asp Lys 
    290                 295                 300                 


Glu Val Trp Lys Gln Asp Lys Trp Asn Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Ile Tyr Asn Phe Glu Asp Ala Leu Leu Val Gly Ala Met Leu Ile 
                325                 330                 335     


Thr Phe Leu Lys Asn Ala Asp Arg Val Lys Val Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Val 
        355                 360                 365             


Trp Ala Gln Thr Ile Phe Trp Pro Leu Met His Ala Ser Lys Tyr Gly 
    370                 375                 380                 


Arg Gly Thr Ala Leu Arg Pro Val Ile Asp Ser Pro Thr Tyr Asp Cys 
385                 390                 395                 400 


Ser Asp Tyr Glu Gln Val Pro Leu Val Asp Gly Thr Ala Thr Leu Gly 
                405                 410                 415     


Asp Asp Gly Ser Val Thr Ile Phe Ala Val Asn Arg Asp Met Asn Glu 
            420                 425                 430         


Asp Ile Val Leu Asn Ala Asp Leu Arg Gly Phe Gly Asp Leu Lys Ile 
        435                 440                 445             


Ala Glu His Ile Val Leu His His Asp Asp Val Lys Ala Ile Asn Thr 
    450                 455                 460                 


Glu Ala Asn Pro Asp Asn Val Ala Pro Ala Ala Gly Asn Gly Gly Ile 
465                 470                 475                 480 


Ile Gly Gly Gly Gln Leu Glu Val Lys Leu Pro Ser Leu Ser Trp Asn 
                485                 490                 495     


Val Ile Arg Leu Val Lys 
            500         

<210> 483
<211> 1527
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 483
gtgagccagc cagatcccgc cgcagcaaag atcaccgtcg atccctcttt catcgtgggg     60

ccggtccgcc gtcgtacttt cggtgctttc gttgaacacc tcggccggtg cgtgtacacc    120

ggcatcttcg aacccggcca ccctgacgcg gacggggacg gcttccgcaa ggacgtcctg    180

gaactgaccc gcgaactggg cgtatcaacg gtgcggtacc cgggcggcaa ctttgtctcc    240

ggctaccgct gggaagacgg cgtggggccc gtggaccagc gccccacccg gctggacctg    300

gcctggcact caaccgaccc caatacggtg ggcgtggacg agttcgccaa gtggtgcgcc    360

aaggcagggg tggaacccat gatggccgtc aacctgggca cccgcggtgt ccaggaggcg    420

ctggacctcc tggagtactg caacatcgac ggcggtaccg ccctttccga ccagcgccgc    480

gcgaacggag ccgcaaatgg ttacggcatc aggatgtggt gcctgggcaa cgagatggac    540

ggcccctggc agatcggcca caagaacgcc ctcgagtacg ggcggctggc ggcggacacg    600

gcccgcggca tgcggatgat cgatccggac ctggagctgg tggcctgcgg cagctccggc    660

cccaccatgc ccaccttcgg tgagtgggag cgcgtggtcc tgacggagac ctacgacctg    720

gtggacctcg tctccgcgca ccagtacttc gaggacttcg gcgacctgca ggaacacctc    780

gcagccgcac acaaaatgga cgccttcatc ggtgacatcg tgagccacat cgaccacgtg    840

aagtcggtga agaagtccac caggcaggtg aacatctctt tcgatgagtg gaacgtgtgg    900

cacatgagcc gtgacgaatc caaagtgccc acgggcacgg attggcctgt ggctcccgta    960

ctgctggagg acacctacac ggtggcggac gccgtcgtag taggggacct cctcatcacg   1020

ctgctcagga acactgaccg ggtgcattcg gcaagcctgg cgcagctggt gaacgtgatc   1080

gcgcccatca tgaccgagcc cggcggccgg tcatggaagc agaccacctt ccaccccttc   1140

gccctgacct cccggcacgc gtcgggaaca gtgctgcagc tcgccgtcga atccccgctg   1200

gtcagcggcg gcaagacctc cggcttcgcg gccctctctg ccgtcgcaac gtatgacgcg   1260

gacaagggcg agaccgtggt gttcgcggtc aaccgctccg ccggccaggc gctcatcctg   1320

gatgctgcgg tagccgccct cggcgatgtc cgcgtggtgg aggcggtgac ctacgccaac   1380

aaggacccct actggcaggc cagcgcggac gattccacct cagtgctgcc ctcggacaac   1440

ggcaccgtgc aggtggacgg cggccggctc accgcagagc ttccggccgt gtcctggtcc   1500

atgatccggc tggcagtggg caaataa                                       1527

<210> 484
<211> 508
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (295)...(498)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (295)...(298)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (437)...(440)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (487)...(490)
<223> N-glycosylation site. Prosite id = PS00001

<400> 484
Met Ser Gln Pro Asp Pro Ala Ala Ala Lys Ile Thr Val Asp Pro Ser 
1               5                   10                  15      


Phe Ile Val Gly Pro Val Arg Arg Arg Thr Phe Gly Ala Phe Val Glu 
            20                  25                  30          


His Leu Gly Arg Cys Val Tyr Thr Gly Ile Phe Glu Pro Gly His Pro 
        35                  40                  45              


Asp Ala Asp Gly Asp Gly Phe Arg Lys Asp Val Leu Glu Leu Thr Arg 
    50                  55                  60                  


Glu Leu Gly Val Ser Thr Val Arg Tyr Pro Gly Gly Asn Phe Val Ser 
65                  70                  75                  80  


Gly Tyr Arg Trp Glu Asp Gly Val Gly Pro Val Asp Gln Arg Pro Thr 
                85                  90                  95      


Arg Leu Asp Leu Ala Trp His Ser Thr Asp Pro Asn Thr Val Gly Val 
            100                 105                 110         


Asp Glu Phe Ala Lys Trp Cys Ala Lys Ala Gly Val Glu Pro Met Met 
        115                 120                 125             


Ala Val Asn Leu Gly Thr Arg Gly Val Gln Glu Ala Leu Asp Leu Leu 
    130                 135                 140                 


Glu Tyr Cys Asn Ile Asp Gly Gly Thr Ala Leu Ser Asp Gln Arg Arg 
145                 150                 155                 160 


Ala Asn Gly Ala Ala Asn Gly Tyr Gly Ile Arg Met Trp Cys Leu Gly 
                165                 170                 175     


Asn Glu Met Asp Gly Pro Trp Gln Ile Gly His Lys Asn Ala Leu Glu 
            180                 185                 190         


Tyr Gly Arg Leu Ala Ala Asp Thr Ala Arg Gly Met Arg Met Ile Asp 
        195                 200                 205             


Pro Asp Leu Glu Leu Val Ala Cys Gly Ser Ser Gly Pro Thr Met Pro 
    210                 215                 220                 


Thr Phe Gly Glu Trp Glu Arg Val Val Leu Thr Glu Thr Tyr Asp Leu 
225                 230                 235                 240 


Val Asp Leu Val Ser Ala His Gln Tyr Phe Glu Asp Phe Gly Asp Leu 
                245                 250                 255     


Gln Glu His Leu Ala Ala Ala His Lys Met Asp Ala Phe Ile Gly Asp 
            260                 265                 270         


Ile Val Ser His Ile Asp His Val Lys Ser Val Lys Lys Ser Thr Arg 
        275                 280                 285             


Gln Val Asn Ile Ser Phe Asp Glu Trp Asn Val Trp His Met Ser Arg 
    290                 295                 300                 


Asp Glu Ser Lys Val Pro Thr Gly Thr Asp Trp Pro Val Ala Pro Val 
305                 310                 315                 320 


Leu Leu Glu Asp Thr Tyr Thr Val Ala Asp Ala Val Val Val Gly Asp 
                325                 330                 335     


Leu Leu Ile Thr Leu Leu Arg Asn Thr Asp Arg Val His Ser Ala Ser 
            340                 345                 350         


Leu Ala Gln Leu Val Asn Val Ile Ala Pro Ile Met Thr Glu Pro Gly 
        355                 360                 365             


Gly Arg Ser Trp Lys Gln Thr Thr Phe His Pro Phe Ala Leu Thr Ser 
    370                 375                 380                 


Arg His Ala Ser Gly Thr Val Leu Gln Leu Ala Val Glu Ser Pro Leu 
385                 390                 395                 400 


Val Ser Gly Gly Lys Thr Ser Gly Phe Ala Ala Leu Ser Ala Val Ala 
                405                 410                 415     


Thr Tyr Asp Ala Asp Lys Gly Glu Thr Val Val Phe Ala Val Asn Arg 
            420                 425                 430         


Ser Ala Gly Gln Ala Leu Ile Leu Asp Ala Ala Val Ala Ala Leu Gly 
        435                 440                 445             


Asp Val Arg Val Val Glu Ala Val Thr Tyr Ala Asn Lys Asp Pro Tyr 
    450                 455                 460                 


Trp Gln Ala Ser Ala Asp Asp Ser Thr Ser Val Leu Pro Ser Asp Asn 
465                 470                 475                 480 


Gly Thr Val Gln Val Asp Gly Gly Arg Leu Thr Ala Glu Leu Pro Ala 
                485                 490                 495     


Val Ser Trp Ser Met Ile Arg Leu Ala Val Gly Lys 
            500                 505             

<210> 485
<211> 1515
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 485
atgaagaaag ctaagatgat cctggacaag gattttgttg ttggtaaaat agacaaaagg     60

atctacggct cctttataga gcatctggga cgcgccgttt acggaggcat atacgagccc    120

ggacatccaa tggccgatga gctggggttt cgaaaggatg tcatggagtt cgtaaagaag    180

ctcaatgtgc cgatagtacg ctacccggga ggaaattttg tttcgggctt tcactgggag    240

gacagcgtgg ggcccaggga taaaaggccg aaacgcctgg atctggcctg gtttactacc    300

gagacaaatg aggtgggcct gcacgaattt gccgactggg cgaagaacgc gggaagcgat    360

ataatgtacg ccgtaaacct cggcagcagg ggaccggagc aggccaggga tatcgtggaa    420

tacgcaaacc acccctcggg cagtaagttc tctgatatgc gtatcgcaaa cggcagaaag    480

gatcccttta atatcaagct ctggtgtctg ggaaatgaaa tggacgggcc ctggcagatg    540

gggcagaaga ccgcgacgga atacggacgc attgccaatg aggcggccaa gatgatgaaa    600

tgggttgatc cttccatcga gcttgtggcc tgcggttcct cctccaccga aatgcccacc    660

ttcggaacct gggagcttac gatgcttgat gagtgctatg aaaacgtgga ctatgtgtca    720

ctccaccgct actatggcaa tcctaccgct gatactccgg gcttccttgc aagaaccatg    780

gatatggatg atttcataaa gagcgtggca tcgatctgcg atgccgtcaa gggcaaaaag    840

cacagcaagc atgtggtgaa cctgtccttt gatgagtgga acgtatggta tcattccaac    900

gagcaggata aggagatatg gaagcaggat aagtggaacc gtgcccttcc gctcctggag    960

gatgtatata actttgagga tgcccttctg gtagggtcga tgctgataac tctgcttaag   1020

aacgcagacc gtgtaaaggt tgcctgtctt gcacagcttg ttaatgtcat agcgcccatc   1080

atgacaagga acgggggagg cgcctgggca cagaccatct tctatccctt ctgccatgct   1140

tcaacctatg gccgcggtac atctcttaag gcccttgtgg agagccctgt atactcctgc   1200

aaggactatg atgatgttcc ttatatcgat gcgaccgcta caatggacga cgaaggcggc   1260

gtgaccgtct ttgccgttaa ccgtgatatg gaagaggatt atgagctgga ggcagacctg   1320

cgttccttcg gggagcttgc gatttcggag catatcgttc ttcatcacga cgatgtgaag   1380

gctgtcaaca cggaggatgc tcccgagaac gttgttcccg gaaagggaga gggcggcacc   1440

gtttctgacg gtaaggcatc cgtaaagctc aggcccctca gctggaatgt tatccggttt   1500

gcaaagagga agtaa                                                    1515

<210> 486
<211> 504
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(494)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 486
Met Lys Lys Ala Lys Met Ile Leu Asp Lys Asp Phe Val Val Gly Lys 
1               5                   10                  15      


Ile Asp Lys Arg Ile Tyr Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Gly His Pro Met Ala Asp Glu Leu 
        35                  40                  45              


Gly Phe Arg Lys Asp Val Met Glu Phe Val Lys Lys Leu Asn Val Pro 
    50                  55                  60                  


Ile Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe His Trp Glu 
65                  70                  75                  80  


Asp Ser Val Gly Pro Arg Asp Lys Arg Pro Lys Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Phe Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Ala Asp 
            100                 105                 110         


Trp Ala Lys Asn Ala Gly Ser Asp Ile Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Ser Arg Gly Pro Glu Gln Ala Arg Asp Ile Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Pro Ser Gly Ser Lys Phe Ser Asp Met Arg Ile Ala Asn Gly Arg Lys 
145                 150                 155                 160 


Asp Pro Phe Asn Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Met Gly Gln Lys Thr Ala Thr Glu Tyr Gly Arg Ile Ala 
            180                 185                 190         


Asn Glu Ala Ala Lys Met Met Lys Trp Val Asp Pro Ser Ile Glu Leu 
        195                 200                 205             


Val Ala Cys Gly Ser Ser Ser Thr Glu Met Pro Thr Phe Gly Thr Trp 
    210                 215                 220                 


Glu Leu Thr Met Leu Asp Glu Cys Tyr Glu Asn Val Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Pro Thr Ala Asp Thr Pro Gly Phe Leu 
                245                 250                 255     


Ala Arg Thr Met Asp Met Asp Asp Phe Ile Lys Ser Val Ala Ser Ile 
            260                 265                 270         


Cys Asp Ala Val Lys Gly Lys Lys His Ser Lys His Val Val Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Asn Glu Gln Asp Lys 
    290                 295                 300                 


Glu Ile Trp Lys Gln Asp Lys Trp Asn Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Val Tyr Asn Phe Glu Asp Ala Leu Leu Val Gly Ser Met Leu Ile 
                325                 330                 335     


Thr Leu Leu Lys Asn Ala Asp Arg Val Lys Val Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Ala 
        355                 360                 365             


Trp Ala Gln Thr Ile Phe Tyr Pro Phe Cys His Ala Ser Thr Tyr Gly 
    370                 375                 380                 


Arg Gly Thr Ser Leu Lys Ala Leu Val Glu Ser Pro Val Tyr Ser Cys 
385                 390                 395                 400 


Lys Asp Tyr Asp Asp Val Pro Tyr Ile Asp Ala Thr Ala Thr Met Asp 
                405                 410                 415     


Asp Glu Gly Gly Val Thr Val Phe Ala Val Asn Arg Asp Met Glu Glu 
            420                 425                 430         


Asp Tyr Glu Leu Glu Ala Asp Leu Arg Ser Phe Gly Glu Leu Ala Ile 
        435                 440                 445             


Ser Glu His Ile Val Leu His His Asp Asp Val Lys Ala Val Asn Thr 
    450                 455                 460                 


Glu Asp Ala Pro Glu Asn Val Val Pro Gly Lys Gly Glu Gly Gly Thr 
465                 470                 475                 480 


Val Ser Asp Gly Lys Ala Ser Val Lys Leu Arg Pro Leu Ser Trp Asn 
                485                 490                 495     


Val Ile Arg Phe Ala Lys Arg Lys 
            500                 

<210> 487
<211> 1515
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 487
atgaaaaaag ccaccatgat cctggacaaa gacttctcga tcggcaagat cgacccccgc     60

atgtacgggt ccttcatcga gcacctgggc cgcgcggtct acggcggcat ctatgagccc    120

acccatccca ccgcggataa gaacggcttc cgccgcgacg tcattgaaat ggtccggaag    180

ctcggcgtcc cggtcgtccg ctatcccggc ggcaacttcg tctccggctt caactgggag    240

gattccatcg gcccgcggga ccaacggccg aagcgcctgg acctggcctg gttcaccacc    300

gagaccaacg aggtcggcct gcatgagttc tgcgactggg cgaaggcggc ggacaccacc    360

gtcatgtacg cggtgaacct cggcacccgc ggcccggacg cggcccgcaa cgtcgtcgag    420

tacgccaacc acaagggcgg cagctactgg tccgacctgc ggatcaaaaa cggcgcgaag    480

aacccgctgg ggatcaagct ctggtgcctg ggcaacgaga tggacggtcc ctggcagatc    540

ggccacaaga ctgcctacga gtacggccgc gtagctaacg aggccgccaa ggtcatgaag    600

tgggtcgacc cctccatcga gctggttgcc tgcggcagcg ccgcccacga catgccgacc    660

tacggcgact gggaatacac catgctcaac gagtgctacg agaacgtgga ctacgtctcc    720

ctccaccgct actacggcaa ccccaccaac gacacccccg gcttcctggc ccgcagcatg    780

gacctggatg atttcatcag ggaagtcgtc gcgatctgtg atgccgtcgg cggccggaag    840

cattcaaaga agaagctgaa cctgtccttt gacgagtgga acgtctggta ccactccaac    900

cagcaggacc aggaggtctg gaaggcggac aagtggggcc gcgccctgcc gctgctcgag    960

gacgtctaca actttgagga cgcgctgctc gccggcgcga tcctgatcac cttcctgaag   1020

aacgccgacc gcgtgaaggt cgcctgcctc gcccagctgg tgaacgtcat cgccccgatc   1080

atgacccgca acggcggcgg cgtctgggcc cagacgatct tctggccgat gatgcacgcc   1140

tcgaagtacg gccggggcac cgccctgcgg cccgtcctct cctccccggt ctacgactgc   1200

agggacttcg agaaggtccc gttggtggac gccgcggcga ccctcgggga cgacggcagc   1260

gtcacgatct tcgcgatcaa ccgggacggg aaggaggaca tcgccctcga ttgcgacctc   1320

cgcgccttca gcggcctggt gcctgccgag cacatcgtcc tgcaccatga cgacgtgaag   1380

gccgtcaaca ccgagaagaa ccctgacgag gtggtcccga agaacggccg caaggcgaag   1440

ctcgacgttg gaaagatgac cgtcaaactc cccgccctct cctggaacgt catccgcctg   1500

accccggaaa aataa                                                    1515

<210> 488
<211> 504
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(494)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 488
Met Lys Lys Ala Thr Met Ile Leu Asp Lys Asp Phe Ser Ile Gly Lys 
1               5                   10                  15      


Ile Asp Pro Arg Met Tyr Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Thr His Pro Thr Ala Asp Lys Asn 
        35                  40                  45              


Gly Phe Arg Arg Asp Val Ile Glu Met Val Arg Lys Leu Gly Val Pro 
    50                  55                  60                  


Val Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe Asn Trp Glu 
65                  70                  75                  80  


Asp Ser Ile Gly Pro Arg Asp Gln Arg Pro Lys Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Phe Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Cys Asp 
            100                 105                 110         


Trp Ala Lys Ala Ala Asp Thr Thr Val Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Thr Arg Gly Pro Asp Ala Ala Arg Asn Val Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Lys Gly Gly Ser Tyr Trp Ser Asp Leu Arg Ile Lys Asn Gly Ala Lys 
145                 150                 155                 160 


Asn Pro Leu Gly Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Ile Gly His Lys Thr Ala Tyr Glu Tyr Gly Arg Val Ala 
            180                 185                 190         


Asn Glu Ala Ala Lys Val Met Lys Trp Val Asp Pro Ser Ile Glu Leu 
        195                 200                 205             


Val Ala Cys Gly Ser Ala Ala His Asp Met Pro Thr Tyr Gly Asp Trp 
    210                 215                 220                 


Glu Tyr Thr Met Leu Asn Glu Cys Tyr Glu Asn Val Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Pro Thr Asn Asp Thr Pro Gly Phe Leu 
                245                 250                 255     


Ala Arg Ser Met Asp Leu Asp Asp Phe Ile Arg Glu Val Val Ala Ile 
            260                 265                 270         


Cys Asp Ala Val Gly Gly Arg Lys His Ser Lys Lys Lys Leu Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Asn Gln Gln Asp Gln 
    290                 295                 300                 


Glu Val Trp Lys Ala Asp Lys Trp Gly Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Val Tyr Asn Phe Glu Asp Ala Leu Leu Ala Gly Ala Ile Leu Ile 
                325                 330                 335     


Thr Phe Leu Lys Asn Ala Asp Arg Val Lys Val Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Val 
        355                 360                 365             


Trp Ala Gln Thr Ile Phe Trp Pro Met Met His Ala Ser Lys Tyr Gly 
    370                 375                 380                 


Arg Gly Thr Ala Leu Arg Pro Val Leu Ser Ser Pro Val Tyr Asp Cys 
385                 390                 395                 400 


Arg Asp Phe Glu Lys Val Pro Leu Val Asp Ala Ala Ala Thr Leu Gly 
                405                 410                 415     


Asp Asp Gly Ser Val Thr Ile Phe Ala Ile Asn Arg Asp Gly Lys Glu 
            420                 425                 430         


Asp Ile Ala Leu Asp Cys Asp Leu Arg Ala Phe Ser Gly Leu Val Pro 
        435                 440                 445             


Ala Glu His Ile Val Leu His His Asp Asp Val Lys Ala Val Asn Thr 
    450                 455                 460                 


Glu Lys Asn Pro Asp Glu Val Val Pro Lys Asn Gly Arg Lys Ala Lys 
465                 470                 475                 480 


Leu Asp Val Gly Lys Met Thr Val Lys Leu Pro Ala Leu Ser Trp Asn 
                485                 490                 495     


Val Ile Arg Leu Thr Pro Glu Lys 
            500                 

<210> 489
<211> 1506
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 489
atgaagaaag caaggataat aatagataag gattttacca taggcgagat cgacaagagg     60

atattcggct cctttataga gcatctgggc cgcgccgtgt acggaggcat atatgagccg    120

ggtcatcccg aggctgatga gctgggcttt cgtaaggatg ttttaggcct ggtaaaaaag    180

ctcaatatac cggtagtccg ctatcccggc ggaaatttcg tatccggctt taactgggag    240

gacagcgtag gacccaggga gaagcgcccg aagaggctgg atctggcctg gtttactacg    300

gagacgaacg aggtggggct tcatgaattc gcggactgga cgaagaaggc cggaagtgag    360

cttatgtatg cggtaaactt aggcagcagg ggccccgagc aggccaggga tattgtggaa    420

tatgccaatc atatttccgg cagcaaatat tccgatatgc gtattgccaa cggaagaaag    480

gagcccttcg ggataaagct ctggtgcctg ggcaatgaga tggacgggcc ctggcagatg    540

gggcagaaga gcgcaaggga atatggccgg gtggccaatg aagccgccaa gatgatgaaa    600

tgggtggatc cctccatcga ggtagtggcc tgcggctcct cctctacgga gatgcccacc    660

ttcggctcct gggagctgga aatgctggag gaatgctatg aaaacgtgga ttatgtatcc    720

ctccacaggt attacggcaa tcccaccggg gatacaccgg gcttccttgc caggaccatg    780

gatatggacg gctttataaa aagcgttgcc gcgatctgtg atgcggtcag gggaaagaag    840

cacagctccc atatcgtgaa tctttccttt gatgaatgga atgtctggta tcattcaaac    900

gagcaggata aggagatctg gaagcaggac aaatggaaca gggcccttcc gcttctggag    960

gatgtttata attttgagga tgcactcctg gtgggttcaa tgctcatcac ccttattaaa   1020

aacgctgaca gggtaaagat agcctgcctg gcccagctcg ttaatgtcat cgcacccatc   1080

atgaccagga acggcggggg agtatgggcc cagaccacct tttatccctt tatgcatgcc   1140

tctctttacg gccgcggtgt cgcgctcaat gcccttacag acagtcccgt ttattcctgt   1200

gaggactatg aaaatgtacc cttcatagac gctgcggcgg tgatggatgg cgatgagctt   1260

accgtttttg cggttaaccg ggatatggag gaggattacg agctggagct tgacttaagg   1320

tgctttaagg agctgtctgt aaaggagcac atcctgctgc atcatgatga cgtaaaggcg   1380

gtcaataccg aggaggcccc tgaaaccgtt gtccctgtgg caggaccggg cggaaggctc   1440

gagggaggaa aggcgacggt caggataccg gccttaagct ggaatgtgat acgttttacg   1500

gtataa                                                              1506

<210> 490
<211> 501
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (291)...(493)
<223> Alpha-L-arabinofuranosidase C-terminus

<220> 
<221> SITE
<222> (291)...(294)
<223> N-glycosylation site. Prosite id = PS00001

<400> 490
Met Lys Lys Ala Arg Ile Ile Ile Asp Lys Asp Phe Thr Ile Gly Glu 
1               5                   10                  15      


Ile Asp Lys Arg Ile Phe Gly Ser Phe Ile Glu His Leu Gly Arg Ala 
            20                  25                  30          


Val Tyr Gly Gly Ile Tyr Glu Pro Gly His Pro Glu Ala Asp Glu Leu 
        35                  40                  45              


Gly Phe Arg Lys Asp Val Leu Gly Leu Val Lys Lys Leu Asn Ile Pro 
    50                  55                  60                  


Val Val Arg Tyr Pro Gly Gly Asn Phe Val Ser Gly Phe Asn Trp Glu 
65                  70                  75                  80  


Asp Ser Val Gly Pro Arg Glu Lys Arg Pro Lys Arg Leu Asp Leu Ala 
                85                  90                  95      


Trp Phe Thr Thr Glu Thr Asn Glu Val Gly Leu His Glu Phe Ala Asp 
            100                 105                 110         


Trp Thr Lys Lys Ala Gly Ser Glu Leu Met Tyr Ala Val Asn Leu Gly 
        115                 120                 125             


Ser Arg Gly Pro Glu Gln Ala Arg Asp Ile Val Glu Tyr Ala Asn His 
    130                 135                 140                 


Ile Ser Gly Ser Lys Tyr Ser Asp Met Arg Ile Ala Asn Gly Arg Lys 
145                 150                 155                 160 


Glu Pro Phe Gly Ile Lys Leu Trp Cys Leu Gly Asn Glu Met Asp Gly 
                165                 170                 175     


Pro Trp Gln Met Gly Gln Lys Ser Ala Arg Glu Tyr Gly Arg Val Ala 
            180                 185                 190         


Asn Glu Ala Ala Lys Met Met Lys Trp Val Asp Pro Ser Ile Glu Val 
        195                 200                 205             


Val Ala Cys Gly Ser Ser Ser Thr Glu Met Pro Thr Phe Gly Ser Trp 
    210                 215                 220                 


Glu Leu Glu Met Leu Glu Glu Cys Tyr Glu Asn Val Asp Tyr Val Ser 
225                 230                 235                 240 


Leu His Arg Tyr Tyr Gly Asn Pro Thr Gly Asp Thr Pro Gly Phe Leu 
                245                 250                 255     


Ala Arg Thr Met Asp Met Asp Gly Phe Ile Lys Ser Val Ala Ala Ile 
            260                 265                 270         


Cys Asp Ala Val Arg Gly Lys Lys His Ser Ser His Ile Val Asn Leu 
        275                 280                 285             


Ser Phe Asp Glu Trp Asn Val Trp Tyr His Ser Asn Glu Gln Asp Lys 
    290                 295                 300                 


Glu Ile Trp Lys Gln Asp Lys Trp Asn Arg Ala Leu Pro Leu Leu Glu 
305                 310                 315                 320 


Asp Val Tyr Asn Phe Glu Asp Ala Leu Leu Val Gly Ser Met Leu Ile 
                325                 330                 335     


Thr Leu Ile Lys Asn Ala Asp Arg Val Lys Ile Ala Cys Leu Ala Gln 
            340                 345                 350         


Leu Val Asn Val Ile Ala Pro Ile Met Thr Arg Asn Gly Gly Gly Val 
        355                 360                 365             


Trp Ala Gln Thr Thr Phe Tyr Pro Phe Met His Ala Ser Leu Tyr Gly 
    370                 375                 380                 


Arg Gly Val Ala Leu Asn Ala Leu Thr Asp Ser Pro Val Tyr Ser Cys 
385                 390                 395                 400 


Glu Asp Tyr Glu Asn Val Pro Phe Ile Asp Ala Ala Ala Val Met Asp 
                405                 410                 415     


Gly Asp Glu Leu Thr Val Phe Ala Val Asn Arg Asp Met Glu Glu Asp 
            420                 425                 430         


Tyr Glu Leu Glu Leu Asp Leu Arg Cys Phe Lys Glu Leu Ser Val Lys 
        435                 440                 445             


Glu His Ile Leu Leu His His Asp Asp Val Lys Ala Val Asn Thr Glu 
    450                 455                 460                 


Glu Ala Pro Glu Thr Val Val Pro Val Ala Gly Pro Gly Gly Arg Leu 
465                 470                 475                 480 


Glu Gly Gly Lys Ala Thr Val Arg Ile Pro Ala Leu Ser Trp Asn Val 
                485                 490                 495     


Ile Arg Phe Thr Val 
            500     

<210> 491
<211> 1434
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 491
gtgagcaacc ctgccacccc gcccgcagtc ggcgtcctcg acgagcgtgc gccgctgacc     60

ttcccgccgg gcttcctctg gggagcggcc accgccgcgt accagatcga aggggcagcg    120

gccgagggtg ggcgcacccc gtcgatctgg gacaccttca gccacacgga gggcaagacg    180

gtctccgggc acaccggtga cgtcgcctgc gaccactacc accggctctc cgacgacgtg    240

cggctgatgg ccgagctggg gttgaagtcg taccgcttct ccgtctcctg gccgcgggtg    300

cagccgggcg ggtcgggacc ggtgaacgcc gaagggctgg acttctaccg gcggctggtc    360

gacgagttgc tgaccaacgg catcgagccc tggatcaccc tctaccactg ggacctgccc    420

caggagttgg aggacgccgg cggttggccg gcccgggaca ccgccgcccg gttcgccgac    480

tacgcccagc tgatggcgga cgcgctgggt gaccgggtga agtactggac caccctcaac    540

gagccctggt gctcggcctt cctcggctac ggctccggcg tacacgcgcc gggccgctcg    600

gacggcgccg ccgccgtcca ggccgggcac cacctgatgc tcggccacgg gctcgcggtg    660

caggcgctgc gcgcggctcg gccggaggcg cagctcggcg tgaccgtcaa cctgtacccg    720

gtcacgccgg ccagcgacac gcccggcgac gtggacgccg cccggcgcat cgacgggctg    780

gccaaccggt tcttcctcga cccgctgctg cgcggggagt accccgcgga cctggtcgcc    840

gacctggcca aggtgaccga cttcgggcac gtgcgggacg gggacctggc cgtgatcgcc    900

acgccgctgg acctggtcgg ggtgaactac tacagccggc acgtggtggc cgcgccggca    960

gccggcgagg agccggagaa gtactggcgg gcgccgtcct gctggccggg cagcgaggag   1020

gtccggttcg tcacccgggg cgtgccggtg accgacatgg gctgggagat cgacgcaccc   1080

ggcctggtgg agacgctgcg ccgggtccac gaggagtaca ccgacctgcc gctctacgtg   1140

accgagaacg ggtccgcctt cgtcgacgcg gtggtcgacg gccgggtgga cgacaccgac   1200

cggctggcgt acttcgacgc gcacctgcgg gcctcgcacg aagcgatcag cgccggagtg   1260

cccctgcagg ggtactttgc ctggtcgctg ttggataatt tcgaatgggc ctggggttac   1320

accaagcggt tcggcatggt ctacgtcgac tacgacagcc agaagcgcat tcccaagtcc   1380

agtgccaggt ggtacgcgga ggtgattcga cgcaacggtc tggccgcaca ataa         1434

<210> 492
<211> 477
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (17)...(474)
<223> Glycosyl hydrolase family 1

<220> 
<221> SITE
<222> (25)...(39)
<223> Glycosyl hydrolases family 1 N-terminal signature. Prosite id = PS00653

<220> 
<221> SITE
<222> (388)...(391)
<223> N-glycosylation site. Prosite id = PS00001

<400> 492
Met Ser Asn Pro Ala Thr Pro Pro Ala Val Gly Val Leu Asp Glu Arg 
1               5                   10                  15      


Ala Pro Leu Thr Phe Pro Pro Gly Phe Leu Trp Gly Ala Ala Thr Ala 
            20                  25                  30          


Ala Tyr Gln Ile Glu Gly Ala Ala Ala Glu Gly Gly Arg Thr Pro Ser 
        35                  40                  45              


Ile Trp Asp Thr Phe Ser His Thr Glu Gly Lys Thr Val Ser Gly His 
    50                  55                  60                  


Thr Gly Asp Val Ala Cys Asp His Tyr His Arg Leu Ser Asp Asp Val 
65                  70                  75                  80  


Arg Leu Met Ala Glu Leu Gly Leu Lys Ser Tyr Arg Phe Ser Val Ser 
                85                  90                  95      


Trp Pro Arg Val Gln Pro Gly Gly Ser Gly Pro Val Asn Ala Glu Gly 
            100                 105                 110         


Leu Asp Phe Tyr Arg Arg Leu Val Asp Glu Leu Leu Thr Asn Gly Ile 
        115                 120                 125             


Glu Pro Trp Ile Thr Leu Tyr His Trp Asp Leu Pro Gln Glu Leu Glu 
    130                 135                 140                 


Asp Ala Gly Gly Trp Pro Ala Arg Asp Thr Ala Ala Arg Phe Ala Asp 
145                 150                 155                 160 


Tyr Ala Gln Leu Met Ala Asp Ala Leu Gly Asp Arg Val Lys Tyr Trp 
                165                 170                 175     


Thr Thr Leu Asn Glu Pro Trp Cys Ser Ala Phe Leu Gly Tyr Gly Ser 
            180                 185                 190         


Gly Val His Ala Pro Gly Arg Ser Asp Gly Ala Ala Ala Val Gln Ala 
        195                 200                 205             


Gly His His Leu Met Leu Gly His Gly Leu Ala Val Gln Ala Leu Arg 
    210                 215                 220                 


Ala Ala Arg Pro Glu Ala Gln Leu Gly Val Thr Val Asn Leu Tyr Pro 
225                 230                 235                 240 


Val Thr Pro Ala Ser Asp Thr Pro Gly Asp Val Asp Ala Ala Arg Arg 
                245                 250                 255     


Ile Asp Gly Leu Ala Asn Arg Phe Phe Leu Asp Pro Leu Leu Arg Gly 
            260                 265                 270         


Glu Tyr Pro Ala Asp Leu Val Ala Asp Leu Ala Lys Val Thr Asp Phe 
        275                 280                 285             


Gly His Val Arg Asp Gly Asp Leu Ala Val Ile Ala Thr Pro Leu Asp 
    290                 295                 300                 


Leu Val Gly Val Asn Tyr Tyr Ser Arg His Val Val Ala Ala Pro Ala 
305                 310                 315                 320 


Ala Gly Glu Glu Pro Glu Lys Tyr Trp Arg Ala Pro Ser Cys Trp Pro 
                325                 330                 335     


Gly Ser Glu Glu Val Arg Phe Val Thr Arg Gly Val Pro Val Thr Asp 
            340                 345                 350         


Met Gly Trp Glu Ile Asp Ala Pro Gly Leu Val Glu Thr Leu Arg Arg 
        355                 360                 365             


Val His Glu Glu Tyr Thr Asp Leu Pro Leu Tyr Val Thr Glu Asn Gly 
    370                 375                 380                 


Ser Ala Phe Val Asp Ala Val Val Asp Gly Arg Val Asp Asp Thr Asp 
385                 390                 395                 400 


Arg Leu Ala Tyr Phe Asp Ala His Leu Arg Ala Ser His Glu Ala Ile 
                405                 410                 415     


Ser Ala Gly Val Pro Leu Gln Gly Tyr Phe Ala Trp Ser Leu Leu Asp 
            420                 425                 430         


Asn Phe Glu Trp Ala Trp Gly Tyr Thr Lys Arg Phe Gly Met Val Tyr 
        435                 440                 445             


Val Asp Tyr Asp Ser Gln Lys Arg Ile Pro Lys Ser Ser Ala Arg Trp 
    450                 455                 460                 


Tyr Ala Glu Val Ile Arg Arg Asn Gly Leu Ala Ala Gln 
465                 470                 475         



<210> 493
<211> 2592
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 493
ttgtcacaca aaaccatacg cgcatttgta cagaagtttg ccgttttggc cttgacggcg     60

gttccttttc cacacgacac tcactttatc gtgagcacat acccgaacgc cgtgtctcag    120

cgcatggcga tattaaaaac agggagtaaa ccaacaatga ttcatatcaa aacccccaag    180

ctgtcgcgcc gcctgttcgg cgcttccagt ttcgcgctga tgctgagcgc ggccacacat    240

gctgcggcac aaaaggcgaa aatgcccaaa atgccaaagc tggccaagga cgctccgctc    300

tataagcaac cgaacgcgcc gatcaatgat cgcgtcgagg acttgctggg ccgcatgacg    360

ctggaagaaa aagtagctca gatgcagtgc atctggcaga tgaaagcccg cgttcaggac    420

gctcagggca atttcgacgc taaaaaggcg tcggctgagc atccgaacgg catgggtatg    480

ctgggccgcc ccagcgatcg tcgtatggcc gccatcgacg tggccggcgc ggctgccggt    540

gacgcgggcg atattcagaa ccgcaacgcc tatgatacgg cggtctatgt caacgccgct    600

cagcgctggg ccgttgagga cacccgtctc ggcatcccga tgatgaccca cgaagaatcg    660

ctgcacggct atgtggcccg cgatgcgact tccttcccgc aggccattgg tctggcctcg    720

tccttcgatc ccgaaatggc cgagcgtatc ttcagcgtct gcgcgcgtga aatgcgcgcg    780

cgcggtgcgt tcctggcgct gtcgcccgtg gtggatattg cccgcgatcc gcgttggggc    840

cgtatcgaag aaacctatgg cgaagacgcg cacgtcaacg ccgaaatggg tatcgcggcg    900

gttaacggct ttaccggccg cacgcttccg ttggccaagg ataaggtgtt cgccacgctt    960

aagcacatga ccggtcacgg tgagccgcaa aacggcacga acgttggccc ggcacaggtg   1020

tccgagcgcg tgctgcgtga agacttcttc ccgccgttcg agcgtatcat caaggaaacc   1080

aaaattgcgg cggtcatgcc gtcctataac gaaatcgacg gtctgccgtc acacgccaac   1140

cgctggctgc tgacgaccat tctgcgcggc gagtggggct ttgaaggtac gacggtgtcc   1200

gactattacg ccatccgcga actgatcgag cgtcacaagc tggtgcctga tcttaaggaa   1260

gccgcctatc gtgccgtcca tgccggtgtg gatgtcgaaa ccgctgaccc cagcgcctat   1320

ccgtttatcc ctgagctgat tgccgaaggc cgtctgaccc ttgacgaagt tgatggtccg   1380

gtgcgtcgta ttctgcgtga aaagttcgag gcgggtctgt ttgaaaaccc ctatgtcgac   1440

ccgaacgtcg ctgacagcct gaccggcctg ccggacgctg tggctctggc ccatgaagcg   1500

gctaccaaat cggtggttct gctgaagaac aatggcctgc tgccgctggt gcataacaag   1560

gtgggcaagg tgctggttct gggtacgcac gccaaagaca cccccatcgg cggttattcc   1620

gacattccgc gtcacgttgt gtcgatcctt gatggtcttg aaaaagaagg caaggagcac   1680

ggctttgaag tggcctattc cgaagccgtg cgcatcacca aggaacgcat ctggggtcag   1740

gacgaggtca atttcgttga gccggaagtc aaccgccaac ttatcgccga agccgttgag   1800

gctgccaaaa ctgctgacac catcatcatg gtcatcggcg acaacgagca gacctcacgc   1860

gaagcctggg ccgacaacca ccttggcgac cgtcacaccc tgcatctgat gggtgaacag   1920

atggaactgg cccgtgcgat cttcgccctg aagaagccga ccgttacctt cctgctcaat   1980

ggtcgtccga tgatcatcga agaactggtc gaagggtctg acgccctgat cgaaggctgg   2040

tacatgggtc aggaaaccgg ctatgcggcc gctgatatcc tgttcggtcg cgccaacccg   2100

ggcggcaagc ttccggtctc gttcccgcgc agcgaaggtc agcttccggt ctattacaac   2160

cacaagccca cggcgcgccg cggctatctt gatggttcga ccaagcctct gttcccgttc   2220

ggctatggcc tcagctacac caccttcgac atgtccgcac cgcgcctgtc gcaagccact   2280

attggcattg acggcagcgt tgaagtctcg gtcgatgtca ccaacaccgg cgcccgcgca   2340

ggtgatgaag tggttcaggt ctatatccgc gacgacttct cgtccgtcac ccgtccggtg   2400

cttgagctta agcacttcaa gcgcgtgagc ctgcaaccgg gtgaaaagaa gacggtgagc   2460

ttcaccatcg gcaagcaaca gctccagttc tacggcattg acatgaagcg tatcgtagag   2520

ccgggcacct tcaccatctc cgctggcccg aacagcgtcg atctgaagtc tgtcacgttg   2580

accgtggcct aa                                                       2592

<210> 494
<211> 863
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (204)...(435)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (505)...(748)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (336)...(339)
<223> N-glycosylation site. Prosite id = PS00001

<400> 494
Met Ser His Lys Thr Ile Arg Ala Phe Val Gln Lys Phe Ala Val Leu 
1               5                   10                  15      


Ala Leu Thr Ala Val Pro Phe Pro His Asp Thr His Phe Ile Val Ser 
            20                  25                  30          


Thr Tyr Pro Asn Ala Val Ser Gln Arg Met Ala Ile Leu Lys Thr Gly 
        35                  40                  45              


Ser Lys Pro Thr Met Ile His Ile Lys Thr Pro Lys Leu Ser Arg Arg 
    50                  55                  60                  


Leu Phe Gly Ala Ser Ser Phe Ala Leu Met Leu Ser Ala Ala Thr His 
65                  70                  75                  80  


Ala Ala Ala Gln Lys Ala Lys Met Pro Lys Met Pro Lys Leu Ala Lys 
                85                  90                  95      


Asp Ala Pro Leu Tyr Lys Gln Pro Asn Ala Pro Ile Asn Asp Arg Val 
            100                 105                 110         


Glu Asp Leu Leu Gly Arg Met Thr Leu Glu Glu Lys Val Ala Gln Met 
        115                 120                 125             


Gln Cys Ile Trp Gln Met Lys Ala Arg Val Gln Asp Ala Gln Gly Asn 
    130                 135                 140                 


Phe Asp Ala Lys Lys Ala Ser Ala Glu His Pro Asn Gly Met Gly Met 
145                 150                 155                 160 


Leu Gly Arg Pro Ser Asp Arg Arg Met Ala Ala Ile Asp Val Ala Gly 
                165                 170                 175     


Ala Ala Ala Gly Asp Ala Gly Asp Ile Gln Asn Arg Asn Ala Tyr Asp 
            180                 185                 190         


Thr Ala Val Tyr Val Asn Ala Ala Gln Arg Trp Ala Val Glu Asp Thr 
        195                 200                 205             


Arg Leu Gly Ile Pro Met Met Thr His Glu Glu Ser Leu His Gly Tyr 
    210                 215                 220                 


Val Ala Arg Asp Ala Thr Ser Phe Pro Gln Ala Ile Gly Leu Ala Ser 
225                 230                 235                 240 


Ser Phe Asp Pro Glu Met Ala Glu Arg Ile Phe Ser Val Cys Ala Arg 
                245                 250                 255     


Glu Met Arg Ala Arg Gly Ala Phe Leu Ala Leu Ser Pro Val Val Asp 
            260                 265                 270         


Ile Ala Arg Asp Pro Arg Trp Gly Arg Ile Glu Glu Thr Tyr Gly Glu 
        275                 280                 285             


Asp Ala His Val Asn Ala Glu Met Gly Ile Ala Ala Val Asn Gly Phe 
    290                 295                 300                 


Thr Gly Arg Thr Leu Pro Leu Ala Lys Asp Lys Val Phe Ala Thr Leu 
305                 310                 315                 320 


Lys His Met Thr Gly His Gly Glu Pro Gln Asn Gly Thr Asn Val Gly 
                325                 330                 335     


Pro Ala Gln Val Ser Glu Arg Val Leu Arg Glu Asp Phe Phe Pro Pro 
            340                 345                 350         


Phe Glu Arg Ile Ile Lys Glu Thr Lys Ile Ala Ala Val Met Pro Ser 
        355                 360                 365             


Tyr Asn Glu Ile Asp Gly Leu Pro Ser His Ala Asn Arg Trp Leu Leu 
    370                 375                 380                 


Thr Thr Ile Leu Arg Gly Glu Trp Gly Phe Glu Gly Thr Thr Val Ser 
385                 390                 395                 400 


Asp Tyr Tyr Ala Ile Arg Glu Leu Ile Glu Arg His Lys Leu Val Pro 
                405                 410                 415     


Asp Leu Lys Glu Ala Ala Tyr Arg Ala Val His Ala Gly Val Asp Val 
            420                 425                 430         


Glu Thr Ala Asp Pro Ser Ala Tyr Pro Phe Ile Pro Glu Leu Ile Ala 
        435                 440                 445             


Glu Gly Arg Leu Thr Leu Asp Glu Val Asp Gly Pro Val Arg Arg Ile 
    450                 455                 460                 


Leu Arg Glu Lys Phe Glu Ala Gly Leu Phe Glu Asn Pro Tyr Val Asp 
465                 470                 475                 480 


Pro Asn Val Ala Asp Ser Leu Thr Gly Leu Pro Asp Ala Val Ala Leu 
                485                 490                 495     


Ala His Glu Ala Ala Thr Lys Ser Val Val Leu Leu Lys Asn Asn Gly 
            500                 505                 510         


Leu Leu Pro Leu Val His Asn Lys Val Gly Lys Val Leu Val Leu Gly 
        515                 520                 525             


Thr His Ala Lys Asp Thr Pro Ile Gly Gly Tyr Ser Asp Ile Pro Arg 
    530                 535                 540                 


His Val Val Ser Ile Leu Asp Gly Leu Glu Lys Glu Gly Lys Glu His 
545                 550                 555                 560 


Gly Phe Glu Val Ala Tyr Ser Glu Ala Val Arg Ile Thr Lys Glu Arg 
                565                 570                 575     


Ile Trp Gly Gln Asp Glu Val Asn Phe Val Glu Pro Glu Val Asn Arg 
            580                 585                 590         


Gln Leu Ile Ala Glu Ala Val Glu Ala Ala Lys Thr Ala Asp Thr Ile 
        595                 600                 605             


Ile Met Val Ile Gly Asp Asn Glu Gln Thr Ser Arg Glu Ala Trp Ala 
    610                 615                 620                 


Asp Asn His Leu Gly Asp Arg His Thr Leu His Leu Met Gly Glu Gln 
625                 630                 635                 640 


Met Glu Leu Ala Arg Ala Ile Phe Ala Leu Lys Lys Pro Thr Val Thr 
                645                 650                 655     


Phe Leu Leu Asn Gly Arg Pro Met Ile Ile Glu Glu Leu Val Glu Gly 
            660                 665                 670         


Ser Asp Ala Leu Ile Glu Gly Trp Tyr Met Gly Gln Glu Thr Gly Tyr 
        675                 680                 685             


Ala Ala Ala Asp Ile Leu Phe Gly Arg Ala Asn Pro Gly Gly Lys Leu 
    690                 695                 700                 


Pro Val Ser Phe Pro Arg Ser Glu Gly Gln Leu Pro Val Tyr Tyr Asn 
705                 710                 715                 720 


His Lys Pro Thr Ala Arg Arg Gly Tyr Leu Asp Gly Ser Thr Lys Pro 
                725                 730                 735     


Leu Phe Pro Phe Gly Tyr Gly Leu Ser Tyr Thr Thr Phe Asp Met Ser 
            740                 745                 750         


Ala Pro Arg Leu Ser Gln Ala Thr Ile Gly Ile Asp Gly Ser Val Glu 
        755                 760                 765             


Val Ser Val Asp Val Thr Asn Thr Gly Ala Arg Ala Gly Asp Glu Val 
    770                 775                 780                 


Val Gln Val Tyr Ile Arg Asp Asp Phe Ser Ser Val Thr Arg Pro Val 
785                 790                 795                 800 


Leu Glu Leu Lys His Phe Lys Arg Val Ser Leu Gln Pro Gly Glu Lys 
                805                 810                 815     


Lys Thr Val Ser Phe Thr Ile Gly Lys Gln Gln Leu Gln Phe Tyr Gly 
            820                 825                 830         


Ile Asp Met Lys Arg Ile Val Glu Pro Gly Thr Phe Thr Ile Ser Ala 
        835                 840                 845             


Gly Pro Asn Ser Val Asp Leu Lys Ser Val Thr Leu Thr Val Ala 
    850                 855                 860             



<210> 495
<211> 972
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 495
atgaaaaaac caagatacct ggttaaggat ttatacaccg ctgatccttc ggcgcacgta     60

ttcaacggga agatttacat ctacccctca cacgatgtag aatccggcat tccggagaat    120

gacctcgggg atcatttcga catgcgggat taccatgtat tctccatgga ttcgatcgac    180

ggtgaggtga ccgaccacgg ggttgtcctg gctgttaagg acatcccttg ggcgggccgc    240

cagctctggg ccccggatgc cgccttcaaa aacggccaat attaccttta tttcccgctg    300

aaggataaaa atgatatttt cagaataggt gtggcggtga gtgataagcc cgaaggccca    360

ttcactcctc aaagtgatcc gataaaggga agcttcagca ttgatccggc ggtattggac    420

gacggtgacg gaaactttta catgtatttc gggggattgt ggggaggaca acttcagcgg    480

taccgcaaca acaaggccat agaatgcggt cacgaaccgg ccggtgatga acccgctctg    540

tgcgccaggg tggcccgact gagagatgac atgctggaat tcgccgaaga accccgcgat    600

gtggtcattt tggatgaaaa cggcgaacca ctaagggctg gcgatcacga caggcgctac    660

tttgaagggc cgtggatgca caaatacaag ggcaagtact atttctccta ttcaaccgga    720

aatacccact ttctgtgtta tgccacaggc gacagtccat atgggccttt cacctatcag    780

ggtgtgatcc tcacaccggt gataggttgg accacccacc attcgattgt ggaattccat    840

gggaaatggt acctcttcca ccacgacagt aagccatcgg ggggcaaaac ctggcttcgg    900

agtataaaag tggttgagtt ggagtataat cccgacggaa ccataaagac tctggacggc    960

ctggctgact aa                                                        972

<210> 496
<211> 323
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (21)...(315)
<223> Glycosyl hydrolases family 43

<400> 496
Met Lys Lys Pro Arg Tyr Leu Val Lys Asp Leu Tyr Thr Ala Asp Pro 
1               5                   10                  15      


Ser Ala His Val Phe Asn Gly Lys Ile Tyr Ile Tyr Pro Ser His Asp 
            20                  25                  30          


Val Glu Ser Gly Ile Pro Glu Asn Asp Leu Gly Asp His Phe Asp Met 
        35                  40                  45              


Arg Asp Tyr His Val Phe Ser Met Asp Ser Ile Asp Gly Glu Val Thr 
    50                  55                  60                  


Asp His Gly Val Val Leu Ala Val Lys Asp Ile Pro Trp Ala Gly Arg 
65                  70                  75                  80  


Gln Leu Trp Ala Pro Asp Ala Ala Phe Lys Asn Gly Gln Tyr Tyr Leu 
                85                  90                  95      


Tyr Phe Pro Leu Lys Asp Lys Asn Asp Ile Phe Arg Ile Gly Val Ala 
            100                 105                 110         


Val Ser Asp Lys Pro Glu Gly Pro Phe Thr Pro Gln Ser Asp Pro Ile 
        115                 120                 125             


Lys Gly Ser Phe Ser Ile Asp Pro Ala Val Leu Asp Asp Gly Asp Gly 
    130                 135                 140                 


Asn Phe Tyr Met Tyr Phe Gly Gly Leu Trp Gly Gly Gln Leu Gln Arg 
145                 150                 155                 160 


Tyr Arg Asn Asn Lys Ala Ile Glu Cys Gly His Glu Pro Ala Gly Asp 
                165                 170                 175     


Glu Pro Ala Leu Cys Ala Arg Val Ala Arg Leu Arg Asp Asp Met Leu 
            180                 185                 190         


Glu Phe Ala Glu Glu Pro Arg Asp Val Val Ile Leu Asp Glu Asn Gly 
        195                 200                 205             


Glu Pro Leu Arg Ala Gly Asp His Asp Arg Arg Tyr Phe Glu Gly Pro 
    210                 215                 220                 


Trp Met His Lys Tyr Lys Gly Lys Tyr Tyr Phe Ser Tyr Ser Thr Gly 
225                 230                 235                 240 


Asn Thr His Phe Leu Cys Tyr Ala Thr Gly Asp Ser Pro Tyr Gly Pro 
                245                 250                 255     


Phe Thr Tyr Gln Gly Val Ile Leu Thr Pro Val Ile Gly Trp Thr Thr 
            260                 265                 270         


His His Ser Ile Val Glu Phe His Gly Lys Trp Tyr Leu Phe His His 
        275                 280                 285             


Asp Ser Lys Pro Ser Gly Gly Lys Thr Trp Leu Arg Ser Ile Lys Val 
    290                 295                 300                 


Val Glu Leu Glu Tyr Asn Pro Asp Gly Thr Ile Lys Thr Leu Asp Gly 
305                 310                 315                 320 


Leu Ala Asp 
            

<210> 497
<211> 1572
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 497
atgaccctag ccaatccggt tctttccggg ttcaaccccg atccgtccat tctgcgggtc     60

ggtgatgatt actacattgc cacgtccacc tttgagtggt ttcccggagt gcagatactg    120

cactccaggg acctggtgca ttggcgacag atcgcctttc cgctgaatcg ctactcgcaa    180

ctggatttgc gcggtcatcc caactccgcc ggagtctggg cgccgtgtct gagctacagc    240

gacgggctct tctatctgat ctttaccgac acgcgcagct ggaccggagc cttcaaggat    300

acgcacaact atctggtaac cgcgccggcg attaccggcc cctggtccga accggtgtac    360

ctgaacagtt ccggttttga tccctccctg tttcacgacg acgatggccg caagtggctg    420

gtcaacatgg tgtgggacca ccggccggag cgaccgcact tcggtggcat actgttacag    480

gagtacgacc cggagcagca gcgtctgacg gggccggtgc gcaacatctt tcgcggtacc    540

gaactgggat tggtggaagg cccgcatctg tacaaggtgg acggctacta ttacctgctg    600

accgccgagg gtggtacctt tctcactcac gccgccactg tggcacggtc ccgggacatc    660

ggcggtccct atgaagtgat gcccggcaat ccgctgatca ccagcgccca tcgacctgaa    720

ctgcggctca agtccgccgg tcacggctcg ctggtacagc atcgcgatgg cagttggagc    780

ctggcgcatc tgtgtcgccg ccacctgccc aatggccgcg ccattctcgg gcgggagacg    840

gccctgcaga atatcgagtg ggtggatggc tggccgcgac tggcgtccgg cgatgtggtg    900

ccgctggatg agttccagcc gccgccactg ccgccacatc cctggccacc cgaaccggcc    960

cgggatgatt tcgatgcgcc ggaactggcg ctgtgctggc aggcgcccag agtcgcgctg   1020

gacgggcaga tgctcagcct gagtgagcgg ccagggcact tgcgtctgtt tggccgcgag   1080

agcccgcgct cgcactttga acagagtctg gtggcgcggc ggcagcaggc gtttcatatt   1140

gaggcgtcca cctgtctgcg tttcgagccg gagcattttc agcagatggc cggtctgatg   1200

gcctactaca acaccgataa tttctattac ctgtttgtct cccggtccga gcatgcacag   1260

aagtgtctgg gcttgatgcg ctgtgaacag ggtcaggttt cctggcccat tgaaaaggaa   1320

tacccgctgg acaattggga gagcatctac ctgaaactgg tgatcgacca ccagcgcatc   1380

aatttctact attcaccgga cggcgaacac tggacgattg ccgggtttga gcaggacgcg   1440

tcgatactgt ccgatgaaca cgccgtgcct ctgggtttta ccggaaactt tgtcggtatg   1500

gcgtgtcagg atctgtcggg tacccgcagg gcggccgact tcgattggtt tgaatatcgg   1560

gagatggggt aa                                                       1572

<210> 498
<211> 523
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (3)...(292)
<223> Glycosyl hydrolases family 43

<220> 
<221> SITE
<222> (123)...(126)
<223> N-glycosylation site. Prosite id = PS00001

<400> 498
Met Thr Leu Ala Asn Pro Val Leu Ser Gly Phe Asn Pro Asp Pro Ser 
1               5                   10                  15      


Ile Leu Arg Val Gly Asp Asp Tyr Tyr Ile Ala Thr Ser Thr Phe Glu 
            20                  25                  30          


Trp Phe Pro Gly Val Gln Ile Leu His Ser Arg Asp Leu Val His Trp 
        35                  40                  45              


Arg Gln Ile Ala Phe Pro Leu Asn Arg Tyr Ser Gln Leu Asp Leu Arg 
    50                  55                  60                  


Gly His Pro Asn Ser Ala Gly Val Trp Ala Pro Cys Leu Ser Tyr Ser 
65                  70                  75                  80  


Asp Gly Leu Phe Tyr Leu Ile Phe Thr Asp Thr Arg Ser Trp Thr Gly 
                85                  90                  95      


Ala Phe Lys Asp Thr His Asn Tyr Leu Val Thr Ala Pro Ala Ile Thr 
            100                 105                 110         


Gly Pro Trp Ser Glu Pro Val Tyr Leu Asn Ser Ser Gly Phe Asp Pro 
        115                 120                 125             


Ser Leu Phe His Asp Asp Asp Gly Arg Lys Trp Leu Val Asn Met Val 
    130                 135                 140                 


Trp Asp His Arg Pro Glu Arg Pro His Phe Gly Gly Ile Leu Leu Gln 
145                 150                 155                 160 


Glu Tyr Asp Pro Glu Gln Gln Arg Leu Thr Gly Pro Val Arg Asn Ile 
                165                 170                 175     


Phe Arg Gly Thr Glu Leu Gly Leu Val Glu Gly Pro His Leu Tyr Lys 
            180                 185                 190         


Val Asp Gly Tyr Tyr Tyr Leu Leu Thr Ala Glu Gly Gly Thr Phe Leu 
        195                 200                 205             


Thr His Ala Ala Thr Val Ala Arg Ser Arg Asp Ile Gly Gly Pro Tyr 
    210                 215                 220                 


Glu Val Met Pro Gly Asn Pro Leu Ile Thr Ser Ala His Arg Pro Glu 
225                 230                 235                 240 


Leu Arg Leu Lys Ser Ala Gly His Gly Ser Leu Val Gln His Arg Asp 
                245                 250                 255     


Gly Ser Trp Ser Leu Ala His Leu Cys Arg Arg His Leu Pro Asn Gly 
            260                 265                 270         


Arg Ala Ile Leu Gly Arg Glu Thr Ala Leu Gln Asn Ile Glu Trp Val 
        275                 280                 285             


Asp Gly Trp Pro Arg Leu Ala Ser Gly Asp Val Val Pro Leu Asp Glu 
    290                 295                 300                 


Phe Gln Pro Pro Pro Leu Pro Pro His Pro Trp Pro Pro Glu Pro Ala 
305                 310                 315                 320 


Arg Asp Asp Phe Asp Ala Pro Glu Leu Ala Leu Cys Trp Gln Ala Pro 
                325                 330                 335     


Arg Val Ala Leu Asp Gly Gln Met Leu Ser Leu Ser Glu Arg Pro Gly 
            340                 345                 350         


His Leu Arg Leu Phe Gly Arg Glu Ser Pro Arg Ser His Phe Glu Gln 
        355                 360                 365             


Ser Leu Val Ala Arg Arg Gln Gln Ala Phe His Ile Glu Ala Ser Thr 
    370                 375                 380                 


Cys Leu Arg Phe Glu Pro Glu His Phe Gln Gln Met Ala Gly Leu Met 
385                 390                 395                 400 


Ala Tyr Tyr Asn Thr Asp Asn Phe Tyr Tyr Leu Phe Val Ser Arg Ser 
                405                 410                 415     


Glu His Ala Gln Lys Cys Leu Gly Leu Met Arg Cys Glu Gln Gly Gln 
            420                 425                 430         


Val Ser Trp Pro Ile Glu Lys Glu Tyr Pro Leu Asp Asn Trp Glu Ser 
        435                 440                 445             


Ile Tyr Leu Lys Leu Val Ile Asp His Gln Arg Ile Asn Phe Tyr Tyr 
    450                 455                 460                 


Ser Pro Asp Gly Glu His Trp Thr Ile Ala Gly Phe Glu Gln Asp Ala 
465                 470                 475                 480 


Ser Ile Leu Ser Asp Glu His Ala Val Pro Leu Gly Phe Thr Gly Asn 
                485                 490                 495     


Phe Val Gly Met Ala Cys Gln Asp Leu Ser Gly Thr Arg Arg Ala Ala 
            500                 505                 510         


Asp Phe Asp Trp Phe Glu Tyr Arg Glu Met Gly 
        515                 520             


<210> 499
<211> 2358
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 499
atgaagaaga gagcttttag ctttagtctt tgtgttgcca ttatttcgac tttctggttg     60

cctgttgcac acatgagctc gcaagacaat cagccggact acaaaaaccc gcgtcttccg    120

gtcgataggc gcgtcgcgga cttgctgtcg cgcatgacgc tcgaagaaaa agtggcgcag    180

ttggtctgtc tctggggtag ccggccgcaa gttaatccgc aaaccaattt cgccaccgat    240

cgcggtgact tttctcccgc aaaagcggcg gaagtcatga agcacggcat cggccagatc    300

gcgcgccaac gagaacgtaa agatccgcgc cagggcgcga cgttcgccaa cgcggtgcag    360

aagtggttga tcgaaaacac gcggctcgga atcccagcga ttcttcacga cgagatactt    420

cacggccata tggccaaagg cagcacgagc ttcccgcagc cgatcgcact cgctacaacc    480

tgggacccgg acttcatcac gaaagtcttt acggcgggcg cgctcgagac gcgggcgcgc    540

ggaagtcatc aagtgcttgg tccaaacctc gacctggccc gcgacgctcg ctggggacga    600

actgaagaga cctacggcga agatccctat ctcacgtcgc gcatggccgt cgcgatcgtg    660

cgcgctttgc agggacccgg accgggtgtc gacggcgatc acgtcatcgc cacggccaaa    720

cactttgcgg cccacggcca accggagagc ggcaccaaca tcgcgcccgt aaatttttca    780

gaacgaacgt tgcgcgaata cttcctgccg agcttcaaag cggcggtgac tgaggccggc    840

atcatgagcg tgatgccttc gtacaacgag atcgacggcg tgccgtctca cgctaacaag    900

tggctgttgc aagacctcct gcgcgaagaa tggggcttcg acggtcacgt cgtctccgac    960

tactacgcga ttccgcaaat gatggatctg catcgcatcg ccggtgataa agcagcgact   1020

gcgaagctcg caatcgaggc cggcgtcgac accgaacttc cggatcctga ttcgttcccg   1080

actctcttgc gcttggtgaa agagggacag gtctcggaag cgacgctcaa tcgagcggtc   1140

gcgcgtaacc tgcgcgcgaa gtttctgctc ggattgtttg agaacccgta cgtcgacgtc   1200

gaacgtgctg tgcggatcac gaactcgagc gaacaccggg cgttggcggc tgaagcggcg   1260

cgcagatcga tcacgctgct gaagaaccaa aacaacctgc tgccgttgaa ccgaaacact   1320

ctgaaatcga tcgcggtgat tggcccaaac gccgcgcagg tccatctcgg cggatacagt   1380

gatcagccag gccgcggcgt cagtgtggtc cagggtatca aagacaaggt aggcggctcg   1440

atcaaggtcg cgtacgccga aggctgcaag ataaccaaag aaggcggcga ttggttcgcc   1500

gataccgcta ctctcagcga tccggctgag gatcgaaagt tgatcgccga agctgtgcaa   1560

gtggcaaaga cagccgacgt tgcgctgctc gtactcggcg gcaacgaaga cacgaacaaa   1620

gaaggctggg ccgacaatca tctcggcgat cgcgatagtc tcgagttgat cgggcggcaa   1680

aacgatctcg tcaaggccat tctcgagacc gggaaaccga ccatagttct gcttatcaac   1740

agcggtccgc tttcgatcaa ctacatcgcc gaaaacgttc ctgcgattct cgaaggtttc   1800

tatctagggc aggaaacagg cgtcggcgtc gccgatgtcc tgttcggcga cttcaacccg   1860

gcgggcaaac tgacgatcag ttttccacga tcagtcggac agttgccgct ctactacaac   1920

cgcaagccaa ccgcgcgccg cggctatctg tttgcaaaca aggagccgtt gtttccattc   1980

ggatttgggc ttagctacac gacgttcgcc tattccgatc taaaagttac tccggcgaag   2040

ataggcgtcg cgggtgaggc gcgggtcagc gtcacggtca ggaacagcgg gagccgtgcg   2100

ggcgacgagg tcgtgcaact ctacattcgc gaccttgtga gttcggttac acggcccatc   2160

atggagttga aagacttcaa gcgcatcccg ctggcgccgg gcgagagtaa gaccgtcgag   2220

tttgtcatca cgcccgaaaa actttctctt ctggatctga atatgaaaag cgtggtcgag   2280

cctggatggt tcgatatcat ggtcggaaca agctcagtaa aatatgagac agtcaaactt   2340

gaggttgcgg cgaagtag                                                 2358

<210> 500
<211> 785
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(25)

<220> 
<221> DOMAIN
<222> (123)...(354)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (424)...(668)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (261)...(264)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (414)...(417)
<223> N-glycosylation site. Prosite id = PS00001

<400> 500
Met Lys Lys Arg Ala Phe Ser Phe Ser Leu Cys Val Ala Ile Ile Ser 
1               5                   10                  15      


Thr Phe Trp Leu Pro Val Ala His Met Ser Ser Gln Asp Asn Gln Pro 
            20                  25                  30          


Asp Tyr Lys Asn Pro Arg Leu Pro Val Asp Arg Arg Val Ala Asp Leu 
        35                  40                  45              


Leu Ser Arg Met Thr Leu Glu Glu Lys Val Ala Gln Leu Val Cys Leu 
    50                  55                  60                  


Trp Gly Ser Arg Pro Gln Val Asn Pro Gln Thr Asn Phe Ala Thr Asp 
65                  70                  75                  80  


Arg Gly Asp Phe Ser Pro Ala Lys Ala Ala Glu Val Met Lys His Gly 
                85                  90                  95      


Ile Gly Gln Ile Ala Arg Gln Arg Glu Arg Lys Asp Pro Arg Gln Gly 
            100                 105                 110         


Ala Thr Phe Ala Asn Ala Val Gln Lys Trp Leu Ile Glu Asn Thr Arg 
        115                 120                 125             


Leu Gly Ile Pro Ala Ile Leu His Asp Glu Ile Leu His Gly His Met 
    130                 135                 140                 


Ala Lys Gly Ser Thr Ser Phe Pro Gln Pro Ile Ala Leu Ala Thr Thr 
145                 150                 155                 160 


Trp Asp Pro Asp Phe Ile Thr Lys Val Phe Thr Ala Gly Ala Leu Glu 
                165                 170                 175     


Thr Arg Ala Arg Gly Ser His Gln Val Leu Gly Pro Asn Leu Asp Leu 
            180                 185                 190         


Ala Arg Asp Ala Arg Trp Gly Arg Thr Glu Glu Thr Tyr Gly Glu Asp 
        195                 200                 205             


Pro Tyr Leu Thr Ser Arg Met Ala Val Ala Ile Val Arg Ala Leu Gln 
    210                 215                 220                 


Gly Pro Gly Pro Gly Val Asp Gly Asp His Val Ile Ala Thr Ala Lys 
225                 230                 235                 240 


His Phe Ala Ala His Gly Gln Pro Glu Ser Gly Thr Asn Ile Ala Pro 
                245                 250                 255     


Val Asn Phe Ser Glu Arg Thr Leu Arg Glu Tyr Phe Leu Pro Ser Phe 
            260                 265                 270         


Lys Ala Ala Val Thr Glu Ala Gly Ile Met Ser Val Met Pro Ser Tyr 
        275                 280                 285             


Asn Glu Ile Asp Gly Val Pro Ser His Ala Asn Lys Trp Leu Leu Gln 
    290                 295                 300                 


Asp Leu Leu Arg Glu Glu Trp Gly Phe Asp Gly His Val Val Ser Asp 
305                 310                 315                 320 


Tyr Tyr Ala Ile Pro Gln Met Met Asp Leu His Arg Ile Ala Gly Asp 
                325                 330                 335     


Lys Ala Ala Thr Ala Lys Leu Ala Ile Glu Ala Gly Val Asp Thr Glu 
            340                 345                 350         


Leu Pro Asp Pro Asp Ser Phe Pro Thr Leu Leu Arg Leu Val Lys Glu 
        355                 360                 365             


Gly Gln Val Ser Glu Ala Thr Leu Asn Arg Ala Val Ala Arg Asn Leu 
    370                 375                 380                 


Arg Ala Lys Phe Leu Leu Gly Leu Phe Glu Asn Pro Tyr Val Asp Val 
385                 390                 395                 400 


Glu Arg Ala Val Arg Ile Thr Asn Ser Ser Glu His Arg Ala Leu Ala 
                405                 410                 415     


Ala Glu Ala Ala Arg Arg Ser Ile Thr Leu Leu Lys Asn Gln Asn Asn 
            420                 425                 430         


Leu Leu Pro Leu Asn Arg Asn Thr Leu Lys Ser Ile Ala Val Ile Gly 
        435                 440                 445             


Pro Asn Ala Ala Gln Val His Leu Gly Gly Tyr Ser Asp Gln Pro Gly 
    450                 455                 460                 


Arg Gly Val Ser Val Val Gln Gly Ile Lys Asp Lys Val Gly Gly Ser 
465                 470                 475                 480 


Ile Lys Val Ala Tyr Ala Glu Gly Cys Lys Ile Thr Lys Glu Gly Gly 
                485                 490                 495     


Asp Trp Phe Ala Asp Thr Ala Thr Leu Ser Asp Pro Ala Glu Asp Arg 
            500                 505                 510         


Lys Leu Ile Ala Glu Ala Val Gln Val Ala Lys Thr Ala Asp Val Ala 
        515                 520                 525             


Leu Leu Val Leu Gly Gly Asn Glu Asp Thr Asn Lys Glu Gly Trp Ala 
    530                 535                 540                 


Asp Asn His Leu Gly Asp Arg Asp Ser Leu Glu Leu Ile Gly Arg Gln 
545                 550                 555                 560 


Asn Asp Leu Val Lys Ala Ile Leu Glu Thr Gly Lys Pro Thr Ile Val 
                565                 570                 575     


Leu Leu Ile Asn Ser Gly Pro Leu Ser Ile Asn Tyr Ile Ala Glu Asn 
            580                 585                 590         


Val Pro Ala Ile Leu Glu Gly Phe Tyr Leu Gly Gln Glu Thr Gly Val 
        595                 600                 605             


Gly Val Ala Asp Val Leu Phe Gly Asp Phe Asn Pro Ala Gly Lys Leu 
    610                 615                 620                 


Thr Ile Ser Phe Pro Arg Ser Val Gly Gln Leu Pro Leu Tyr Tyr Asn 
625                 630                 635                 640 


Arg Lys Pro Thr Ala Arg Arg Gly Tyr Leu Phe Ala Asn Lys Glu Pro 
                645                 650                 655     


Leu Phe Pro Phe Gly Phe Gly Leu Ser Tyr Thr Thr Phe Ala Tyr Ser 
            660                 665                 670         


Asp Leu Lys Val Thr Pro Ala Lys Ile Gly Val Ala Gly Glu Ala Arg 
        675                 680                 685             


Val Ser Val Thr Val Arg Asn Ser Gly Ser Arg Ala Gly Asp Glu Val 
    690                 695                 700                 


Val Gln Leu Tyr Ile Arg Asp Leu Val Ser Ser Val Thr Arg Pro Ile 
705                 710                 715                 720 


Met Glu Leu Lys Asp Phe Lys Arg Ile Pro Leu Ala Pro Gly Glu Ser 
                725                 730                 735     


Lys Thr Val Glu Phe Val Ile Thr Pro Glu Lys Leu Ser Leu Leu Asp 
            740                 745                 750         


Leu Asn Met Lys Ser Val Val Glu Pro Gly Trp Phe Asp Ile Met Val 
        755                 760                 765             


Gly Thr Ser Ser Val Lys Tyr Glu Thr Val Lys Leu Glu Val Ala Ala 
    770                 775                 780                 


Lys 
785 

<210> 501
<211> 972
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 501
atgaagatac ccaaatattt agttccaaac gattatatgg ccgatccggc cgtacatgta     60

tttaatgaca ggctttatat ttacccatct catgaccgtg agagtggtat tgaagagaat    120

gataacggtg atcatttcga tatgaaagat tatcatgtat tctcgacgga tgatctagag    180

aatggtactc tcatcgatca tggagttgtc ttggatacaa aaaatattcc gtgggcaggt    240

cgtcaacttt gggattgcga cgtggcatat aaagatggta aatattatct ctactttccg    300

ctgaaagatc aaacggacat ttttcggatt ggagttgctg tcagcgataa acctgagggt    360

ccatttattc ctgaagataa tccaatcaag gggagctaca gcatagatcc ggcaatattg    420

aatgatggcg gcggcaatta ttacatgtat ttcggtggtt tgtggggtgg tcaacttcaa    480

cgctaccgta ataacaaagc tttggaatgt gcggtattgc ctgaaaatga tgaattggca    540

ctcaattctt tggtagttaa actcagcgac gatatgttgg agtttgcgga agaaccgaag    600

gcagtcatta ttcttgatga aaagggcgaa cccttgaaag ccggtgattc cgaacgccgt    660

ttctttgagg cctcttgggt tcataaatac aatggaaaat actatttttc ttattctacc    720

ggagatacac atctgctctg ttatgctgtg ggtgatcatc catatggccc gttcacttat    780

cagggcgtga tccttactcc tgtagttggc tggacgactc atcatgccat cgtagaattt    840

aaaaataaat ggtatctatt ctttcacgat tgtgtgccgt caggaggacg aacttggttg    900

agaagtctga aagtggttga actagaatat gatgattcgg gtaaaataaa aacgatcgag    960

ggtactttat aa                                                        972

<210> 502
<211> 323
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (13)...(316)
<223> Glycosyl hydrolases family 43

<220> 
<221> SITE
<222> (61)...(64)
<223> N-glycosylation site. Prosite id = PS00001

<400> 502
Met Lys Ile Pro Lys Tyr Leu Val Pro Asn Asp Tyr Met Ala Asp Pro 
1               5                   10                  15      


Ala Val His Val Phe Asn Asp Arg Leu Tyr Ile Tyr Pro Ser His Asp 
            20                  25                  30          


Arg Glu Ser Gly Ile Glu Glu Asn Asp Asn Gly Asp His Phe Asp Met 
        35                  40                  45              


Lys Asp Tyr His Val Phe Ser Thr Asp Asp Leu Glu Asn Gly Thr Leu 
    50                  55                  60                  


Ile Asp His Gly Val Val Leu Asp Thr Lys Asn Ile Pro Trp Ala Gly 
65                  70                  75                  80  


Arg Gln Leu Trp Asp Cys Asp Val Ala Tyr Lys Asp Gly Lys Tyr Tyr 
                85                  90                  95      


Leu Tyr Phe Pro Leu Lys Asp Gln Thr Asp Ile Phe Arg Ile Gly Val 
            100                 105                 110         


Ala Val Ser Asp Lys Pro Glu Gly Pro Phe Ile Pro Glu Asp Asn Pro 
        115                 120                 125             


Ile Lys Gly Ser Tyr Ser Ile Asp Pro Ala Ile Leu Asn Asp Gly Gly 
    130                 135                 140                 


Gly Asn Tyr Tyr Met Tyr Phe Gly Gly Leu Trp Gly Gly Gln Leu Gln 
145                 150                 155                 160 


Arg Tyr Arg Asn Asn Lys Ala Leu Glu Cys Ala Val Leu Pro Glu Asn 
                165                 170                 175     


Asp Glu Leu Ala Leu Asn Ser Leu Val Val Lys Leu Ser Asp Asp Met 
            180                 185                 190         


Leu Glu Phe Ala Glu Glu Pro Lys Ala Val Ile Ile Leu Asp Glu Lys 
        195                 200                 205             


Gly Glu Pro Leu Lys Ala Gly Asp Ser Glu Arg Arg Phe Phe Glu Ala 
    210                 215                 220                 


Ser Trp Val His Lys Tyr Asn Gly Lys Tyr Tyr Phe Ser Tyr Ser Thr 
225                 230                 235                 240 


Gly Asp Thr His Leu Leu Cys Tyr Ala Val Gly Asp His Pro Tyr Gly 
                245                 250                 255     


Pro Phe Thr Tyr Gln Gly Val Ile Leu Thr Pro Val Val Gly Trp Thr 
            260                 265                 270         


Thr His His Ala Ile Val Glu Phe Lys Asn Lys Trp Tyr Leu Phe Phe 
        275                 280                 285             


His Asp Cys Val Pro Ser Gly Gly Arg Thr Trp Leu Arg Ser Leu Lys 
    290                 295                 300                 


Val Val Glu Leu Glu Tyr Asp Asp Ser Gly Lys Ile Lys Thr Ile Glu 
305                 310                 315                 320 


Gly Thr Leu 
            


<210> 503
<211> 966
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 503
atgaaaaaag caaaatatct tttcccgaat gactatatgg cagatccttc tgcgcacgtt     60

tttgaaggaa aaatctacat atatccctct cacgaccgag aaagcggaat tgaagaaaat    120

gacaacggcg atcattttga catgaatgat taccatgtat tttctcttga cgatgttgaa    180

aatggagata tcaccgatca cggggtagtg ctttccgtca aggatattcc gtggtcagga    240

cgccaacttt gggactgtga cgtggcagaa aaaaacggaa aatattatat gtattatccg    300

ctaaaagata aaacggatat tttcagaatc ggagtggcag taggcgaaaa accttacggg    360

cctttcatcc ctgagaaaca tccgatttta ggaagctaca gcattgatcc ttgtgtgttt    420

gatgacaatg gaacgcatta tctgtatttc ggaggaatct ggggaggaca attacaacgt    480

taccgtgaga ataaagcatt ggaatgtgct gttattcctg aaaatgacga gcctgccatt    540

tcttcaaaag tagttcggct gagtgacgat atgcttgaat ttgcggagga acccaaagac    600

ttaaaaattc ttgacgaaaa cggaaacgaa cttctgcacg gagatccgca ccgttttttt    660

gaagcatctt ggatgcataa gtacaacgga aaatattact tctcatattc aacgggagac    720

acgcatcttc tctgttacgc agtgggagac aatccttacg gacctttcac tttcaaaggg    780

gaaattttaa cacctgttgt cggttggaca acgcatcaca gtattgttga attcaaagga    840

aaatggtatt tgttcttcca cgattctgta ccaagtggcg gaagaacctg gctaagaagt    900

atgaaagtag tagaacttga gtacaataac gatggtacca tcaaaactat tgagggggaa    960

gaatag                                                               966

<210> 504
<211> 321
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (6)...(314)
<223> Glycosyl hydrolases family 43

<220> 
<221> SITE
<222> (145)...(148)
<223> N-glycosylation site. Prosite id = PS00001

<400> 504
Met Lys Lys Ala Lys Tyr Leu Phe Pro Asn Asp Tyr Met Ala Asp Pro 
1               5                   10                  15      


Ser Ala His Val Phe Glu Gly Lys Ile Tyr Ile Tyr Pro Ser His Asp 
            20                  25                  30          


Arg Glu Ser Gly Ile Glu Glu Asn Asp Asn Gly Asp His Phe Asp Met 
        35                  40                  45              


Asn Asp Tyr His Val Phe Ser Leu Asp Asp Val Glu Asn Gly Asp Ile 
    50                  55                  60                  


Thr Asp His Gly Val Val Leu Ser Val Lys Asp Ile Pro Trp Ser Gly 
65                  70                  75                  80  


Arg Gln Leu Trp Asp Cys Asp Val Ala Glu Lys Asn Gly Lys Tyr Tyr 
                85                  90                  95      


Met Tyr Tyr Pro Leu Lys Asp Lys Thr Asp Ile Phe Arg Ile Gly Val 
            100                 105                 110         


Ala Val Gly Glu Lys Pro Tyr Gly Pro Phe Ile Pro Glu Lys His Pro 
        115                 120                 125             


Ile Leu Gly Ser Tyr Ser Ile Asp Pro Cys Val Phe Asp Asp Asn Gly 
    130                 135                 140                 


Thr His Tyr Leu Tyr Phe Gly Gly Ile Trp Gly Gly Gln Leu Gln Arg 
145                 150                 155                 160 


Tyr Arg Glu Asn Lys Ala Leu Glu Cys Ala Val Ile Pro Glu Asn Asp 
                165                 170                 175     


Glu Pro Ala Ile Ser Ser Lys Val Val Arg Leu Ser Asp Asp Met Leu 
            180                 185                 190         


Glu Phe Ala Glu Glu Pro Lys Asp Leu Lys Ile Leu Asp Glu Asn Gly 
        195                 200                 205             


Asn Glu Leu Leu His Gly Asp Pro His Arg Phe Phe Glu Ala Ser Trp 
    210                 215                 220                 


Met His Lys Tyr Asn Gly Lys Tyr Tyr Phe Ser Tyr Ser Thr Gly Asp 
225                 230                 235                 240 


Thr His Leu Leu Cys Tyr Ala Val Gly Asp Asn Pro Tyr Gly Pro Phe 
                245                 250                 255     


Thr Phe Lys Gly Glu Ile Leu Thr Pro Val Val Gly Trp Thr Thr His 
            260                 265                 270         


His Ser Ile Val Glu Phe Lys Gly Lys Trp Tyr Leu Phe Phe His Asp 
        275                 280                 285             


Ser Val Pro Ser Gly Gly Arg Thr Trp Leu Arg Ser Met Lys Val Val 
    290                 295                 300                 


Glu Leu Glu Tyr Asn Asn Asp Gly Thr Ile Lys Thr Ile Glu Gly Glu 
305                 310                 315                 320 


Glu 
    

<210> 505
<211> 975
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 505
atgaagaagc caagatatct tgtagaccat ctatacacag ccgacccttc ggcccatgtt     60

ttcaatggaa aattatacat ctatccatcg cacgatgttg aatcaggaat tcccgagaat    120

gacaacggcg accattttga tatgcgcgat taccacgtgt tttcgatgga cgatattgat    180

gggcctgtta ctgatcatgg tgttgctttg gatgttaaaa atattccctg gtcgggtcgt    240

cagttatggg cacccgatgc agcctgcaaa aacggtaaat actatcttta ttttccctta    300

aaggataaaa acgacatttt ccgtattggc gttgctgtga gcgataagcc cgaaggtccg    360

tttattccac aggagaaccc aattaaagga agtttcagca tcgaccctgc tgttttagag    420

gacaatgatg gtaaatacta tatgtatttt ggcggactat ggggcggtca gcttcaaaga    480

taccgcaaca ataaagccat agagtgcggt catgaacctg ctgataacga acctgctttg    540

tgtgctaaag ttgctgtgtt aagcgatgac atgctggagt ttggtgaaga accccgcgat    600

gtggttattc tcgacgaaaa aggagaacca ttgaaagccg gcgaccacga tcgtcgttat    660

tttgaaggcc catggatgca caaatacaaa ggaaaatact acttctccta ctccaccggc    720

aatacacata aactttgcta tgccgttggc gatagccctt atggcccttt cacctataaa    780

ggagttattc taaccccggt ggtaggttgg actacacatc actctatatg cgagtttaaa    840

ggtaaatggt atctcttcca ccacgacagt gttccttcag gaggcaaaac ctggttaaga    900

agcattaaag ttgttgagtt ggaaatgaat gacgatggca ctattgttac catggaaggt    960

ggtggcaact ggtag                                                     975

<210> 506
<211> 324
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> DOMAIN
<222> (4)...(315)
<223> Glycosyl hydrolases family 43

<400> 506
Met Lys Lys Pro Arg Tyr Leu Val Asp His Leu Tyr Thr Ala Asp Pro 
1               5                   10                  15      


Ser Ala His Val Phe Asn Gly Lys Leu Tyr Ile Tyr Pro Ser His Asp 
            20                  25                  30          


Val Glu Ser Gly Ile Pro Glu Asn Asp Asn Gly Asp His Phe Asp Met 
        35                  40                  45              


Arg Asp Tyr His Val Phe Ser Met Asp Asp Ile Asp Gly Pro Val Thr 
    50                  55                  60                  


Asp His Gly Val Ala Leu Asp Val Lys Asn Ile Pro Trp Ser Gly Arg 
65                  70                  75                  80  


Gln Leu Trp Ala Pro Asp Ala Ala Cys Lys Asn Gly Lys Tyr Tyr Leu 
                85                  90                  95      


Tyr Phe Pro Leu Lys Asp Lys Asn Asp Ile Phe Arg Ile Gly Val Ala 
            100                 105                 110         


Val Ser Asp Lys Pro Glu Gly Pro Phe Ile Pro Gln Glu Asn Pro Ile 
        115                 120                 125             


Lys Gly Ser Phe Ser Ile Asp Pro Ala Val Leu Glu Asp Asn Asp Gly 
    130                 135                 140                 


Lys Tyr Tyr Met Tyr Phe Gly Gly Leu Trp Gly Gly Gln Leu Gln Arg 
145                 150                 155                 160 


Tyr Arg Asn Asn Lys Ala Ile Glu Cys Gly His Glu Pro Ala Asp Asn 
                165                 170                 175     


Glu Pro Ala Leu Cys Ala Lys Val Ala Val Leu Ser Asp Asp Met Leu 
            180                 185                 190         


Glu Phe Gly Glu Glu Pro Arg Asp Val Val Ile Leu Asp Glu Lys Gly 
        195                 200                 205             


Glu Pro Leu Lys Ala Gly Asp His Asp Arg Arg Tyr Phe Glu Gly Pro 
    210                 215                 220                 


Trp Met His Lys Tyr Lys Gly Lys Tyr Tyr Phe Ser Tyr Ser Thr Gly 
225                 230                 235                 240 


Asn Thr His Lys Leu Cys Tyr Ala Val Gly Asp Ser Pro Tyr Gly Pro 
                245                 250                 255     


Phe Thr Tyr Lys Gly Val Ile Leu Thr Pro Val Val Gly Trp Thr Thr 
            260                 265                 270         


His His Ser Ile Cys Glu Phe Lys Gly Lys Trp Tyr Leu Phe His His 
        275                 280                 285             


Asp Ser Val Pro Ser Gly Gly Lys Thr Trp Leu Arg Ser Ile Lys Val 
    290                 295                 300                 


Val Glu Leu Glu Met Asn Asp Asp Gly Thr Ile Val Thr Met Glu Gly 
305                 310                 315                 320 


Gly Gly Asn Trp 
                

<210> 507
<211> 1440
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 507
atgcaaaatc gtcgagaatt tttacaactt ttatttgccg gtgccggtgc cggacttgtt     60

ttgccgcaga tttctttcgg gcagactaaa caagccgacg cctggacgac cgagtatccg    120

aagattttag ccagaatcaa accgccgaaa tttcgcaaaa aagattttcc gatcaccaaa    180

tatggagccg ttgcggacgg gaaaaccctg gcgaccgaaa gcatcaaaaa agccatcgaa    240

gcgtgcgcca aatcgggcgg cgggcgcgtc gtcgtgcccc agggagaatt tttgaccggc    300

gcgattcatt tgaaatcaaa cgtcaatctg cacatcacga aaggcgcgac cgtcaaattt    360

tccaccaacc cgaaagatta tctgccgatc gttcacacgc gctgggaagg gatggaattg    420

atgcatattt cgcctttaat ttatgcctac gagcaaacca acatcgccgt caccggcgag    480

ggaacgctcg acgggcaggg caaggctttt ttctggaaat ggcacggaaa cccgcgctac    540

ggcggaaatc cggatgtgat cagccagcgt ccggcgcgcg cccggctgta tgaaatgatg    600

gaaaaaggcg tgcctgtggc ggagcggatt ttcggcgaaa ctcagtatct tcgcccgcag    660

tttatccagc cctataaatg caaaaatgtt ttgatcgaag gcgttaaaat catcgattcg    720

ccgatgtggg aagttcaccc cgttttgtgc gaaaacgtga cgatccgaaa acttcatatt    780

tctacccacg gaccgaacaa cgacgggtgc gatccggaaa gctgcaagga cgttttgatc    840

gaagactgct atttcgacac cggcgacgat tgcattgcca tcaaggcggg gcgcaatgaa    900

gacgggcgac gcatcaatgt tccgaccgaa aacgtcgtcg tgcgcgggtg cgtgatgaag    960

gacggtcacg gcggaatcac catcggaagc gagatttccg gcggcgtgcg aaatgttttc   1020

gcggaaaaca accggctcga cagcgcggat ttgtggactg cgctgagagt gaaaaacaac   1080

gcttcgcgcg gcggaaaact ggagaatttt tacttccgcg atatcaccgt cgggcaggtc   1140

tcgcgcgcgg tcgtcgaaat agattttaat tacgaggaag gcgctaaagg aaaacacacg   1200

ccggtcgttc gcaattacgt ggtcgaaaat ctaacctgca ataaaggcaa tcgagcggtc   1260

gatctgcagg gcttggacaa cgccccgatt tacgacatca cgatgaaaaa ctgtacgttt   1320

aacgtggtcg aaaagccgag cgtcgtgaaa aacgtcaaag gcgtcaaact ggaaaacgtg   1380

aagattaacg gcaaagtcgt cgagagtctg gaaaatgctg caacgacggc taaaaaataa   1440

<210> 508
<211> 479
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(27)

<220> 
<221> DOMAIN
<222> (82)...(461)
<223> Glycosyl hydrolases family 28

<220> 
<221> SITE
<222> (255)...(258)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (365)...(368)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (416)...(419)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (443)...(446)
<223> N-glycosylation site. Prosite id = PS00001

<400> 508
Met Gln Asn Arg Arg Glu Phe Leu Gln Leu Leu Phe Ala Gly Ala Gly 
1               5                   10                  15      


Ala Gly Leu Val Leu Pro Gln Ile Ser Phe Gly Gln Thr Lys Gln Ala 
            20                  25                  30          


Asp Ala Trp Thr Thr Glu Tyr Pro Lys Ile Leu Ala Arg Ile Lys Pro 
        35                  40                  45              


Pro Lys Phe Arg Lys Lys Asp Phe Pro Ile Thr Lys Tyr Gly Ala Val 
    50                  55                  60                  


Ala Asp Gly Lys Thr Leu Ala Thr Glu Ser Ile Lys Lys Ala Ile Glu 
65                  70                  75                  80  


Ala Cys Ala Lys Ser Gly Gly Gly Arg Val Val Val Pro Gln Gly Glu 
                85                  90                  95      


Phe Leu Thr Gly Ala Ile His Leu Lys Ser Asn Val Asn Leu His Ile 
            100                 105                 110         


Thr Lys Gly Ala Thr Val Lys Phe Ser Thr Asn Pro Lys Asp Tyr Leu 
        115                 120                 125             


Pro Ile Val His Thr Arg Trp Glu Gly Met Glu Leu Met His Ile Ser 
    130                 135                 140                 


Pro Leu Ile Tyr Ala Tyr Glu Gln Thr Asn Ile Ala Val Thr Gly Glu 
145                 150                 155                 160 


Gly Thr Leu Asp Gly Gln Gly Lys Ala Phe Phe Trp Lys Trp His Gly 
                165                 170                 175     


Asn Pro Arg Tyr Gly Gly Asn Pro Asp Val Ile Ser Gln Arg Pro Ala 
            180                 185                 190         


Arg Ala Arg Leu Tyr Glu Met Met Glu Lys Gly Val Pro Val Ala Glu 
        195                 200                 205             


Arg Ile Phe Gly Glu Thr Gln Tyr Leu Arg Pro Gln Phe Ile Gln Pro 
    210                 215                 220                 


Tyr Lys Cys Lys Asn Val Leu Ile Glu Gly Val Lys Ile Ile Asp Ser 
225                 230                 235                 240 


Pro Met Trp Glu Val His Pro Val Leu Cys Glu Asn Val Thr Ile Arg 
                245                 250                 255     


Lys Leu His Ile Ser Thr His Gly Pro Asn Asn Asp Gly Cys Asp Pro 
            260                 265                 270         


Glu Ser Cys Lys Asp Val Leu Ile Glu Asp Cys Tyr Phe Asp Thr Gly 
        275                 280                 285             


Asp Asp Cys Ile Ala Ile Lys Ala Gly Arg Asn Glu Asp Gly Arg Arg 
    290                 295                 300                 


Ile Asn Val Pro Thr Glu Asn Val Val Val Arg Gly Cys Val Met Lys 
305                 310                 315                 320 


Asp Gly His Gly Gly Ile Thr Ile Gly Ser Glu Ile Ser Gly Gly Val 
                325                 330                 335     


Arg Asn Val Phe Ala Glu Asn Asn Arg Leu Asp Ser Ala Asp Leu Trp 
            340                 345                 350         


Thr Ala Leu Arg Val Lys Asn Asn Ala Ser Arg Gly Gly Lys Leu Glu 
        355                 360                 365             


Asn Phe Tyr Phe Arg Asp Ile Thr Val Gly Gln Val Ser Arg Ala Val 
    370                 375                 380                 


Val Glu Ile Asp Phe Asn Tyr Glu Glu Gly Ala Lys Gly Lys His Thr 
385                 390                 395                 400 


Pro Val Val Arg Asn Tyr Val Val Glu Asn Leu Thr Cys Asn Lys Gly 
                405                 410                 415     


Asn Arg Ala Val Asp Leu Gln Gly Leu Asp Asn Ala Pro Ile Tyr Asp 
            420                 425                 430         


Ile Thr Met Lys Asn Cys Thr Phe Asn Val Val Glu Lys Pro Ser Val 
        435                 440                 445             


Val Lys Asn Val Lys Gly Val Lys Leu Glu Asn Val Lys Ile Asn Gly 
    450                 455                 460                 


Lys Val Val Glu Ser Leu Glu Asn Ala Ala Thr Thr Ala Lys Lys 
465                 470                 475                 

<210> 509
<211> 1377
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 509
atgacgacgc gacgcgaatt cattcgagat cttttggttg gcggcgtagt ggtcgctgtt     60

gcaccgcgtt tcctggcgtt ttcttcggtg gcgagtccgt gggaaacggt gatgccttcg    120

atcctcgaac gcatcaagcc accgcgtttt ccgaaacgca cgtgctatct caaccggttt    180

ggagcaaaag gcgacgggca aactgattgc acttcagctt ttcgacgcgc aatcgatcag    240

tgttcgaaag cgggcggtgg caaagtgatc gttccgcagg gaatgtatct caccggcgca    300

attcacttga agagcaacgt caatctcgag atctccgaag gcgcgacgat caagttcagt    360

caaaacccga aagactatct cccggtggtt ttttcgcgtt gggaaggcgt cgaagtattc    420

aactactcac ctttcatcta cgcatttgaa cagcagaaca tcgcgatcac gggcaagggc    480

acgctcgatg ggcagagtga taacgaacac tggtggccat ggaacggacg cgccaggtac    540

ggttggaaag aagggatgag ccaccagcgt ccggatcgaa acgcgctctt tgcgatggcg    600

gaaaaaggtg tttcggttcg cgaacgtgtt ttcggcgagg gtcattactt aaggccgcag    660

ttcattcagc cgtatcgctg ccagaacgta ttgatcgacg gagttacgat acgaaactcg    720

ccgatgtggg aaattcatcc ggtgctgtgc cggaatgtca tcgtgcaaaa cgtgcacatt    780

aacagtcatg gaccaaacaa cgatggctgc aatcccgagt cgtgcactga tgtgctgatt    840

aagaactgtt acttcgacac tggcgacgac tgtatcgcgg tcaaatcagg acgcaacgcg    900

gacggccggc ggcttaaagc gccgacagag aacgtgatcg tgcaagactg tcaaatgaaa    960

gatggacacg gcgggatcac tgtcggcagt gagatctcag gcggtgtgag aaatctgttt   1020

gcggagaact gccggcttga tagtccaaac ctggaccatg ctttgcgggt taagaacaac   1080

gcgatgcgtg gagggctgct cgagaatttg cacttccgaa acatcgaagt tggtcaggtg   1140

gcgcatgcag tgatcacgat cgattttaat tacgaggaag gcgcgaaagg atcgttcacg   1200

ccggtggttc gtgactacac tgtcgatggt ttgcgcagca cgcgaagcaa atacgcgctc   1260

gacgttcaag gtctgtcggg cgcgccgatc gtaaatctgc gtctgacgaa ttgcacgttc   1320

gacaatgttg ccgaagggaa cgtcgtgaag aatgttaagg acgcgacaat tcaaaaa      1377

<210> 510
<211> 459
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(31)

<220> 
<221> DOMAIN
<222> (81)...(458)
<223> Glycosyl hydrolases family 28

<220> 
<221> SITE
<222> (443)...(446)
<223> N-glycosylation site. Prosite id = PS00001

<400> 510
Met Thr Thr Arg Arg Glu Phe Ile Arg Asp Leu Leu Val Gly Gly Val 
1               5                   10                  15      


Val Val Ala Val Ala Pro Arg Phe Leu Ala Phe Ser Ser Val Ala Ser 
            20                  25                  30          


Pro Trp Glu Thr Val Met Pro Ser Ile Leu Glu Arg Ile Lys Pro Pro 
        35                  40                  45              


Arg Phe Pro Lys Arg Thr Cys Tyr Leu Asn Arg Phe Gly Ala Lys Gly 
    50                  55                  60                  


Asp Gly Gln Thr Asp Cys Thr Ser Ala Phe Arg Arg Ala Ile Asp Gln 
65                  70                  75                  80  


Cys Ser Lys Ala Gly Gly Gly Lys Val Ile Val Pro Gln Gly Met Tyr 
                85                  90                  95      


Leu Thr Gly Ala Ile His Leu Lys Ser Asn Val Asn Leu Glu Ile Ser 
            100                 105                 110         


Glu Gly Ala Thr Ile Lys Phe Ser Gln Asn Pro Lys Asp Tyr Leu Pro 
        115                 120                 125             


Val Val Phe Ser Arg Trp Glu Gly Val Glu Val Phe Asn Tyr Ser Pro 
    130                 135                 140                 


Phe Ile Tyr Ala Phe Glu Gln Gln Asn Ile Ala Ile Thr Gly Lys Gly 
145                 150                 155                 160 


Thr Leu Asp Gly Gln Ser Asp Asn Glu His Trp Trp Pro Trp Asn Gly 
                165                 170                 175     


Arg Ala Arg Tyr Gly Trp Lys Glu Gly Met Ser His Gln Arg Pro Asp 
            180                 185                 190         


Arg Asn Ala Leu Phe Ala Met Ala Glu Lys Gly Val Ser Val Arg Glu 
        195                 200                 205             


Arg Val Phe Gly Glu Gly His Tyr Leu Arg Pro Gln Phe Ile Gln Pro 
    210                 215                 220                 


Tyr Arg Cys Gln Asn Val Leu Ile Asp Gly Val Thr Ile Arg Asn Ser 
225                 230                 235                 240 


Pro Met Trp Glu Ile His Pro Val Leu Cys Arg Asn Val Ile Val Gln 
                245                 250                 255     


Asn Val His Ile Asn Ser His Gly Pro Asn Asn Asp Gly Cys Asn Pro 
            260                 265                 270         


Glu Ser Cys Thr Asp Val Leu Ile Lys Asn Cys Tyr Phe Asp Thr Gly 
        275                 280                 285             


Asp Asp Cys Ile Ala Val Lys Ser Gly Arg Asn Ala Asp Gly Arg Arg 
    290                 295                 300                 


Leu Lys Ala Pro Thr Glu Asn Val Ile Val Gln Asp Cys Gln Met Lys 
305                 310                 315                 320 


Asp Gly His Gly Gly Ile Thr Val Gly Ser Glu Ile Ser Gly Gly Val 
                325                 330                 335     


Arg Asn Leu Phe Ala Glu Asn Cys Arg Leu Asp Ser Pro Asn Leu Asp 
            340                 345                 350         


His Ala Leu Arg Val Lys Asn Asn Ala Met Arg Gly Gly Leu Leu Glu 
        355                 360                 365             


Asn Leu His Phe Arg Asn Ile Glu Val Gly Gln Val Ala His Ala Val 
    370                 375                 380                 


Ile Thr Ile Asp Phe Asn Tyr Glu Glu Gly Ala Lys Gly Ser Phe Thr 
385                 390                 395                 400 


Pro Val Val Arg Asp Tyr Thr Val Asp Gly Leu Arg Ser Thr Arg Ser 
                405                 410                 415     


Lys Tyr Ala Leu Asp Val Gln Gly Leu Ser Gly Ala Pro Ile Val Asn 
            420                 425                 430         


Leu Arg Leu Thr Asn Cys Thr Phe Asp Asn Val Ala Glu Gly Asn Val 
        435                 440                 445             


Val Lys Asn Val Lys Asp Ala Thr Ile Gln Lys 
    450                 455                 



<210> 511
<211> 1398
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 511
atgattaacc gtcgagattt cataaaagac ctcatcatca cctccgccgg agtcgcggtt     60

ctcccgcaac tggcgttcgg acaaaacgat ccctggaaaa ctcaataccc gcagatcctc    120

gcgcggatca aaccgccgaa atttccgaag cgcgatttcg tcatcacgaa gttcggcgcg    180

aaggcgggaa ccgatagcac gcaagcgatc gctaaagccc tcgacgcttg cgcgaaagcc    240

ggcggcggac gcgtcgtcgt acccgccggc gaatttctca ccggtgcgat ccatctcaag    300

tcgaacacca atctctacgt ctcaaaaggc gcgactctga agttttcgac cgaccccgaa    360

aaatatctgc cgatcgttca cacgcggtgg gaagggatgg agttgatgca tctctcgccg    420

ttcatctacg cgtacgagca gacgaacatc gcgatcaccg gcgagggcac gctcgacggc    480

caaggcaaat cgttcttttg gaagtggcac ggcaacccgc gatacggcgg caaccccgaa    540

gtgatcagtc agcaaaaagc gcgggcgcga ctttacgaga tgatggacaa gaacgtaccc    600

gtcgcggagc gcgtgttcgg tatcgggcac tatctccggc cgcagttcat ccagccgtac    660

aaatgtaaga acgtcttgat cgaaggcgtg acgatcatcg actcgccgat gtgggaagtt    720

catccggtgc tttgcgagaa tgtcaccgtc cgcaatcttc acatctcgtc gcacggtccg    780

aacaacgacg gctgcgatcc cgagtcgtgc aaagacgtcc tgatcgacaa ctgcttcttc    840

gacaccggtg acgactgcat cgcgatcaag tcgggtcgca ataacgacgg tcgtcgtctg    900

aacacaccga ccgagaacat catcgtccgc aactgcacga tgaaagacgg tcacggtggt    960

atcacggtcg gaagcgagat ctcgggcggc gtgcgaaact tgttcgcaca cgattgcaag   1020

atggacagtg cggatctgtg gaccgcgctc cgggtaaaga acaacgcatc gcggggcggc   1080

atgctggaga atttctattt ccgcaacatc accgtcgggc aagtcgcgcg tgctgtggtc   1140

gagatcgatt tcaactatga agaaggcgcg aagggatcgt acacaccggt catgcgcaac   1200

tacgtggtcg aggatctgac gtgcaccagc gggaaccggc ccgtcgatct gcaaggatta   1260

gacaacgcgc caatttacga tgtgtcgctg cgtaacacga ccttcggcgc gatgaagaac   1320

aagagcgtcg tgaagaatgt ccgaggactg aagatcgaaa acgttaccgt cagcggcacg   1380

cgcgtggaga gtttatga                                                 1398

<210> 512
<211> 465
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(27)

<220> 
<221> DOMAIN
<222> (77)...(459)
<223> Glycosyl hydrolases family 28

<220> 
<221> SITE
<222> (250)...(253)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (315)...(318)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (315)...(329)
<223> Polygalacturonase active site. Prosite id = PS00502

<220> 
<221> SITE
<222> (360)...(363)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (374)...(377)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (438)...(441)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (446)...(449)
<223> N-glycosylation site. Prosite id = PS00001

<400> 512
Met Ile Asn Arg Arg Asp Phe Ile Lys Asp Leu Ile Ile Thr Ser Ala 
1               5                   10                  15      


Gly Val Ala Val Leu Pro Gln Leu Ala Phe Gly Gln Asn Asp Pro Trp 
            20                  25                  30          


Lys Thr Gln Tyr Pro Gln Ile Leu Ala Arg Ile Lys Pro Pro Lys Phe 
        35                  40                  45              


Pro Lys Arg Asp Phe Val Ile Thr Lys Phe Gly Ala Lys Ala Gly Thr 
    50                  55                  60                  


Asp Ser Thr Gln Ala Ile Ala Lys Ala Leu Asp Ala Cys Ala Lys Ala 
65                  70                  75                  80  


Gly Gly Gly Arg Val Val Val Pro Ala Gly Glu Phe Leu Thr Gly Ala 
                85                  90                  95      


Ile His Leu Lys Ser Asn Thr Asn Leu Tyr Val Ser Lys Gly Ala Thr 
            100                 105                 110         


Leu Lys Phe Ser Thr Asp Pro Glu Lys Tyr Leu Pro Ile Val His Thr 
        115                 120                 125             


Arg Trp Glu Gly Met Glu Leu Met His Leu Ser Pro Phe Ile Tyr Ala 
    130                 135                 140                 


Tyr Glu Gln Thr Asn Ile Ala Ile Thr Gly Glu Gly Thr Leu Asp Gly 
145                 150                 155                 160 


Gln Gly Lys Ser Phe Phe Trp Lys Trp His Gly Asn Pro Arg Tyr Gly 
                165                 170                 175     


Gly Asn Pro Glu Val Ile Ser Gln Gln Lys Ala Arg Ala Arg Leu Tyr 
            180                 185                 190         


Glu Met Met Asp Lys Asn Val Pro Val Ala Glu Arg Val Phe Gly Ile 
        195                 200                 205             


Gly His Tyr Leu Arg Pro Gln Phe Ile Gln Pro Tyr Lys Cys Lys Asn 
    210                 215                 220                 


Val Leu Ile Glu Gly Val Thr Ile Ile Asp Ser Pro Met Trp Glu Val 
225                 230                 235                 240 


His Pro Val Leu Cys Glu Asn Val Thr Val Arg Asn Leu His Ile Ser 
                245                 250                 255     


Ser His Gly Pro Asn Asn Asp Gly Cys Asp Pro Glu Ser Cys Lys Asp 
            260                 265                 270         


Val Leu Ile Asp Asn Cys Phe Phe Asp Thr Gly Asp Asp Cys Ile Ala 
        275                 280                 285             


Ile Lys Ser Gly Arg Asn Asn Asp Gly Arg Arg Leu Asn Thr Pro Thr 
    290                 295                 300                 


Glu Asn Ile Ile Val Arg Asn Cys Thr Met Lys Asp Gly His Gly Gly 
305                 310                 315                 320 


Ile Thr Val Gly Ser Glu Ile Ser Gly Gly Val Arg Asn Leu Phe Ala 
                325                 330                 335     


His Asp Cys Lys Met Asp Ser Ala Asp Leu Trp Thr Ala Leu Arg Val 
            340                 345                 350         


Lys Asn Asn Ala Ser Arg Gly Gly Met Leu Glu Asn Phe Tyr Phe Arg 
        355                 360                 365             


Asn Ile Thr Val Gly Gln Val Ala Arg Ala Val Val Glu Ile Asp Phe 
    370                 375                 380                 


Asn Tyr Glu Glu Gly Ala Lys Gly Ser Tyr Thr Pro Val Met Arg Asn 
385                 390                 395                 400 


Tyr Val Val Glu Asp Leu Thr Cys Thr Ser Gly Asn Arg Pro Val Asp 
                405                 410                 415     


Leu Gln Gly Leu Asp Asn Ala Pro Ile Tyr Asp Val Ser Leu Arg Asn 
            420                 425                 430         


Thr Thr Phe Gly Ala Met Lys Asn Lys Ser Val Val Lys Asn Val Arg 
        435                 440                 445             


Gly Leu Lys Ile Glu Asn Val Thr Val Ser Gly Thr Arg Val Glu Ser 
    450                 455                 460                 


Leu 
465 

<210> 513
<211> 1416
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 513
atgtcgtcac gacgcgagtt cattagagat ctgttgactg gcggcgcact gatcgccgtc     60

gcgccgcgtc tgtctgcgtt tgcagcggag gagaatccgt gggaaacggt gatgccttcg    120

atcgtgaaac gcatcaagcg acctcgtttc ccgatgcgca cgtttgatct cacggagttt    180

ggagcgaaag gtgatggacg aacagattgc acgttggctt tccgtcgcgc gatcgatcga    240

tgcacgaacg ccggtggtgg gagagtagtt gttccaccgg gttcgtatct cactggcgcc    300

attcatttga agagcaacgt cgaccttcat atctcagaag gtactacggt caagttcagc    360

cagaacccga aagactacct gcccgttgtt ttctcgcgtt gggaaggcgt cgaggtgttc    420

aactactcgc cttttatcta cgccttcgaa caaacgaaca ttgcgatcac tggcaagggc    480

acgctcaacg gtcaaagcga caacgaacac tggtggccct ggaacggacg tgccgcgtac    540

ggctggaaag aagggatgag caatcagcgt cccgatcgaa atgcgctgtt tgcgatggcc    600

gaaaaaggtg tcccggttca ggagcgcatt tttggtgagg gccattactt aaggccgcag    660

ttcattcaac cttatcgttg tgagaacgtg ctgatcgaag gtgtcactat tcgaaactcg    720

ccgatgtggg aaattcatcc ggtgctctgc cggaatgtca tcgtccaaaa tgtgatcatc    780

aacagtcatg gtccaaacaa cgacgggtgt aatcctgagt cgtgcacgga tgtgttgatt    840

aaggattgtg acttcgacac tggtgacgat tgtatcgcga tcaagtcagg ccgaaatgca    900

gatgggcggc gactgaaggc tcctactgaa aacattatcg tgactggttg tcgcatgaaa    960

gatggtcacg gcgggattac ggtgggcagc gagatttcgg gtggggtgcg aaatcttttc   1020

gcatccaact gccggctcga cagtccgaac ctggaccatg cattgcgggt taagaataac   1080

gctatgcgtg gcgggctgtt ggagaatctg cacttccgaa atatcgacgt cgggcaagtg   1140

gcgcacgcgg tgatcacgat cgatttcaat tatgaggaag gcgcgaaggg atcgttcacg   1200

ccagtcgttc gtgattacac cgtcgatggc cttcgcagca cgaaaagtaa gtacgcgctc   1260

gatgtgcagg gcttggcgac ggcgccgatc gtgaatctgc gtctaaccaa ctgcatcttc   1320

gacaatgtcg ctgaaggaaa tgttgtgaag aacgtaaagg atgcaactat cgagaatgtc   1380

aaaatcaatg gaaaaagcgt tgatgcagtg ccgtag                             1416

<210> 514
<211> 471
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 514
Met Ser Ser Arg Arg Glu Phe Ile Arg Asp Leu Leu Thr Gly Gly Ala 
1               5                   10                  15      


Leu Ile Ala Val Ala Pro Arg Leu Ser Ala Phe Ala Ala Glu Glu Asn 
            20                  25                  30          


Pro Trp Glu Thr Val Met Pro Ser Ile Val Lys Arg Ile Lys Arg Pro 
        35                  40                  45              


Arg Phe Pro Met Arg Thr Phe Asp Leu Thr Glu Phe Gly Ala Lys Gly 
    50                  55                  60                  


Asp Gly Arg Thr Asp Cys Thr Leu Ala Phe Arg Arg Ala Ile Asp Arg 
65                  70                  75                  80  


Cys Thr Asn Ala Gly Gly Gly Arg Val Val Val Pro Pro Gly Ser Tyr 
                85                  90                  95      


Leu Thr Gly Ala Ile His Leu Lys Ser Asn Val Asp Leu His Ile Ser 
            100                 105                 110         


Glu Gly Thr Thr Val Lys Phe Ser Gln Asn Pro Lys Asp Tyr Leu Pro 
        115                 120                 125             


Val Val Phe Ser Arg Trp Glu Gly Val Glu Val Phe Asn Tyr Ser Pro 
    130                 135                 140                 


Phe Ile Tyr Ala Phe Glu Gln Thr Asn Ile Ala Ile Thr Gly Lys Gly 
145                 150                 155                 160 


Thr Leu Asn Gly Gln Ser Asp Asn Glu His Trp Trp Pro Trp Asn Gly 
                165                 170                 175     


Arg Ala Ala Tyr Gly Trp Lys Glu Gly Met Ser Asn Gln Arg Pro Asp 
            180                 185                 190         


Arg Asn Ala Leu Phe Ala Met Ala Glu Lys Gly Val Pro Val Gln Glu 
        195                 200                 205             


Arg Ile Phe Gly Glu Gly His Tyr Leu Arg Pro Gln Phe Ile Gln Pro 
    210                 215                 220                 


Tyr Arg Cys Glu Asn Val Leu Ile Glu Gly Val Thr Ile Arg Asn Ser 
225                 230                 235                 240 


Pro Met Trp Glu Ile His Pro Val Leu Cys Arg Asn Val Ile Val Gln 
                245                 250                 255     


Asn Val Ile Ile Asn Ser His Gly Pro Asn Asn Asp Gly Cys Asn Pro 
            260                 265                 270         


Glu Ser Cys Thr Asp Val Leu Ile Lys Asp Cys Asp Phe Asp Thr Gly 
        275                 280                 285             


Asp Asp Cys Ile Ala Ile Lys Ser Gly Arg Asn Ala Asp Gly Arg Arg 
    290                 295                 300                 


Leu Lys Ala Pro Thr Glu Asn Ile Ile Val Thr Gly Cys Arg Met Lys 
305                 310                 315                 320 


Asp Gly His Gly Gly Ile Thr Val Gly Ser Glu Ile Ser Gly Gly Val 
                325                 330                 335     


Arg Asn Leu Phe Ala Ser Asn Cys Arg Leu Asp Ser Pro Asn Leu Asp 
            340                 345                 350         


His Ala Leu Arg Val Lys Asn Asn Ala Met Arg Gly Gly Leu Leu Glu 
        355                 360                 365             


Asn Leu His Phe Arg Asn Ile Asp Val Gly Gln Val Ala His Ala Val 
    370                 375                 380                 


Ile Thr Ile Asp Phe Asn Tyr Glu Glu Gly Ala Lys Gly Ser Phe Thr 
385                 390                 395                 400 


Pro Val Val Arg Asp Tyr Thr Val Asp Gly Leu Arg Ser Thr Lys Ser 
                405                 410                 415     


Lys Tyr Ala Leu Asp Val Gln Gly Leu Ala Thr Ala Pro Ile Val Asn 
            420                 425                 430         


Leu Arg Leu Thr Asn Cys Ile Phe Asp Asn Val Ala Glu Gly Asn Val 
        435                 440                 445             


Val Lys Asn Val Lys Asp Ala Thr Ile Glu Asn Val Lys Ile Asn Gly 
    450                 455                 460                 


Lys Ser Val Asp Ala Val Pro 
465                 470     

<210> 515
<211> 2160
<212> DNA
<213> Cochliobolus heterostrophus ATCC 48331

<400> 515
atccctttcg atcagcgtgc ttcgaaacgt gcagtgggca gttgggatga tgcatataca     60

aaggcaacag cagcactcgc aaagctatcg caagatgaaa agattggaat cgtcacggga    120

actggctggt cgaagagtaa ttgtgtaggc aacaccaagc ctgctagctc gattggatac    180

cccgagctat gtctgcaaga tggtcccctt ggtgtccgct atgtccgggg tataaccgcc    240

tttgcagctg gcatccatgc tgccagcact tgggacatcg atctcatccg tgaaagaggt    300

gcttttctcg gcaacgaagc aaaacaattg ggtatccacg tacagctagg tccgtctgct    360

ggtccccttg gcaagtttgc caagggcggc cgtaactggg agggatttgg atcagatcct    420

tatctccaag gcatcatgat ggcacagacc attgagggca tgcaagaagc tggtgttcag    480

gccactgcta agcattggat tgtgaatgaa caagagctca accgagacac tatgagctct    540

gatgtgagcg atagggtcct gcgcgagttg tatgtttggc cattcgcgga cgccgctcac    600

agcaaggttg cggcattcat gtgcagctac aacaagctca actcaacatg ggcgtgtgaa    660

agcgaaggcg ttatgcaaaa actcttgaaa gacgagcttg gccaccgtgg atacatcatg    720

tcggattgga atgcgcaaca cactacgact ggcagtgcaa atggtggcat ggacatgacc    780

atgcccggca gtgattttag tggcggaaac gtgctctggg ggccgcaact caagacagcc    840

attagcaacg gtcaagttca gcaatcacgc ctcgatgata tggtcaagcg agtccttgct    900

gcgtggtacc ttatgggcca ggacaagggt tatcctgcga cttccttcaa ctcctggaac    960

attggaacaa aacaaatcag tggaaaccac gggactaatg tccgagcgac tgctcgagac   1020

ggcactatcc tgctgaagaa tgctaatggt gcgctacccc tgaagaagcc aaagagcatt   1080

gctgtcattg gaactgacag tattgtcgca cctcgcgggg ccaatgcttg tgtcgaccgt   1140

ggatgcaccg aaggtgttct taccatgggc tggggttcgg gatcagttga gcttccatcc   1200

aatatggtcg cgccactgga tgccatcaag acgcaggccc aaaaagacgg tacgacggtt   1260

acttcgtcgc ccaccgacaa tgcccagcaa ggcgccagcg cagctcaaaa tgctgagatt   1320

gccgtggtat gcatcaacag caacgctggt gaaggctaca tcaatgtgga aggcaacgct   1380

ggcgaccgaa ataacctaga cccctggcac aatggcaatg agcttgtcaa ggcggttgct   1440

gccgtgaaca agaagacggt ggttgtcgtt cacagcgttg gaccaatcat catggagcag   1500

tggattgaga atcccaacgt cgttgcagta gtgtgggcag gtctcccagg ccaagagccc   1560

ggcaacggcg ttgtcgacat catgtatggc gcagcgtcgc caagcggcaa gcttccttac   1620

accattgcga agaaagagtc tgactacggc accacaattg ccagtggcga tgacaagtcc   1680

tgggatctgt acatcgacta tcgctatttc gacaagcaga acatcacacc caggtttgaa   1740

tttggctttg gactctcata caccaacttc acctactcag agctcaccgt aactggcaag   1800

ccttctgctg gtcctgcgac tggtgctgta ggacctggag gccccgttga tctcttcgag   1860

actgtcgcta cagtcaccgc caagattgcc aactctggtg gtgttgctgg tgccgaggta   1920

ccgcagctgt atctgggcta ccctgcgtct accaactctc caccaaaaca actcaggggc   1980

ttcagcaagc tcaagttgga ggctggtgct agcggaaccg caacattcaa gctgaggagg   2040

agggacatga gtttctggga cgagaagacc aggaagtgga ctgttgcgac gggcgagtac   2100

actgtctttg tcggagcaag ctccagggac gtgaggctga cgggcaagat tgttgtgtaa   2160

<210> 516
<211> 719
<212> PRT
<213> Cochliobolus heterostrophus ATCC 48331

<220> 
<221> DOMAIN
<222> (52)...(262)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (342)...(589)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (217)...(220)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (598)...(601)
<223> N-glycosylation site. Prosite id = PS00001

<400> 516
Ile Pro Phe Asp Gln Arg Ala Ser Lys Arg Ala Val Gly Ser Trp Asp 
1               5                   10                  15      


Asp Ala Tyr Thr Lys Ala Thr Ala Ala Leu Ala Lys Leu Ser Gln Asp 
            20                  25                  30          


Glu Lys Ile Gly Ile Val Thr Gly Thr Gly Trp Ser Lys Ser Asn Cys 
        35                  40                  45              


Val Gly Asn Thr Lys Pro Ala Ser Ser Ile Gly Tyr Pro Glu Leu Cys 
    50                  55                  60                  


Leu Gln Asp Gly Pro Leu Gly Val Arg Tyr Val Arg Gly Ile Thr Ala 
65                  70                  75                  80  


Phe Ala Ala Gly Ile His Ala Ala Ser Thr Trp Asp Ile Asp Leu Ile 
                85                  90                  95      


Arg Glu Arg Gly Ala Phe Leu Gly Asn Glu Ala Lys Gln Leu Gly Ile 
            100                 105                 110         


His Val Gln Leu Gly Pro Ser Ala Gly Pro Leu Gly Lys Phe Ala Lys 
        115                 120                 125             


Gly Gly Arg Asn Trp Glu Gly Phe Gly Ser Asp Pro Tyr Leu Gln Gly 
    130                 135                 140                 


Ile Met Met Ala Gln Thr Ile Glu Gly Met Gln Glu Ala Gly Val Gln 
145                 150                 155                 160 


Ala Thr Ala Lys His Trp Ile Val Asn Glu Gln Glu Leu Asn Arg Asp 
                165                 170                 175     


Thr Met Ser Ser Asp Val Ser Asp Arg Val Leu Arg Glu Leu Tyr Val 
            180                 185                 190         


Trp Pro Phe Ala Asp Ala Ala His Ser Lys Val Ala Ala Phe Met Cys 
        195                 200                 205             


Ser Tyr Asn Lys Leu Asn Ser Thr Trp Ala Cys Glu Ser Glu Gly Val 
    210                 215                 220                 


Met Gln Lys Leu Leu Lys Asp Glu Leu Gly His Arg Gly Tyr Ile Met 
225                 230                 235                 240 


Ser Asp Trp Asn Ala Gln His Thr Thr Thr Gly Ser Ala Asn Gly Gly 
                245                 250                 255     


Met Asp Met Thr Met Pro Gly Ser Asp Phe Ser Gly Gly Asn Val Leu 
            260                 265                 270         


Trp Gly Pro Gln Leu Lys Thr Ala Ile Ser Asn Gly Gln Val Gln Gln 
        275                 280                 285             


Ser Arg Leu Asp Asp Met Val Lys Arg Val Leu Ala Ala Trp Tyr Leu 
    290                 295                 300                 


Met Gly Gln Asp Lys Gly Tyr Pro Ala Thr Ser Phe Asn Ser Trp Asn 
305                 310                 315                 320 


Ile Gly Thr Lys Gln Ile Ser Gly Asn His Gly Thr Asn Val Arg Ala 
                325                 330                 335     


Thr Ala Arg Asp Gly Thr Ile Leu Leu Lys Asn Ala Asn Gly Ala Leu 
            340                 345                 350         


Pro Leu Lys Lys Pro Lys Ser Ile Ala Val Ile Gly Thr Asp Ser Ile 
        355                 360                 365             


Val Ala Pro Arg Gly Ala Asn Ala Cys Val Asp Arg Gly Cys Thr Glu 
    370                 375                 380                 


Gly Val Leu Thr Met Gly Trp Gly Ser Gly Ser Val Glu Leu Pro Ser 
385                 390                 395                 400 


Asn Met Val Ala Pro Leu Asp Ala Ile Lys Thr Gln Ala Gln Lys Asp 
                405                 410                 415     


Gly Thr Thr Val Thr Ser Ser Pro Thr Asp Asn Ala Gln Gln Gly Ala 
            420                 425                 430         


Ser Ala Ala Gln Asn Ala Glu Ile Ala Val Val Cys Ile Asn Ser Asn 
        435                 440                 445             


Ala Gly Glu Gly Tyr Ile Asn Val Glu Gly Asn Ala Gly Asp Arg Asn 
    450                 455                 460                 


Asn Leu Asp Pro Trp His Asn Gly Asn Glu Leu Val Lys Ala Val Ala 
465                 470                 475                 480 


Ala Val Asn Lys Lys Thr Val Val Val Val His Ser Val Gly Pro Ile 
                485                 490                 495     


Ile Met Glu Gln Trp Ile Glu Asn Pro Asn Val Val Ala Val Val Trp 
            500                 505                 510         


Ala Gly Leu Pro Gly Gln Glu Pro Gly Asn Gly Val Val Asp Ile Met 
        515                 520                 525             


Tyr Gly Ala Ala Ser Pro Ser Gly Lys Leu Pro Tyr Thr Ile Ala Lys 
    530                 535                 540                 


Lys Glu Ser Asp Tyr Gly Thr Thr Ile Ala Ser Gly Asp Asp Lys Ser 
545                 550                 555                 560 


Trp Asp Leu Tyr Ile Asp Tyr Arg Tyr Phe Asp Lys Gln Asn Ile Thr 
                565                 570                 575     


Pro Arg Phe Glu Phe Gly Phe Gly Leu Ser Tyr Thr Asn Phe Thr Tyr 
            580                 585                 590         


Ser Glu Leu Thr Val Thr Gly Lys Pro Ser Ala Gly Pro Ala Thr Gly 
        595                 600                 605             


Ala Val Gly Pro Gly Gly Pro Val Asp Leu Phe Glu Thr Val Ala Thr 
    610                 615                 620                 


Val Thr Ala Lys Ile Ala Asn Ser Gly Gly Val Ala Gly Ala Glu Val 
625                 630                 635                 640 


Pro Gln Leu Tyr Leu Gly Tyr Pro Ala Ser Thr Asn Ser Pro Pro Lys 
                645                 650                 655     


Gln Leu Arg Gly Phe Ser Lys Leu Lys Leu Glu Ala Gly Ala Ser Gly 
            660                 665                 670         


Thr Ala Thr Phe Lys Leu Arg Arg Arg Asp Met Ser Phe Trp Asp Glu 
        675                 680                 685             


Lys Thr Arg Lys Trp Thr Val Ala Thr Gly Glu Tyr Thr Val Phe Val 
    690                 695                 700                 


Gly Ala Ser Ser Arg Asp Val Arg Leu Thr Gly Lys Ile Val Val 
705                 710                 715                 



<210> 517
<211> 1767
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 517
atgaccacaa cccgccgcac tatcctgaaa gccgccgcca gcgccggcgc gatcgccagc     60

accggctggc ccgccttggc cgccgcacag gccgcgcaag ccgccgaccc gtgggcccgc    120

gcccagcaga tcatcgaccg cttcgccaag ccgctcagct tcccgaacag ggacttcccg    180

atcaccgagt tcggcgccaa accctgcaag ctggtcaaag cccagggcct ggtcgaagta    240

agagtcaaag gcgaactcga aacgccagca ccgcaagcgc cggacgccta cccggcaatc    300

aaagccgcca tcgccgcagc gagcaaggcc ggaggagggc gcgtgctgat cccggccggc    360

aactggtact gcaagggccc tatcgtgctg ctgtcgaacg tgcacgtgca ccttgccaag    420

ggcgcgcaag tctacttcag cgccaacgcc aaggacttcg cccgcgacgg cgactacgac    480

tgcggcgcca acggcaagct ggtgctctcg cgctggcaag gcaacgattg cctgaacttc    540

tcgcccatgg tctacgcgcg cgggcaaaag aatatcgcca ttaccggcga agactggacc    600

agcatcctga acggccaggc cggcgtggcg ttcgaagacg gcagcggcaa tggctggtgg    660

ggcatgaacc ccgccggcgc gccgcccggc agcaccacgc accagggcgc agccaatccg    720

aacaacgccg aggagccaat cgccagactg cccacgcgcc acgcgaactg gagcgccgac    780

gacaagtacc tgccgctgct gtccgaagcc ggcgtgcccg ccgagcgccg cgtgttcggt    840

ctggggcact acctgcggcc gtcgatggtc gaattcgtcg actgcgggga tgtgctgatg    900

cagggctacc aggtcatcaa cacgccgttc tggattcatc acccggtcaa ctcacgcaac    960

attcacttct ccaaagtgcg catggaaagc atcggcccga attcggacgg tttcgatccc   1020

gagtcctgcg acaccatcct ggtggacggc tgcctgttca ataccggcga cgactgcatc   1080

gccatcaaat ccggcaagaa ccgagactcg caatacggcc caacgcgcaa tatggtggtc   1140

cagaactgca tcatgaaccg cggccacggc ggcgttacgc tgggcagcga aatggcgggt   1200

ggcatcgagc atatctacgc gcagaaaatc gaattccgca acgcgttctg ggaccacgac   1260

ccgctgggca cggccatccg aatgaagacg aacatgaacc gcggcggcta ccttcgtcat   1320

ttctacgtgc gcgacgtgac gctgccgaat ggcgtgcgta ccaagagcgg cttctacaag   1380

acgctgccgg gatctccgct ggcaggcaag gtctccacca gcggcggcgc tgttatcact   1440

atcgactgcg attacgcgcc gaatgacgac agcgtgcgcg tgcggccgcc gcaggtgtcg   1500

gacgtgcata tctcgaacgt ccgcgtcagc aatgtgaaaa cggccgaagg ctcgttctcc   1560

tgctaccagg ccatggtgct gctcgggccc gtggcggcca gcttcaacgg cgcgcctggc   1620

acggccatcc tgccgatcac gaatgtcacc gtcagcgatt cggacttcgg cacgccgcgc   1680

aacagcgcag agccctggtt cgcgttcaac gtgcagggac tcaagctgcg caacgtgcgc   1740

atcgatggca aggagtacaa cgtatga                                       1767

<210> 518
<211> 588
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(34)

<220> 
<221> DOMAIN
<222> (110)...(555)
<223> Glycosyl hydrolases family 28

<220> 
<221> SITE
<222> (259)...(262)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (556)...(559)
<223> N-glycosylation site. Prosite id = PS00001

<400> 518
Met Thr Thr Thr Arg Arg Thr Ile Leu Lys Ala Ala Ala Ser Ala Gly 
1               5                   10                  15      


Ala Ile Ala Ser Thr Gly Trp Pro Ala Leu Ala Ala Ala Gln Ala Ala 
            20                  25                  30          


Gln Ala Ala Asp Pro Trp Ala Arg Ala Gln Gln Ile Ile Asp Arg Phe 
        35                  40                  45              


Ala Lys Pro Leu Ser Phe Pro Asn Arg Asp Phe Pro Ile Thr Glu Phe 
    50                  55                  60                  


Gly Ala Lys Pro Cys Lys Leu Val Lys Ala Gln Gly Leu Val Glu Val 
65                  70                  75                  80  


Arg Val Lys Gly Glu Leu Glu Thr Pro Ala Pro Gln Ala Pro Asp Ala 
                85                  90                  95      


Tyr Pro Ala Ile Lys Ala Ala Ile Ala Ala Ala Ser Lys Ala Gly Gly 
            100                 105                 110         


Gly Arg Val Leu Ile Pro Ala Gly Asn Trp Tyr Cys Lys Gly Pro Ile 
        115                 120                 125             


Val Leu Leu Ser Asn Val His Val His Leu Ala Lys Gly Ala Gln Val 
    130                 135                 140                 


Tyr Phe Ser Ala Asn Ala Lys Asp Phe Ala Arg Asp Gly Asp Tyr Asp 
145                 150                 155                 160 


Cys Gly Ala Asn Gly Lys Leu Val Leu Ser Arg Trp Gln Gly Asn Asp 
                165                 170                 175     


Cys Leu Asn Phe Ser Pro Met Val Tyr Ala Arg Gly Gln Lys Asn Ile 
            180                 185                 190         


Ala Ile Thr Gly Glu Asp Trp Thr Ser Ile Leu Asn Gly Gln Ala Gly 
        195                 200                 205             


Val Ala Phe Glu Asp Gly Ser Gly Asn Gly Trp Trp Gly Met Asn Pro 
    210                 215                 220                 


Ala Gly Ala Pro Pro Gly Ser Thr Thr His Gln Gly Ala Ala Asn Pro 
225                 230                 235                 240 


Asn Asn Ala Glu Glu Pro Ile Ala Arg Leu Pro Thr Arg His Ala Asn 
                245                 250                 255     


Trp Ser Ala Asp Asp Lys Tyr Leu Pro Leu Leu Ser Glu Ala Gly Val 
            260                 265                 270         


Pro Ala Glu Arg Arg Val Phe Gly Leu Gly His Tyr Leu Arg Pro Ser 
        275                 280                 285             


Met Val Glu Phe Val Asp Cys Gly Asp Val Leu Met Gln Gly Tyr Gln 
    290                 295                 300                 


Val Ile Asn Thr Pro Phe Trp Ile His His Pro Val Asn Ser Arg Asn 
305                 310                 315                 320 


Ile His Phe Ser Lys Val Arg Met Glu Ser Ile Gly Pro Asn Ser Asp 
                325                 330                 335     


Gly Phe Asp Pro Glu Ser Cys Asp Thr Ile Leu Val Asp Gly Cys Leu 
            340                 345                 350         


Phe Asn Thr Gly Asp Asp Cys Ile Ala Ile Lys Ser Gly Lys Asn Arg 
        355                 360                 365             


Asp Ser Gln Tyr Gly Pro Thr Arg Asn Met Val Val Gln Asn Cys Ile 
    370                 375                 380                 


Met Asn Arg Gly His Gly Gly Val Thr Leu Gly Ser Glu Met Ala Gly 
385                 390                 395                 400 


Gly Ile Glu His Ile Tyr Ala Gln Lys Ile Glu Phe Arg Asn Ala Phe 
                405                 410                 415     


Trp Asp His Asp Pro Leu Gly Thr Ala Ile Arg Met Lys Thr Asn Met 
            420                 425                 430         


Asn Arg Gly Gly Tyr Leu Arg His Phe Tyr Val Arg Asp Val Thr Leu 
        435                 440                 445             


Pro Asn Gly Val Arg Thr Lys Ser Gly Phe Tyr Lys Thr Leu Pro Gly 
    450                 455                 460                 


Ser Pro Leu Ala Gly Lys Val Ser Thr Ser Gly Gly Ala Val Ile Thr 
465                 470                 475                 480 


Ile Asp Cys Asp Tyr Ala Pro Asn Asp Asp Ser Val Arg Val Arg Pro 
                485                 490                 495     


Pro Gln Val Ser Asp Val His Ile Ser Asn Val Arg Val Ser Asn Val 
            500                 505                 510         


Lys Thr Ala Glu Gly Ser Phe Ser Cys Tyr Gln Ala Met Val Leu Leu 
        515                 520                 525             


Gly Pro Val Ala Ala Ser Phe Asn Gly Ala Pro Gly Thr Ala Ile Leu 
    530                 535                 540                 


Pro Ile Thr Asn Val Thr Val Ser Asp Ser Asp Phe Gly Thr Pro Arg 
545                 550                 555                 560 


Asn Ser Ala Glu Pro Trp Phe Ala Phe Asn Val Gln Gly Leu Lys Leu 
                565                 570                 575     


Arg Asn Val Arg Ile Asp Gly Lys Glu Tyr Asn Val 
            580                 585             




<210> 519
<211> 2232
<212> DNA
<213> Cochliobolus heterostrophus ATCC 48331

<400> 519
gcaatcggtc ctgattgtac caatggtccc ctgagtacca atgcaatttg cgatgtcaat     60

gcgcctcctc atgagagggc agcggctcta gtcgcagcta tggaaccgca agaaaagcta    120

gataacctcg tcagtaaatc caaaggtgtg tcgagattag gtcttccagc gtataactgg    180

tggggcgaag ctctacacgg tgtagctgga gcgccaggaa tcaaattcgt cgaaccttat    240

aaaaacgcta cttcgtttcc tatgccaatc cttatgtcgg cagcttttga tgatgatctc    300

attttcaaaa ttgccaatat tatcgggaac gaggcccgag ccttcggaaa tggtggagtc    360

gctcctatgg actattggac ccctgacatc aatcccgtcc gcgatatacg atggggccga    420

gccagtgaat cacccggaga ggacattcga cgaataaaag ggtacaccaa ggctctgctt    480

gctggcctcg aaggtgacca agcccaaagg aagatcattg caacatgcaa acactatgtg    540

ggttacgaca tggaagcttg gggaggatac gatcgacaca acttcagtgc aaagatcacc    600

atgcaagacc tcgcagagta ctacatgccg ccattccagc aatgtgcgcg tgactcgaag    660

gtcgggtcat tcatgtgcag ctacaatgca gtcaacggtg ttccaacatg cgctgacacc    720

tacgttcttc aaacaatcct gagggaccac tggaactgga cagatagcaa taactacatt    780

actagcgatt gcgaagccgt tgcggatatc tctgagaacc acaaatatgt cgaaaccctt    840

gcgcaaggca ccgcacttgc ttttgccaag ggtatggatc ttagctgtga atacagtgga    900

tcgtcagata tcccaggagc ttggtcacaa ggtcttctga atctttctgt tatcgacaaa    960

gcattgactc gacaatatga aggcttagtc catgccggct actttgatgg cgcgaaggcg   1020

acttacgcaa acttgagtta taatgacatc aacacacccg aagcacgaca gctatccttg   1080

caagttacct ctgaaggttt ggtcatgcta aagaacgatc acacacttcc attgcctctc   1140

acgaagggat caaaggtggc tatgataggt ttctgggcca acgactcttc caaactccag   1200

ggcatctaca gcggtccacc tccttaccgg cactctccag tattcgctgg tgaacaaatg   1260

ggattagata tggccatagc ctggggccca atgattcaga actcaagtgt gcccgacaac   1320

tggactacca acgcgctcga cgcggccgag aagtccgact atattctcta ctttggtggt   1380

caagactgga cagtggcgca agaaggctac gatcgcacta caatcagttt tcctcaagtg   1440

caaatcgacc ttcttgccaa actggctaaa cttggcaagc cgcttgttgt catcacgctt   1500

ggtgatatga ctgatcactc ccctctcttg tccatggaag gcatcaactc aattatctgg   1560

gcgaattggc ctggccaaga tggcggtcca gcgatactaa acgtgatttc cggtgtgcat   1620

gctcctgcag gtcgtttgcc aataacggaa tacccggcag attatgtcaa gctctctatg   1680

cttgacatga acttgcgacc acatgccgag agccctggcc gtacttatcg ctggttcaat   1740

gagtctgttc agccatttgg cttcggtcta cattacacta cttttgaggc tggttttgct   1800

agcgaagaag gtctaaccta cgatatccag gaaaccttgg atagctgtac acagcagtac   1860

aaggatttgt gtgaggttgc accactggag gtcaccgtgg caaacaaggg taaccgaaca   1920

tcggatttcg tcgctctcgc tttcatcaag ggcgaggttg gacctaagcc atacccacta   1980

aagactctga ttacgtacgg gaggctcaga gatatccatg ggggcgcgaa gaagtcggcg   2040

tcacttccgc ttacacttgg agaattggcc agagtggatc aatcaggcaa caccgttatc   2100

tatcccggcg aatacaccct gctccttgac gagcctactc aggctgagct gaaattgact   2160

attacgggcg aggagacaat tctggacaaa tggccccagc cgccaaacgg aggcaatcgg   2220

accgtgcttt ga                                                       2232

<210> 520
<211> 743
<212> PRT
<213> Cochliobolus heterostrophus ATCC 48331

<220> 
<221> DOMAIN
<222> (47)...(297)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (367)...(594)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (83)...(86)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (255)...(258)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (318)...(321)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (349)...(352)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (400)...(403)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (440)...(443)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (446)...(449)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (588)...(591)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (647)...(650)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (750)...(753)
<223> N-glycosylation site. Prosite id = PS00001

<400> 520
Ala Ile Gly Pro Asp Cys Thr Asn Gly Pro Leu Ser Thr Asn Ala Ile 
1               5                   10                  15      


Cys Asp Val Asn Ala Pro Pro His Glu Arg Ala Ala Ala Leu Val Ala 
            20                  25                  30          


Ala Met Glu Pro Gln Glu Lys Leu Asp Asn Leu Val Ser Lys Ser Lys 
        35                  40                  45              


Gly Val Ser Arg Leu Gly Leu Pro Ala Tyr Asn Trp Trp Gly Glu Ala 
    50                  55                  60                  


Leu His Gly Val Ala Gly Ala Pro Gly Ile Lys Phe Val Glu Pro Tyr 
65                  70                  75                  80  


Lys Asn Ala Thr Ser Phe Pro Met Pro Ile Leu Met Ser Ala Ala Phe 
                85                  90                  95      


Asp Asp Asp Leu Ile Phe Lys Ile Ala Asn Ile Ile Gly Asn Glu Ala 
            100                 105                 110         


Arg Ala Phe Gly Asn Gly Gly Val Ala Pro Met Asp Tyr Trp Thr Pro 
        115                 120                 125             


Asp Ile Asn Pro Val Arg Asp Ile Arg Trp Gly Arg Ala Ser Glu Ser 
    130                 135                 140                 


Pro Gly Glu Asp Ile Arg Arg Ile Lys Gly Tyr Thr Lys Ala Leu Leu 
145                 150                 155                 160 


Ala Gly Leu Glu Gly Asp Gln Ala Gln Arg Lys Ile Ile Ala Thr Cys 
                165                 170                 175     


Lys His Tyr Val Gly Tyr Asp Met Glu Ala Trp Gly Gly Tyr Asp Arg 
            180                 185                 190         


His Asn Phe Ser Ala Lys Ile Thr Met Gln Asp Leu Ala Glu Tyr Tyr 
        195                 200                 205             


Met Pro Pro Phe Gln Gln Cys Ala Arg Asp Ser Lys Val Gly Ser Phe 
    210                 215                 220                 


Met Cys Ser Tyr Asn Ala Val Asn Gly Val Pro Thr Cys Ala Asp Thr 
225                 230                 235                 240 


Tyr Val Leu Gln Thr Ile Leu Arg Asp His Trp Asn Trp Thr Asp Ser 
                245                 250                 255     


Asn Asn Tyr Ile Thr Ser Asp Cys Glu Ala Val Ala Asp Ile Ser Glu 
            260                 265                 270         


Asn His Lys Tyr Val Glu Thr Leu Ala Gln Gly Thr Ala Leu Ala Phe 
        275                 280                 285             


Ala Lys Gly Met Asp Leu Ser Cys Glu Tyr Ser Gly Ser Ser Asp Ile 
    290                 295                 300                 


Pro Gly Ala Trp Ser Gln Gly Leu Leu Asn Leu Ser Val Ile Asp Lys 
305                 310                 315                 320 


Ala Leu Thr Arg Gln Tyr Glu Gly Leu Val His Ala Gly Tyr Phe Asp 
                325                 330                 335     


Gly Ala Lys Ala Thr Tyr Ala Asn Leu Ser Tyr Asn Asp Ile Asn Thr 
            340                 345                 350         


Pro Glu Ala Arg Gln Leu Ser Leu Gln Val Thr Ser Glu Gly Leu Val 
        355                 360                 365             


Met Leu Lys Asn Asp His Thr Leu Pro Leu Pro Leu Thr Lys Gly Ser 
    370                 375                 380                 


Lys Val Ala Met Ile Gly Phe Trp Ala Asn Asp Ser Ser Lys Leu Gln 
385                 390                 395                 400 


Gly Ile Tyr Ser Gly Pro Pro Pro Tyr Arg His Ser Pro Val Phe Ala 
                405                 410                 415     


Gly Glu Gln Met Gly Leu Asp Met Ala Ile Ala Trp Gly Pro Met Ile 
            420                 425                 430         


Gln Asn Ser Ser Val Pro Asp Asn Trp Thr Thr Asn Ala Leu Asp Ala 
        435                 440                 445             


Ala Glu Lys Ser Asp Tyr Ile Leu Tyr Phe Gly Gly Gln Asp Trp Thr 
    450                 455                 460                 


Val Ala Gln Glu Gly Tyr Asp Arg Thr Thr Ile Ser Phe Pro Gln Val 
465                 470                 475                 480 


Gln Ile Asp Leu Leu Ala Lys Leu Ala Lys Leu Gly Lys Pro Leu Val 
                485                 490                 495     


Val Ile Thr Leu Gly Asp Met Thr Asp His Ser Pro Leu Leu Ser Met 
            500                 505                 510         


Glu Gly Ile Asn Ser Ile Ile Trp Ala Asn Trp Pro Gly Gln Asp Gly 
        515                 520                 525             


Gly Pro Ala Ile Leu Asn Val Ile Ser Gly Val His Ala Pro Ala Gly 
    530                 535                 540                 


Arg Leu Pro Ile Thr Glu Tyr Pro Ala Asp Tyr Val Lys Leu Ser Met 
545                 550                 555                 560 


Leu Asp Met Asn Leu Arg Pro His Ala Glu Ser Pro Gly Arg Thr Tyr 
                565                 570                 575     


Arg Trp Phe Asn Glu Ser Val Gln Pro Phe Gly Phe Gly Leu His Tyr 
            580                 585                 590         


Thr Thr Phe Glu Ala Gly Phe Ala Ser Glu Glu Gly Leu Thr Tyr Asp 
        595                 600                 605             


Ile Gln Glu Thr Leu Asp Ser Cys Thr Gln Gln Tyr Lys Asp Leu Cys 
    610                 615                 620                 


Glu Val Ala Pro Leu Glu Val Thr Val Ala Asn Lys Gly Asn Arg Thr 
625                 630                 635                 640 


Ser Asp Phe Val Ala Leu Ala Phe Ile Lys Gly Glu Val Gly Pro Lys 
                645                 650                 655     


Pro Tyr Pro Leu Lys Thr Leu Ile Thr Tyr Gly Arg Leu Arg Asp Ile 
            660                 665                 670         


His Gly Gly Ala Lys Lys Ser Ala Ser Leu Pro Leu Thr Leu Gly Glu 
        675                 680                 685             


Leu Ala Arg Val Asp Gln Ser Gly Asn Thr Val Ile Tyr Pro Gly Glu 
    690                 695                 700                 


Tyr Thr Leu Leu Leu Asp Glu Pro Thr Gln Ala Glu Leu Lys Leu Thr 
705                 710                 715                 720 


Ile Thr Gly Glu Glu Thr Ile Leu Asp Lys Trp Pro Gln Pro Pro Asn 
                725                 730                 735     


Gly Gly Asn Arg Thr Val Leu 
            740             


<210> 521
<211> 2610
<212> DNA
<213> Cochliobolus heterostrophus ATCC 48331

<400> 521
atgctgtggc ttgcacaagc attgttggtc ggccttgccc aggcatcgcc caggttccct     60

cgtgctacca acgacaccgg cagtgattct ttgaacaatg cccagagccc gccattctac    120

ccaagtcctt gggtagatcc caccaccaag gactgggcgg ctgcctatga aaaagcaaag    180

gcttttgtta gccaattgac tcttattgag aaggtcaacc tcaccaccgg cactggatgg    240

cagagcgacc actgcgttgg taacgtgggc gctattcctc gccttggctt tgatcccctc    300

tgcctccagg acagccctct cggcatccgt ttcgcagact acgtttctgc tttcccagca    360

ggtggcacca ttgctgcatc atgggaccgc tatgagtttt acacccgcgg taacgagatg    420

ggtaaggagc accgaaggaa gggagtcgac gttcagcttg gtcctgccat tggacctctt    480

ggtcgccacc ccaagggcgg tcgtaactgg gaaggcttca gtcctgatcc tgtactttcc    540

ggtgtggccg tgagcgaaac agtccgcggt atccaggatg ctggtgtcat tgcctgcact    600

aagcacttcc ttctgaacga gcaagaacat ttccgtcagc ccggcagttt cggagatatc    660

ccctttgtcg atgccatcag ctccaatacc gatgacacga ctctacacga gctctacctg    720

tggccctttg ccgacgccgt ccgcgctggt actggtgcca tcatgtgctc ttacaacaag    780

gccaacaact cgcaactctg ccaaaactcg caccttcaaa actatattct caagggcgag    840

cttggcttcc agggtttcat tgtatctgac tgggatgcac agcactcggg cgttgcgtcg    900

gcttatgctg gattggacat gactatgcct ggtgatactg gattcaacac tggactgtcc    960

ttctggggcg ctaacatgac cgtctccatt ctcaacggca ccattcccca gtggcgtctc   1020

gacgatgcgg ccatccgtat catgaccgca tactactttg tcggccttga tgagtctatc   1080

cctgtcaact ttgacagctg gcaaactagc acgtacggat tcgagcattt tttcggaaag   1140

aagggcttcg gtctgatcaa caagcacatt gacgttcgcg aggagcactt ccgctccatc   1200

cgccgctctg ctgccaagtc aaccgttctc ctcaagaact ctggcgtcct tcccctctct   1260

ggaaaggaga agtggactgc tgtatttgga gaagatgctg gcgaaaaccc gctgggcccc   1320

aacggatgcg ctgaccgcgg ctgcgactct ggcaccttgg ccatgggctg gggttcggga   1380

actgcagact tcccttacct cgtcactcct ctcgaagcca tcaagcgtga ggttggcgag   1440

aatggcggcg tgatcacttc ggtcacagac aactacgcca cttcgcagat ccagaccatg   1500

gccagcaggg ccagccactc gattgtcttc gtcaatgccg actctggtga aggttacatc   1560

actgttgata acaacatggg tgaccgcaac aacatgactg tgtggggcaa tggtgatgtg   1620

cttgtcaaga atatctctgc tctgtgcaac aacacgattg tggttatcca ctctgtcggc   1680

ccagtcatta ttgacgcctg gaaggccaac gacaacgtga ctgccattct ctgggctggt   1740

cttcctggcc aggagtctgg taactcgatt gctgacattc tatacggaca ccacaaccct   1800

ggtggcaagc tccccttcac cattggcagc tcttcagagg agtatggccc tgatgtcatc   1860

tacgagccca cgaacggcat cctcagccct caggccaact ttgaagaggg cgtcttcatt   1920

gactaccgcg cgtttgacaa ggcgggcatt gagcccacgt acgaatttgg ctttggtctt   1980

tcgtacacga cttttgaata ctcggacctc aaggtcactg cgcagtctgc cgaggcttac   2040

aagcctttca ccggccagac ttcggctgcc cctacattcg gaaacttcag caagaacccc   2100

gaggactacc agtaccctcc cggccttgtt taccccgaca cgttcatcta cccctacctc   2160

aactcgactg acctcaagac ggcatctcag gatcccgagt acggcctcaa cgttacctgg   2220

cccaagggct ctaccgatgg ctcgcctcag acccgcattg cggctggtgg tgcgcccggc   2280

ggtaaccccc agctctggga cgttttgttc aaggtcgagg ccacgatcac caacactggt   2340

cacgttgctg gtgacgaggt ggcccaggcg tacatctcgc ttggtggccc caacgacccc   2400

aaggtgctac tccgtgactt tgaccgcttg accatcaagc ctggtgagag cgctgttttc   2460

acagccaaca tcacccgccg tgatgtcagc aactgggaca ctgtcagcca gaactgggtc   2520

attaccgagt accccaagac gatccacgtt ggtgccagtt cgaggaacct tcctctttct   2580

gccccactgg acactagcag ctttagataa                                    2610

<210> 522
<211> 869
<212> PRT
<213> Cochliobolus heterostrophus ATCC 48331

<220> 
<221> DOMAIN
<222> (89)...(310)
<223> Glycosyl hydrolase family 3 N terminal domain

<220> 
<221> DOMAIN
<222> (408)...(642)
<223> Glycosyl hydrolase family 3 C terminal domain

<220> 
<221> SITE
<222> (24)...(27)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (74)...(77)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (266)...(269)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (280)...(297)
<223> Glycosyl hydrolases family 3 active site. Prosite id = PS00775

<220> 
<221> SITE
<222> (337)...(340)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (539)...(542)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (552)...(555)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (558)...(561)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (580)...(583)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (705)...(708)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (732)...(735)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (748)...(751)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (835)...(838)
<223> N-glycosylation site. Prosite id = PS00001

<400> 522
Met Leu Trp Leu Ala Gln Ala Leu Leu Val Gly Leu Ala Gln Ala Ser 
1               5                   10                  15      


Pro Arg Phe Pro Arg Ala Thr Asn Asp Thr Gly Ser Asp Ser Leu Asn 
            20                  25                  30          


Asn Ala Gln Ser Pro Pro Phe Tyr Pro Ser Pro Trp Val Asp Pro Thr 
        35                  40                  45              


Thr Lys Asp Trp Ala Ala Ala Tyr Glu Lys Ala Lys Ala Phe Val Ser 
    50                  55                  60                  


Gln Leu Thr Leu Ile Glu Lys Val Asn Leu Thr Thr Gly Thr Gly Trp 
65                  70                  75                  80  


Gln Ser Asp His Cys Val Gly Asn Val Gly Ala Ile Pro Arg Leu Gly 
                85                  90                  95      


Phe Asp Pro Leu Cys Leu Gln Asp Ser Pro Leu Gly Ile Arg Phe Ala 
            100                 105                 110         


Asp Tyr Val Ser Ala Phe Pro Ala Gly Gly Thr Ile Ala Ala Ser Trp 
        115                 120                 125             


Asp Arg Tyr Glu Phe Tyr Thr Arg Gly Asn Glu Met Gly Lys Glu His 
    130                 135                 140                 


Arg Arg Lys Gly Val Asp Val Gln Leu Gly Pro Ala Ile Gly Pro Leu 
145                 150                 155                 160 


Gly Arg His Pro Lys Gly Gly Arg Asn Trp Glu Gly Phe Ser Pro Asp 
                165                 170                 175     


Pro Val Leu Ser Gly Val Ala Val Ser Glu Thr Val Arg Gly Ile Gln 
            180                 185                 190         


Asp Ala Gly Val Ile Ala Cys Thr Lys His Phe Leu Leu Asn Glu Gln 
        195                 200                 205             


Glu His Phe Arg Gln Pro Gly Ser Phe Gly Asp Ile Pro Phe Val Asp 
    210                 215                 220                 


Ala Ile Ser Ser Asn Thr Asp Asp Thr Thr Leu His Glu Leu Tyr Leu 
225                 230                 235                 240 


Trp Pro Phe Ala Asp Ala Val Arg Ala Gly Thr Gly Ala Ile Met Cys 
                245                 250                 255     


Ser Tyr Asn Lys Ala Asn Asn Ser Gln Leu Cys Gln Asn Ser His Leu 
            260                 265                 270         


Gln Asn Tyr Ile Leu Lys Gly Glu Leu Gly Phe Gln Gly Phe Ile Val 
        275                 280                 285             


Ser Asp Trp Asp Ala Gln His Ser Gly Val Ala Ser Ala Tyr Ala Gly 
    290                 295                 300                 


Leu Asp Met Thr Met Pro Gly Asp Thr Gly Phe Asn Thr Gly Leu Ser 
305                 310                 315                 320 


Phe Trp Gly Ala Asn Met Thr Val Ser Ile Leu Asn Gly Thr Ile Pro 
                325                 330                 335     


Gln Trp Arg Leu Asp Asp Ala Ala Ile Arg Ile Met Thr Ala Tyr Tyr 
            340                 345                 350         


Phe Val Gly Leu Asp Glu Ser Ile Pro Val Asn Phe Asp Ser Trp Gln 
        355                 360                 365             


Thr Ser Thr Tyr Gly Phe Glu His Phe Phe Gly Lys Lys Gly Phe Gly 
    370                 375                 380                 


Leu Ile Asn Lys His Ile Asp Val Arg Glu Glu His Phe Arg Ser Ile 
385                 390                 395                 400 


Arg Arg Ser Ala Ala Lys Ser Thr Val Leu Leu Lys Asn Ser Gly Val 
                405                 410                 415     


Leu Pro Leu Ser Gly Lys Glu Lys Trp Thr Ala Val Phe Gly Glu Asp 
            420                 425                 430         


Ala Gly Glu Asn Pro Leu Gly Pro Asn Gly Cys Ala Asp Arg Gly Cys 
        435                 440                 445             


Asp Ser Gly Thr Leu Ala Met Gly Trp Gly Ser Gly Thr Ala Asp Phe 
    450                 455                 460                 


Pro Tyr Leu Val Thr Pro Leu Glu Ala Ile Lys Arg Glu Val Gly Glu 
465                 470                 475                 480 


Asn Gly Gly Val Ile Thr Ser Val Thr Asp Asn Tyr Ala Thr Ser Gln 
                485                 490                 495     


Ile Gln Thr Met Ala Ser Arg Ala Ser His Ser Ile Val Phe Val Asn 
            500                 505                 510         


Ala Asp Ser Gly Glu Gly Tyr Ile Thr Val Asp Asn Asn Met Gly Asp 
        515                 520                 525             


Arg Asn Asn Met Thr Val Trp Gly Asn Gly Asp Val Leu Val Lys Asn 
    530                 535                 540                 


Ile Ser Ala Leu Cys Asn Asn Thr Ile Val Val Ile His Ser Val Gly 
545                 550                 555                 560 


Pro Val Ile Ile Asp Ala Trp Lys Ala Asn Asp Asn Val Thr Ala Ile 
                565                 570                 575     


Leu Trp Ala Gly Leu Pro Gly Gln Glu Ser Gly Asn Ser Ile Ala Asp 
            580                 585                 590         


Ile Leu Tyr Gly His His Asn Pro Gly Gly Lys Leu Pro Phe Thr Ile 
        595                 600                 605             


Gly Ser Ser Ser Glu Glu Tyr Gly Pro Asp Val Ile Tyr Glu Pro Thr 
    610                 615                 620                 


Asn Gly Ile Leu Ser Pro Gln Ala Asn Phe Glu Glu Gly Val Phe Ile 
625                 630                 635                 640 


Asp Tyr Arg Ala Phe Asp Lys Ala Gly Ile Glu Pro Thr Tyr Glu Phe 
                645                 650                 655     


Gly Phe Gly Leu Ser Tyr Thr Thr Phe Glu Tyr Ser Asp Leu Lys Val 
            660                 665                 670         


Thr Ala Gln Ser Ala Glu Ala Tyr Lys Pro Phe Thr Gly Gln Thr Ser 
        675                 680                 685             


Ala Ala Pro Thr Phe Gly Asn Phe Ser Lys Asn Pro Glu Asp Tyr Gln 
    690                 695                 700                 


Tyr Pro Pro Gly Leu Val Tyr Pro Asp Thr Phe Ile Tyr Pro Tyr Leu 
705                 710                 715                 720 


Asn Ser Thr Asp Leu Lys Thr Ala Ser Gln Asp Pro Glu Tyr Gly Leu 
                725                 730                 735     


Asn Val Thr Trp Pro Lys Gly Ser Thr Asp Gly Ser Pro Gln Thr Arg 
            740                 745                 750         


Ile Ala Ala Gly Gly Ala Pro Gly Gly Asn Pro Gln Leu Trp Asp Val 
        755                 760                 765             


Leu Phe Lys Val Glu Ala Thr Ile Thr Asn Thr Gly His Val Ala Gly 
    770                 775                 780                 


Asp Glu Val Ala Gln Ala Tyr Ile Ser Leu Gly Gly Pro Asn Asp Pro 
785                 790                 795                 800 


Lys Val Leu Leu Arg Asp Phe Asp Arg Leu Thr Ile Lys Pro Gly Glu 
                805                 810                 815     


Ser Ala Val Phe Thr Ala Asn Ile Thr Arg Arg Asp Val Ser Asn Trp 
            820                 825                 830         


Asp Thr Val Ser Gln Asn Trp Val Ile Thr Glu Tyr Pro Lys Thr Ile 
        835                 840                 845             


His Val Gly Ala Ser Ser Arg Asn Leu Pro Leu Ser Ala Pro Leu Asp 
    850                 855                 860                 


Thr Ser Ser Phe Arg 
865            

<210> 523
<211> 642
<212> DNA
<213> Unknown

<220> 
<223> Obtained from environmental sample

<400> 523
atgtttatgt taagtaagaa aattttgatg gtgttattaa caatttcaat gagttttatt     60

agcttattta cagtaaccgc gtatgcagct tcgacagact actggcaaaa ttggactgat    120

ggtggtggga cagtaaatgc taccaatgga tctgatggca attacagtgt ttcatggaca    180

aattgcggga attttgttgt cggtaaaggc tggactaccg gatcagcatc tagggtaata    240

aactacaatg ccggcgcctt ttcgccgtcc ggtaatgggt atttggctct ctatgggtgg    300

acgagaaact cactcataga atattacgtt gttgatagct ggggtactta tagacctact    360

ggaacttata agggcactgt gactagtgat ggggggacat atgacatata tacgactaca    420

cgaactaacg caccttccat tgacggaact gcaactttta cccagttctg gagtgtaagg    480

cagtcgaaga gaccgaccgg taccaacaac accattactt ttagcaacca cgttaacgca    540

tggaagagta aagggatgaa tttggggagt agttggtctt atcaggtatt agcgacagag    600

ggatatcaaa gtagtgggta ctctaatgta acggtctggt aa                       642

<210> 524
<211> 213
<212> PRT
<213> Unknown

<220> 
<223> Obtained from environmental sample

<220> 
<221> SIGNAL
<222> (1)...(29)

<220> 
<221> DOMAIN
<222> (29)...(212)
<223> Glycosyl hydrolases family 11

<220> 
<221> SITE
<222> (37)...(40)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (46)...(49)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (54)...(57)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (105)...(115)
<223> Glycosyl hydrolases family 11 active site signature 1. Prosite id = PS00776

<220> 
<221> SITE
<222> (171)...(174)
<223> N-glycosylation site. Prosite id = PS00001

<220> 
<221> SITE
<222> (200)...(211)
<223> Glycosyl hydrolases family 11 active site signature 2. Prosite id = PS00777

<220> 
<221> SITE
<222> (212)...(215)
<223> N-glycosylation site. Prosite id = PS00001

<400> 524
Met Phe Met Leu Ser Lys Lys Ile Leu Met Val Leu Leu Thr Ile Ser 
1               5                   10                  15      


Met Ser Phe Ile Ser Leu Phe Thr Val Thr Ala Tyr Ala Ala Ser Thr 
            20                  25                  30          


Asp Tyr Trp Gln Asn Trp Thr Asp Gly Gly Gly Thr Val Asn Ala Thr 
        35                  40                  45              


Asn Gly Ser Asp Gly Asn Tyr Ser Val Ser Trp Thr Asn Cys Gly Asn 
    50                  55                  60                  


Phe Val Val Gly Lys Gly Trp Thr Thr Gly Ser Ala Ser Arg Val Ile 
65                  70                  75                  80  


Asn Tyr Asn Ala Gly Ala Phe Ser Pro Ser Gly Asn Gly Tyr Leu Ala 
                85                  90                  95      


Leu Tyr Gly Trp Thr Arg Asn Ser Leu Ile Glu Tyr Tyr Val Val Asp 
            100                 105                 110         


Ser Trp Gly Thr Tyr Arg Pro Thr Gly Thr Tyr Lys Gly Thr Val Thr 
        115                 120                 125             


Ser Asp Gly Gly Thr Tyr Asp Ile Tyr Thr Thr Thr Arg Thr Asn Ala 
    130                 135                 140                 


Pro Ser Ile Asp Gly Thr Ala Thr Phe Thr Gln Phe Trp Ser Val Arg 
145                 150                 155                 160 


Gln Ser Lys Arg Pro Thr Gly Thr Asn Asn Thr Ile Thr Phe Ser Asn 
                165                 170                 175     


His Val Asn Ala Trp Lys Ser Lys Gly Met Asn Leu Gly Ser Ser Trp 
            180                 185                 190         


Ser Tyr Gln Val Leu Ala Thr Glu Gly Tyr Gln Ser Ser Gly Tyr Ser 
        195                 200                 205             


Asn Val Thr Val Trp 
    210             



